咨询与建议

限定检索结果

文献类型

  • 549 篇 期刊文献
  • 369 篇 会议

馆藏范围

  • 918 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 595 篇 工学
    • 468 篇 计算机科学与技术...
    • 410 篇 软件工程
    • 117 篇 信息与通信工程
    • 97 篇 生物工程
    • 54 篇 光学工程
    • 52 篇 生物医学工程(可授...
    • 51 篇 控制科学与工程
    • 41 篇 电子科学与技术(可...
    • 39 篇 电气工程
    • 33 篇 化学工程与技术
    • 18 篇 机械工程
    • 16 篇 动力工程及工程热...
    • 14 篇 仪器科学与技术
    • 14 篇 土木工程
    • 11 篇 材料科学与工程(可...
    • 11 篇 建筑学
  • 394 篇 理学
    • 173 篇 物理学
    • 152 篇 数学
    • 114 篇 生物学
    • 48 篇 统计学(可授理学、...
    • 34 篇 化学
    • 26 篇 系统科学
  • 140 篇 管理学
    • 88 篇 图书情报与档案管...
    • 54 篇 管理科学与工程(可...
    • 23 篇 工商管理
  • 38 篇 医学
    • 37 篇 临床医学
    • 29 篇 基础医学(可授医学...
    • 21 篇 药学(可授医学、理...
  • 20 篇 法学
    • 20 篇 社会学
  • 8 篇 经济学
    • 8 篇 应用经济学
  • 6 篇 农学
  • 1 篇 教育学
  • 1 篇 文学

主题

  • 31 篇 speech recogniti...
  • 31 篇 semantics
  • 27 篇 training
  • 26 篇 machine learning
  • 18 篇 signal processin...
  • 17 篇 computational mo...
  • 16 篇 speech enhanceme...
  • 16 篇 embeddings
  • 14 篇 reinforcement le...
  • 14 篇 deep learning
  • 14 篇 decoding
  • 13 篇 object detection
  • 13 篇 speech processin...
  • 13 篇 computational li...
  • 12 篇 acoustics
  • 12 篇 feature extracti...
  • 11 篇 syntactics
  • 11 篇 robustness
  • 11 篇 adaptation model...
  • 10 篇 self-supervised ...

机构

  • 171 篇 moe key lab of a...
  • 155 篇 department of co...
  • 61 篇 key laboratory o...
  • 53 篇 moe key lab of a...
  • 44 篇 department of co...
  • 32 篇 department of co...
  • 30 篇 moe key lab of a...
  • 28 篇 department of co...
  • 28 篇 x-lance lab depa...
  • 23 篇 suzhou laborator...
  • 22 篇 x-lance lab depa...
  • 22 篇 tencent ai lab
  • 20 篇 shanghai jiao to...
  • 17 篇 key lab. of shan...
  • 15 篇 aispeech co. ltd...
  • 15 篇 ji hua laborator...
  • 15 篇 shanghai jiao to...
  • 15 篇 research center ...
  • 13 篇 moe key lab of a...
  • 12 篇 department of co...

作者

  • 121 篇 yu kai
  • 97 篇 zhao hai
  • 73 篇 yan junchi
  • 64 篇 chen lu
  • 59 篇 qian yanmin
  • 41 篇 zhang zhuosheng
  • 40 篇 yanmin qian
  • 36 篇 chen xie
  • 35 篇 junchi yan
  • 35 篇 yang xiaokang
  • 33 篇 li zuchao
  • 32 篇 wu mengyue
  • 29 篇 zhu su
  • 25 篇 niu li
  • 22 篇 guo yiwei
  • 22 篇 kai yu
  • 22 篇 zhang liqing
  • 20 篇 gao xiaofeng
  • 18 篇 cao ruisheng
  • 18 篇 chen zhengyang

语言

  • 839 篇 英文
  • 75 篇 其他
  • 5 篇 中文
检索条件"机构=Department of Computer Science and Engineering & MoE Key Lab of AI"
918 条 记 录,以下是11-20 订阅
排序:
FAT-HuBERT: Front-End Adaptive Training of Hidden-Unit BERT For Distortion-Invariant Robust Speech Recognition
FAT-HuBERT: Front-End Adaptive Training of Hidden-Unit BERT ...
收藏 引用
2023 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023
作者: Yang, Dongning Wang, Wei Qian, Yanmin Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence Ai Institute Department of Computer Science and Engineering Shanghai China
Advancements in monaural speech enhancement (SE) techniques have greatly improved the perceptual quality of speech. However, integrating these techniques into automatic speech recognition (ASR) systems has not yielded... 详细信息
来源: 评论
Converging to a Lingua Franca: Evolution of Linguistic Regions and Semantics Alignment in Multilingual Large Language Models  31
Converging to a Lingua Franca: Evolution of Linguistic Regio...
收藏 引用
31st International Conference on Computational Linguistics, COLING 2025
作者: Zeng, Hongchuan Han, Senyu Chen, Lu Yu, Kai X-LANCE Lab Department of Computer Science and Engineering MoE Key Lab of Artificial Intelligence SJTU AI Institute Shanghai Jiao Tong University Shanghai China Suzhou Laboratory Suzhou China
Large language models (LLMs) have demonstrated remarkable performance, particularly in multilingual contexts. While recent studies suggest that LLMs can transfer skills learned in one language to others, the internal ... 详细信息
来源: 评论
From Generalist to Specialist: A Survey of Large Language Models for Chemistry  31
From Generalist to Specialist: A Survey of Large Language Mo...
收藏 引用
31st International Conference on Computational Linguistics, COLING 2025
作者: Han, Yang Wan, Ziping Chen, Lu Yu, Kai Chen, Xin X-LANCE Lab Department of Computer Science and Engineering MoE Key Lab of Artificial Intelligence SJTU AI Institute Shanghai Jiao Tong University Shanghai China Suzhou Laboratory Suzhou China
Large Language Models (LLMs) have significantly transformed our daily life and established a new paradigm in natural language processing (NLP). However, the predominant pretraining of LLMs on extensive web-based texts... 详细信息
来源: 评论
Fast-Hubert: an Efficient Training Framework for Self-Supervised Speech Representation Learning
Fast-Hubert: an Efficient Training Framework for Self-Superv...
收藏 引用
2023 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023
作者: Yang, Guanrou Ma, Ziyang Zheng, Zhisheng Song, Yakun Niu, Zhikang Chen, Xie Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence Ai Institute X-LANCE Lab Department of Computer Science and Engineering China
Recent years have witnessed significant advancements in self-supervised learning (SSL) methods for speech-processing tasks. Various speech-based SSL models have been developed and present promising performance on a ra... 详细信息
来源: 评论
Predictive Skim: Contrastive Predictive Coding for Low-Latency Online Speech Separation  48
Predictive Skim: Contrastive Predictive Coding for Low-Laten...
收藏 引用
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
作者: Li, Chenda Wu, Yifei Qian, Yanmin Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence AI Institute X-LANCE Lab Department of Computer Science and Engineering China
In online speech separation, there is a trade-off between inherent latency and speech separation performance. When processing the current input audio, looking ahead to more future context usually brings better speech ... 详细信息
来源: 评论
Advanced Zero-Shot Text-to-Speech for Background Removal and Preservation with Controllable Masked Speech Prediction
Advanced Zero-Shot Text-to-Speech for Background Removal and...
收藏 引用
2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
作者: Zhang, Leying Zhang, Wangyou Chen, Zhengyang Qian, Yanmin Auditory Cognition and Computational Acoustics Lab MoE Key Lab of Artificial Intelligence AI Institute Department of Computer Science and Engineering Shanghai Jiao Tong University Shanghai China
The acoustic background plays a crucial role in natural conversation. It provides context and helps listeners understand the environment, but a strong background makes it difficult for listeners to understand spoken w... 详细信息
来源: 评论
Advancing Non-intrusive Suppression on Enhancement Distortion for Noise Robust ASR
Advancing Non-intrusive Suppression on Enhancement Distortio...
收藏 引用
2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
作者: Wang, Wei Zhao, Siyi Qian, Yanmin Auditory Cognition and Computational Acoustics Lab MoE Key Lab of Artificial Intelligence AI Institute Department of Computer Science and Engineering Shanghai Jiao Tong University Shanghai China
Recent advancements in speech enhancement (SE) techniques have greatly improved speech clarity and intelligibility in challenging acoustic environments. However, integrating SE into automatic speech recognition (ASR) ... 详细信息
来源: 评论
Exploring Time-Frequency Domain Target Speaker Extraction For Causal and Non-Causal Processing
Exploring Time-Frequency Domain Target Speaker Extraction Fo...
收藏 引用
2023 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023
作者: Zhang, Wangyou Yang, Lei Qian, Yanmin Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence Ai Institute Department of Computer Science and Engineering Shanghai China China
In recent years, target speaker extraction (TSE) has drawn increasing interest as an alternative to speech separation in realistic applications. While time-domain methods have been widely used in recent studies to ach... 详细信息
来源: 评论
Robust Audio-Visual ASR with Unified Cross-Modal Attention  48
Robust Audio-Visual ASR with Unified Cross-Modal Attention
收藏 引用
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
作者: Li, Jiahong Li, Chenda Wu, Yifei Qian, Yanmin Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence AI Institute X-LANCE Lab Department of Computer Science and Engineering Shanghai China
Audio-visual speech recognition (AVSR) takes advantage of noise-invariant visual information to improve the robustness of automatic speech recognition (ASR) systems. While previous works mainly focused on the clean co... 详细信息
来源: 评论
Emodiff: Intensity Controllable Emotional Text-to-Speech with Soft-label Guidance  48
Emodiff: Intensity Controllable Emotional Text-to-Speech wit...
收藏 引用
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
作者: Guo, Yiwei Du, Chenpeng Chen, Xie Yu, Kai Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence Ai Institute X-LANCE Lab Department of Computer Science and Engineering Shanghai China
Although current neural text-to-speech (TTS) models are able to generate high-quality speech, intensity controllable emotional TTS is still a challenging task. Most existing methods need external optimizations for int... 详细信息
来源: 评论