Search query: "Institution=Center for Language and Speech Processing and Computer Science"
831 records; showing results 91-100
DUALSEP: A LIGHT-WEIGHT DUAL-ENCODER CONVOLUTIONAL RECURRENT NETWORK FOR REAL-TIME IN-CAR SPEECH SEPARATION
arXiv, 2024
Authors: Wang, Ziqian; Sun, Jiayao; Zhang, Zihan; Li, Xingchen; Liu, Jie; Xie, Lei
Affiliations: Audio Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University, Xi’an, China; Huawei Cloud
Advancements in deep learning and voice-activated technologies have driven the development of human-vehicle interaction. Distributed microphone arrays are widely used in in-car scenarios because they can accurately ca...

A Composite Predictive-Generative Approach to Monaural Universal Speech Enhancement
IEEE Transactions on Audio, Speech and Language Processing, 2025, vol. 33, pp. 2312-2325
Authors: Jie Zhang; Haoyin Yan; Xiaofei Li
Affiliations: National Engineering Research Center for Speech and Language Information Processing (NERC-SLIP), University of Science and Technology of China (USTC), Hefei, China; School of Engineering, Westlake University, Hangzhou, China
It is promising to design a single model that can suppress various distortions and improve speech quality, i.e., universal speech enhancement (USE). Compared to supervised learning-based predictive methods, diffusion-...

Dualsep: A Light-Weight Dual-Encoder Convolutional Recurrent Network For Real-Time In-Car Speech Separation
IEEE Spoken Language Technology Workshop
Authors: Ziqian Wang; Jiayao Sun; Zihan Zhang; Xingchen Li; Jie Liu; Lei Xie
Affiliations: Audio Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University, Xi’an, China; Huawei Cloud
Advancements in deep learning and voice-activated technologies have driven the development of human-vehicle interaction. Distributed microphone arrays are widely used in in-car scenarios because they can accurately cap...

KhmerFormer: Multi-Scale CNNs-Transformer with External Attention for Ancient Khmer Palm Leaf Isolated Glyph Classification
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)
Authors: Nimol Thuon; Jun Du
Affiliations: National Engineering Research Center of Speech and Language Information Processing (NERC-SLIP), University of Science and Technology of China, Hefei, China
Ancient Khmer palm leaf manuscripts are invaluable cultural artifacts in Southeast Asia, especially in Cambodia. The preservation and study of these manuscripts are hindered by their complex glyph structures and the s...

MUSA: Multi-Lingual Speaker Anonymization via Serial Disentanglement
IEEE Transactions on Audio, Speech and Language Processing, 2025, vol. 33, pp. 1664-1674
Authors: Jixun Yao; Qing Wang; Pengcheng Guo; Ziqian Ning; Yuguang Yang; Yu Pan; Lei Xie
Affiliations: Audio Speech and Language Processing Group, School of Computer Science, Northwestern Polytechnical University, Xi'an, Shaanxi, China; Department of Electronic & Computer Engineering, Hong Kong University of Science and Technology, Hong Kong SAR, China; Kyushu University, Fukuoka, Japan
Speaker anonymization is an effective privacy protection solution designed to conceal the speaker's identity while preserving the linguistic content and para-linguistic information of the original speech. While mo...

On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers
26th Conference on Computational Natural Language Learning, CoNLL 2022, collocated and co-organized with EMNLP 2022
Authors: Samardžić, Tanja; Gutierrez-Vasque, Ximena; Van Der Goot, Rob; Müller-Eberstein, Max; Pelloni, Olga; Plank, Barbara
Affiliations: Text Group, URPP Language and Space, University of Zurich, Switzerland; Department of Computer Science, IT University of Copenhagen, Denmark; Center for Information and Language Processing, LMU Munich, Germany
Cross-lingual transfer of parsing models has been shown to work well for several closely related languages, but predicting the success in other cases remains hard. Our study is a comprehensive analysis of the impact of...

LONGEMBED: Extending Embedding Models for Long Context Retrieval
2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024
Authors: Zhu, Dawei; Wang, Liang; Yang, Nan; Song, Yifan; Wu, Wenhao; Wei, Furu; Li, Sujian
Affiliations: School of Computer Science, Peking University, China; National Key Laboratory for Multimedia Information Processing, Peking University, China; Jiangsu Collaborative Innovation Center for Language Ability, Jiangsu Normal University, China; Microsoft Corporation, United States
Embedding models play a pivotal role in modern NLP applications such as document retrieval. However, existing embedding models are limited to encoding short documents of typically 512 tokens, restrained from applicati...

Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Ye-Xin Lu; Hui-Peng Du; Zheng-Yan Sheng; Yang Ai; Zhen-Hua Ling
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, P. R. China
This paper proposes an Incremental Disentanglement-based Environment-Aware zero-shot text-to-speech (TTS) method, dubbed IDEA-TTS, that can synthesize speech for unseen speakers while preserving the acoustic character...

Towards robust one-shot voice conversion with cycle phonetic posteriorgrams and multi-scale speaker representations
24th International Congress on Acoustics, ICA 2022
Authors: Chen, Yannian; Liu, Lijuan; Hu, Yajun; Ling, Zhenhua
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, China; IFLYTEK Research, IFLYTEK Co. Ltd., China
One-shot voice conversion (VC) aims to convert the voice across arbitrary speakers, even those unseen during training, with only one reference utterance from the target speaker. It is still a challenging task as both content...

Bitext Mining for Low-Resource Languages via Contrastive Learning
arXiv, 2022
Authors: Tan, Weiting; Koehn, Philipp
Affiliations: Center for Language and Speech Processing, Computer Science Department, Johns Hopkins University, United States
Mining high-quality bitexts for low-resource languages is challenging. This paper shows that sentence representation of language models fine-tuned with multiple negatives ranking loss, a contrastive objective, helps r...