Search query: Institution = "Center for Language and Speech Processing and Computer Science"
828 records; showing results 81-90

CASC-XVC: Zero-Shot Cross-Lingual Voice Conversion with Content Accordant and Speaker Contrastive Losses
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Han-Jie Guo, Hui-Peng Du, Zheng-Yan Sheng, Li-Ping Chen, Yang Ai, Zhen-Hua Ling
Affiliation: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, P.R. China
Cross-lingual voice conversion (XVC) is a technology that modifies speaker identity while preserving linguistic content in scenarios where the source and target speakers use different languages. Previous non-parallel ...

Recursive Feature Learning from Pre-Trained Models for Spoofing Speech Detection
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Yu Guan, Yang Ai, Zuoliang Li, Shengyu Peng, Wu Guo
Affiliation: National Engineering Research Center for Speech and Language Information Processing (NERC-SLIP), University of Science and Technology of China (USTC), Hefei, China
It was recently revealed that using features extracted from pre-trained models can achieve much better performance than using conventional hand-crafted acoustic features for spoofing speech detection. In this paper, w...

A Machine Learning Approach for MIDI to Guitar Tablature Conversion
19th Sound and Music Computing Conference, SMC 2022
Authors: Kaliakatsos-Papakostas, Maximos; Bastas, Grigoris; Makris, Dimos; Herremans, Dorrien; Katsouros, Vassilis; Maragos, Petros
Affiliations: Institute for Language and Speech Processing, Athena R.C., Athens, Greece; School of Electrical and Computer Engineering, NTUA, Athens, Greece; Department of Computer Science and Design Pillar, SUTD, Singapore
Guitar tablature transcription consists in deducing the string and the fret number on which each note should be played to reproduce the actual musical part. This assignment should lead to playable string-fret combinat...

A Fresh Review on Chinese Pronunciation Acquisition: Insights and Recommendations for L2 Foreign Children
International Symposium on Chinese Spoken Language Processing
Authors: Mewlude Nijat, Dong Wang, Askar Hamdulla
Affiliations: School of Computer Science and Technology, Xinjiang University; Center for Speech and Language Technologies, BNRist, Tsinghua University
This review paper offers a brief summary of recent research on Chinese pronunciation acquisition, with a particular focus on children learning Chinese as a second language (L2). After a concise introduction to the Ch...

BS-PLCNet: Band-Split Packet Loss Concealment Network with Multi-Task Learning Framework and Multi-Discriminators
IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)
Authors: Zihan Zhang, Jiayao Sun, Xianjun Xia, Chuanzeng Huang, Yijian Xiao, Lei Xie
Affiliations: Audio Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University, Xi'an, China; ByteDance, China
Packet loss is a common and unavoidable problem in voice over internet phone (VoIP) systems. To deal with the problem, we propose a band-split packet loss concealment network (BS-PLCNet). Specifically, we split the fu...

A Study of Multi-Scale Feature Learning From Pre-Trained Models on Speaker Verification
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Shengyu Peng, Wu Guo, Jie Zhang, Zuoliang Li, Yu Guan, Bin Gu, Yang Ai
Affiliation: National Engineering Research Center for Speech and Language Information Processing (NERC-SLIP), University of Science and Technology of China (USTC), Hefei, China
In this paper, a multi-scale feature fusion paradigm is proposed to fully exploit the power of the pre-trained models for text-independent speaker verification. It contains a front-end feature extractor and an enhance...

DUALSEP: A LIGHT-WEIGHT DUAL-ENCODER CONVOLUTIONAL RECURRENT NETWORK FOR REAL-TIME IN-CAR SPEECH SEPARATION
arXiv, 2024
Authors: Wang, Ziqian; Sun, Jiayao; Zhang, Zihan; Li, Xingchen; Liu, Jie; Xie, Lei
Affiliations: Audio Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University, Xi'an, China; Huawei Cloud
Advancements in deep learning and voice-activated technologies have driven the development of human-vehicle interaction. Distributed microphone arrays are widely used in in-car scenarios because they can accurately ca...

KhmerFormer: Multi-Scale CNNs-Transformer with External Attention for Ancient Khmer Palm Leaf Isolated Glyph Classification
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)
Authors: Nimol Thuon, Jun Du
Affiliation: National Engineering Research Center of Speech and Language Information Processing (NERC-SLIP), University of Science and Technology of China, Hefei, China
Ancient Khmer palm leaf manuscripts are invaluable cultural artifacts in Southeast Asia, especially in Cambodia. The preservation and study of these manuscripts are hindered by their complex glyph structures and the s...

A Composite Predictive-Generative Approach to Monaural Universal Speech Enhancement
IEEE Transactions on Audio, Speech and Language Processing, 2025, vol. 33, pp. 2312-2325
Authors: Jie Zhang, Haoyin Yan, Xiaofei Li
Affiliations: National Engineering Research Center for Speech and Language Information Processing (NERC-SLIP), University of Science and Technology of China (USTC), Hefei, China; School of Engineering, Westlake University, Hangzhou, China
It is promising to design a single model that can suppress various distortions and improve speech quality, i.e., universal speech enhancement (USE). Compared to supervised learning-based predictive methods, diffusion-...

MUSA: Multi-Lingual Speaker Anonymization via Serial Disentanglement
IEEE Transactions on Audio, Speech and Language Processing, 2025, vol. 33, pp. 1664-1674
Authors: Jixun Yao, Qing Wang, Pengcheng Guo, Ziqian Ning, Yuguang Yang, Yu Pan, Lei Xie
Affiliations: Audio Speech and Language Processing Group, School of Computer Science, Northwestern Polytechnical University, Xi'an, Shaanxi, China; Department of Electronic & Computer Engineering, Hong Kong University of Science and Technology, Hong Kong SAR, China; Kyushu University, Fukuoka, Japan
Speaker anonymization is an effective privacy protection solution designed to conceal the speaker's identity while preserving the linguistic content and para-linguistic information of the original speech. While mo...