咨询与建议

限定检索结果

文献类型

  • 292 篇 会议
  • 145 篇 期刊文献
  • 3 册 图书

馆藏范围

  • 440 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 296 篇 工学
    • 220 篇 计算机科学与技术...
    • 189 篇 软件工程
    • 80 篇 信息与通信工程
    • 32 篇 生物工程
    • 21 篇 控制科学与工程
    • 17 篇 仪器科学与技术
    • 16 篇 生物医学工程(可授...
    • 15 篇 化学工程与技术
    • 13 篇 电子科学与技术(可...
    • 9 篇 机械工程
    • 9 篇 电气工程
    • 6 篇 光学工程
    • 5 篇 材料科学与工程(可...
    • 4 篇 动力工程及工程热...
  • 169 篇 理学
    • 95 篇 物理学
    • 68 篇 数学
    • 38 篇 统计学(可授理学、...
    • 37 篇 生物学
    • 15 篇 化学
    • 13 篇 系统科学
  • 73 篇 管理学
    • 52 篇 图书情报与档案管...
    • 21 篇 管理科学与工程(可...
    • 7 篇 工商管理
  • 12 篇 医学
    • 11 篇 临床医学
    • 8 篇 基础医学(可授医学...
    • 7 篇 药学(可授医学、理...
  • 10 篇 文学
    • 8 篇 外国语言文学
    • 7 篇 中国语言文学
  • 9 篇 法学
    • 8 篇 社会学
  • 7 篇 农学
    • 5 篇 作物学
  • 2 篇 经济学
  • 2 篇 教育学
  • 1 篇 军事学
  • 1 篇 艺术学

主题

  • 49 篇 speech recogniti...
  • 24 篇 speech
  • 21 篇 hidden markov mo...
  • 21 篇 training
  • 19 篇 speech processin...
  • 14 篇 acoustics
  • 13 篇 decoding
  • 13 篇 natural language...
  • 12 篇 computational mo...
  • 11 篇 signal processin...
  • 9 篇 computational li...
  • 9 篇 databases
  • 9 篇 feature extracti...
  • 8 篇 natural language...
  • 8 篇 syntactics
  • 8 篇 automatic speech...
  • 7 篇 training data
  • 7 篇 testing
  • 7 篇 speaker recognit...
  • 6 篇 machine translat...

机构

  • 27 篇 department of co...
  • 24 篇 center for langu...
  • 21 篇 department of co...
  • 18 篇 department of co...
  • 16 篇 mainlp center fo...
  • 12 篇 munich
  • 11 篇 department of co...
  • 9 篇 center for infor...
  • 9 篇 department of co...
  • 9 篇 center for speec...
  • 7 篇 department of co...
  • 7 篇 human language t...
  • 7 篇 department of el...
  • 7 篇 center for langu...
  • 7 篇 center for langu...
  • 6 篇 center for speec...
  • 6 篇 center for infor...
  • 6 篇 speechlab depart...
  • 6 篇 center for langu...
  • 6 篇 national enginee...

作者

  • 21 篇 plank barbara
  • 18 篇 zheng thomas fan...
  • 18 篇 yarowsky david
  • 17 篇 thomas fang zhen...
  • 15 篇 van der goot rob
  • 14 篇 khudanpur sanjee...
  • 12 篇 wang dong
  • 12 篇 sanjeev khudanpu...
  • 11 篇 callison-burch c...
  • 11 篇 eisner jason
  • 9 篇 schütze hinrich
  • 9 篇 lei xie
  • 9 篇 koehn philipp
  • 9 篇 cotterell ryan
  • 8 篇 du xiaojiang
  • 8 篇 smith noah a.
  • 8 篇 zhu liehuang
  • 8 篇 watanabe shinji
  • 7 篇 li zhifei
  • 7 篇 dredze mark

语言

  • 426 篇 英文
  • 11 篇 其他
  • 5 篇 中文
检索条件"机构=Department of Computer Science and Center for Language and Speech Processing"
440 条 记 录,以下是101-110 订阅
排序:
The Database and Benchmark For the Source Speaker Tracing Challenge 2024
The Database and Benchmark For the Source Speaker Tracing Ch...
收藏 引用
IEEE Spoken language Technology Workshop
作者: Ze Li Yuke Lin Tian Yao Hongbin Suo Pengyuan Zhang Yanzhen Ren Zexin Cai Hiromitsu Nishizaki Ming Li School of Computer Science Wuhan University Wuhan China Suzhou Municipal Key Laboratory of Multimodal Intelligent Systems Duke Kunshan University Kunshan China AI Center OPPO Beijing China Key Laboratory of Speech Acoustics and Content Understanding Institute of Acoustics CAS China Key Laboratory of Aerospace Information Security and Trusted Computing Ministry of Education School of Cyber Science and Engineering Wuhan University Center for Language and Speech Processing Johns Hopkins University USA Integrated Graduate School of Medicine Engineering and Agricultural Sciences University of Yamanashi 4-4-37 Takeda Kofu Yamanashi Japan
Voice conversion (VC) systems can transform audio to mimic another speaker’s voice, thereby attacking speaker verification (SV) systems. However, ongoing studies on source speaker verification (SSV) are hindered by l... 详细信息
来源: 评论
Long-Term Invariant Local Features via Implicit Cross-Domain Correspondences
arXiv
收藏 引用
arXiv 2023年
作者: Pataki, Zador Altillawi, Mohammad Kanakis, Menelaos Pautrat, Rémi Shen, Fengyi Liu, Ziyuan Van Gool, Luc Pollefeys, Marc The Computer Vision and Geometry Lab Department of Computer Science ETH Zurich Switzerland The Computer Vision Center CVC-Barcelona The Intelligent Robotics Cloud Technology lab of Huawei-Munich Germany The Computer Vision Lab Department Electrical Engineering ETH Zurich Switzerland The Intelligent Robotics Cloud Technology lab of Huawei-Munich Germany The Intelligent Robotics Cloud Technology lab of Huawei-Munich Germany The Center for Processing Speech and Images KU Leuven The Computer Vision Lab ETH Zurich Switzerland
Modern learning-based visual feature extraction networks perform well in intra-domain localization, however, their performance significantly declines when image pairs are captured across long-term visual domain variat... 详细信息
来源: 评论
An Asynchronous WFST-Based Decoder for Automatic speech Recognition
An Asynchronous WFST-Based Decoder for Automatic Speech Reco...
收藏 引用
IEEE International Conference on Acoustics, speech and Signal processing
作者: Hang Lv Zhehuai Chen Hainan Xu Daniel Povey Lei Xie Sanjeev Khudanpur Audio Speech and Language Processing Lab (ASLP@NPU) School of Computer Science Northwestern Polytechnical University Xi’an China Center of Language and Speech Processing Johns Hopkins University Baltimore MD USA Shanghai Jiao Tong University Xiaomi Corporation Beijing China Human Language Technology Center of Excellence Johns Hopkins University Baltimore MD USA
We introduce asynchronous dynamic decoder, which adopts an efficient A~* algorithm to incorporate big language models in the one-pass decoding for large vocabulary continuous speech recognition. Unlike standard one-pa... 详细信息
来源: 评论
WINVC: One-Shot Voice Conversion with Weight Adaptive Instance Normalization  18th
WINVC: One-Shot Voice Conversion with Weight Adaptive Instan...
收藏 引用
18th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2021
作者: Huang, Shengjie Chen, Mingjie Xu, Yanyan Ke, Dengfeng Hain, Thomas School of Information Science and Technology Beijing Forestry University Beijing China Engineering Research Center for Forestry-Oriented Intelligent Information Processing of National Forestry and Grassland Administration Beijing China Computer Science Department University of Sheffield Sheffield United Kingdom School of Information Science Beijing Language and Culture University Beijing China
This paper proposes a one-shot voice conversion (VC) solution. In many one-shot voice conversion solutions (e.g., Auto-encoder-based VC methods), performances have dramatically been improved due to instance normalizat... 详细信息
来源: 评论
THE DATABASE AND BENCHMARK FOR THE SOURCE SPEAKER TRACING CHALLENGE 2024
arXiv
收藏 引用
arXiv 2024年
作者: Li, Ze Lin, Yuke Yao, Tian Suo, Hongbin Zhang, Pengyuan Ren, Yanzhen Cai, Zexin Nishizaki, Hiromitsu Li, Ming School of Computer Science Wuhan University Wuhan China Suzhou Municipal Key Laboratory of Multimodal Intelligent Systems Duke Kunshan University Kunshan China AI Center OPPO Beijing China Key Laboratory of Speech Acoustics and Content Understanding Institute of Acoustics CAS China Key Laboratory of Aerospace Information Security and Trusted Computing Ministry of Education School of Cyber Science and Engineering Wuhan University China Center for Language and Speech Processing Johns Hopkins University United States Integrated Graduate School of Medicine Engineering and Agricultural Sciences University of Yamanashi 4-4-37 Takeda Yamanashi Kofu400-8510 Japan
Voice conversion (VC) systems can transform audio to mimic another speaker’s voice, thereby attacking speaker verification (SV) systems. However, ongoing studies on source speaker verification (SSV) are hindered by l... 详细信息
来源: 评论
An asynchronous wfst-based decoder for automatic speech recognition
arXiv
收藏 引用
arXiv 2021年
作者: Lv, Hang Chen, Zhehuai Xu, Hainan Povey, Daniel Xie, Lei Khudanpur, Sanjeev School of Computer Science Northwestern Polytechnical University Xi'an China Center of Language and Speech Processing United States Human Language Technology Center of Excellence Johns Hopkins University BaltimoreMD United States Xiaomi Corporation Beijing China SpeechLab Department of Computer Science and Engineering Shanghai Jiao Tong University China
We introduce asynchronous dynamic decoder, which adopts an efficient A∗ algorithm to incorporate big language models in the onepass decoding for large vocabulary continuous speech recognition. Unlike standard one-pass... 详细信息
来源: 评论
Trainable reference-based evaluation metric for identifying quality of English-Gujarati machine translation system
收藏 引用
AIP Conference Proceedings 2025年 第1期3253卷
作者: Nisheeth Joshi Pragya Katyayan Palak Arora Speech and Language Processing Lab Centre for Artificial Intelligence Banasthali Vidyapith Raj. Niwai India Department of Computer Science Banasthali Vidyapith Raj. Niwai India
Machine Translation (MT) Evaluation is an integral part of the MT development life cycle. Without analyzing the outputs of MT engines, it is impossible to performance of an MT system. Through experiments, it has been ...
来源: 评论
LMP-GAN: Out-Of-Distribution Detection For Non-Control Data Malware Attacks
收藏 引用
IEEE Transactions on Pattern Analysis and Machine Intelligence 2025年 第7期PP卷 PP页
作者: Wood, David Kapp, David Kebede, Temesgen Hirakawa, Keigo Wuhan University School of Computer Science China Wuhan University National Engineering Research Center for Multimedia Software Hubei Key Laboratory of Multimedia and Network Communication Engineering China Zhongguancun Academy China Wuhan University State Key Laboratory of Information Engineering in Surveying Mapping and Remote Sensing China Sun Yat-sen University School of Geography and Planning China Mohamed bin Zayed University of Artificial Intelligence United Arab Emirates Chongqing University College of Computer Science China The University of Tokyo Japan RIKEN Center for Advanced Intelligence Project Japan Intelligent Science & Technology Academy Limited CASIC China iFlytek Company Ltd. National Engineering Research Center of Speech and Language Information Processing China Nanyang Technological University College of Computing & Data Science Singapore Henan Academy of Sciences Aerospace Information Research Institute China
Anomaly detection is a common application of machine learning. Out-of-distribution (OOD) detection in particular is a semi-supervised anomaly detection technique where the detection method is trained only on the inlie... 详细信息
来源: 评论
MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining
arXiv
收藏 引用
arXiv 2024年
作者: Wang, Di Zhang, Jing Xu, Minqiang Liu, Lin Wang, Dongsheng Gao, Erzhong Han, Chengxi Guo, Haonan Du, Bo Tao, Dacheng Zhang, Liangpei School of Computer Science Wuhan University Wuhan430072 China Institute of Artificial Intelligence Wuhan University Wuhan430072 China National Engineering Research Center for Multimedia Software Wuhan University Wuhan430072 China Hubei Key Laboratory of Multimedia and Network Communication Engineering Wuhan University Wuhan430072 China School of Computer Science Faculty of Engineering The University of Sydney Australia iFlytek Co Ltd National Engineering Research Center of Speech and Language Information Processing Hefei230088 China State Key Laboratory of Information Engineering in Surveying Mapping and Remote Sensing Wuhan University Wuhan430079 China School of Computer Science and Engineering Nanyang Technological University Singapore Singapore
Foundation models have reshaped the landscape of Remote Sensing (RS) by enhancing various image interpretation tasks. Pretraining is an active research topic, encompassing supervised and self-supervised learning metho... 详细信息
来源: 评论
SAV-SE: Scene-aware Audio-Visual speech Enhancement with Selective State Space Model
arXiv
收藏 引用
arXiv 2024年
作者: Qian, Xinyuan Gao, Jiaran Zhang, Yaodan Zhang, Qiquan Liu, Hexin Garcia, Leibny Paola Li, Haizhou The School of Computer and Communication Engineering University of Science and Technology Beijing Beijing100083 China The School of Electrical Engineering and Telecommunications The University of New South Wales Sydney2052 Australia The College of Computing and Data Science Nanyang Technological University Singapore The Center for Language and Speech Processing Johns Hopkins University United States The Guangdong Provincial Key Laboratory of Big Data Computing The Chinese University of Hong Kong Shenzhen518172 China Shenzhen Research Institute of Big data Shenzhen51872 China
speech enhancement plays an essential role in various applications, and the integration of visual information has been demonstrated to bring substantial advantages. However, the majority of current research concentrat... 详细信息
来源: 评论