Search query: Institution = "Department of Computer Science and Center for Language and Speech Processing"
440 records; showing 131-140
Unpaired Overwater Image Defogging Using Prior Map Guided CycleGAN
arXiv, 2022
Authors: Mo, Yaozong; Li, Chaofeng; Ren, Wenqi; Shang, Shaopeng; Wang, Wenwu; Wu, Xiao-Jun
Affiliations: The Institute of Logistics Science and Engineering, Shanghai Maritime University, Shanghai 201306, China; The School of Cyber Science and Technology, Sun Yat-sen University, Shenzhen 518000, China; The Vocational College, Shanghai Jian Qiao University, Shanghai 201306, China; The Center for Vision, Speech and Signal Processing, Department of Electrical and Electronic Engineering, University of Surrey, Surrey GU2 7XH, United Kingdom; The School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi 214122, China
Deep learning-based methods have achieved significant performance for image defogging. However, existing methods are mainly developed for land scenes and perform poorly when dealing with overwater foggy images, since ...
End-to-end multi-speaker speech recognition with transformer
arXiv, 2020
Authors: Chang, Xuankai; Zhang, Wangyou; Qian, Yanmin; Le Roux, Jonathan; Watanabe, Shinji
Affiliations: Center for Language and Speech Processing, Johns Hopkins University, United States; MoE Key Lab of Artificial Intelligence & SpeechLab, Department of Computer Science and Engineering, Shanghai Jiao Tong University, China; United States
Recently, fully recurrent neural network (RNN) based end-to-end models have been proven to be effective for multi-speaker speech recognition in both the single-channel and multi-channel scenarios. In this work, we exp...
Wake Word Detection with Alignment-Free Lattice-Free MMI
arXiv, 2020
Authors: Wang, Yiming; Lv, Hang; Povey, Daniel; Xie, Lei; Khudanpur, Sanjeev
Affiliations: Center for Language and Speech Processing, Human Language Technology Center of Excellence, Johns Hopkins University, Baltimore, MD, United States; Xiaomi Inc., Beijing, China; ASLP@NPU, School of Computer Science, Northwestern Polytechnical University, Xi’an, China
Always-on spoken language interfaces, e.g. personal digital assistants, rely on a wake word to start processing spoken input. We present novel methods to train a hybrid DNN/HMM wake word detection system from partiall...
Using ASR methods for OCR
15th IAPR International Conference on Document Analysis and Recognition, ICDAR 2019
Authors: Arora, Ashish; Garcia, Paola; Watanabe, Shinji; Manohar, Vimal; Shao, Yiwen; Khudanpur, Sanjeev; Chang, Chun Chieh; Rekabdar, Babak; Babaali, Bagher; Povey, Daniel; Etter, David; Raj, Desh; Hadian, Hossein; Trmal, Jan
Affiliations: Center for Language and Speech Processing, Johns Hopkins University, Baltimore, United States; Human Language Technology Center of Excellence, Johns Hopkins University, Baltimore, United States; Department of Computer Engineering, Sharif University of Technology, Iran; School of Mathematics, Statistics and Computer Sciences, College of Science, University of Tehran, Iran
Hybrid deep neural network hidden Markov models (DNN-HMM) have achieved impressive results on large vocabulary continuous speech recognition (LVCSR) tasks. However, the recent approaches using DNN-HMM models are not e...
End-to-end far-field speech recognition with unified dereverberation and beamforming
arXiv, 2020
Authors: Zhang, Wangyou; Subramanian, Aswin Shanmugam; Chang, Xuankai; Watanabe, Shinji; Qian, Yanmin
Affiliations: MoE Key Lab of Artificial Intelligence & SpeechLab, Department of Computer Science and Engineering, AI Institute, Shanghai Jiao Tong University, Shanghai, China; Center for Language and Speech Processing, Johns Hopkins University, United States
Despite successful applications of end-to-end approaches in multi-channel speech recognition, the performance still degrades severely when the speech is corrupted by reverberation. In this paper, we integrate the dere...
Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals
arXiv, 2020
Authors: Shi, Jing; Chang, Xuankai; Guo, Pengcheng; Watanabe, Shinji; Fujita, Yusuke; Xu, Jiaming; Xu, Bo; Xie, Lei
Affiliations: Center for Language and Speech Processing, Johns Hopkins University, Beijing, China; ASLP@NPU, School of Computer Science, Northwestern Polytechnical University, Xi’an, China; Hitachi Ltd., Research & Development Group
Neural sequence-to-sequence models are well established for applications which can be cast as mapping a single input sequence into a single output sequence. In this work, we focus on one-to-many sequence transduction ...
Isolating Host Environment by Booting Android from OTG Devices
Chinese Journal of Electronics, 2018, vol. 27, no. 3, pp. 617-624
Authors: XUE Yuan; ZHANG Xiaosong; YU Xiao; ZHANG Yaoyuan; TAN Yu'an; LI Yuanzhang
Affiliations: School of Computer Science and Technology, Beijing Institute of Technology; Department of Computer Science and Technology, Tangshan University; Research Center of Massive Language Information Processing and Cloud Computing Application
With the integration of smartphone into daily life, end users store a large amount of sensitive information into Android device. For protecting the sensitive information, a method of multi-booting Android OS from On-T...
Machine learning-based longitudinal prediction for GJB2-related sensorineural hearing loss
Computers in Biology and Medicine, 2024, vol. 176, p. 108597
Authors: Chen, Pey-Yu; Yang, Ta-Wei; Tseng, Yi-Shan; Tsai, Cheng-Yu; Yeh, Chiung-Szu; Lee, Yen-Hui; Lin, Pei-Hsuan; Lin, Ting-Chun; Wu, Yu-Jen; Yang, Ting-Hua; Chiang, Yu-Ting; Hsu, Jacob Shu-Jui; Hsu, Chuan-Jen; Chen, Pei-Lung; Chou, Chen-Fu; Wu, Chen-Chi
Affiliations: Department of Otolaryngology, MacKay Memorial Hospital, Taipei, Taiwan; Department of Audiology and Speech-Language Pathology, Mackay Medical College, New Taipei City, Taiwan; Department of Otolaryngology, National Taiwan University Hospital, Taipei, Taiwan; Graduate Institute of Networking and Multimedia, National Taiwan University, Taipei, Taiwan; Department of Computer Science & Information Engineering, National Taiwan University, Taipei, Taiwan; Graduate Institute of Medical Genomics and Proteomics, National Taiwan University College of Medicine, Taipei, Taiwan; Department of Otolaryngology, National Taiwan University Biomedical Park Hospital, Hsinchu County, Taiwan; Department of Otolaryngology, National Taiwan University Hospital Hsin-Chu Branch, Hsinchu City, Taiwan; Graduate Institute of Clinical Medicine, College of Medicine, National Taiwan University, Taipei, Taiwan; Department of Otorhinolaryngology-Head and Neck Surgery, Taichung Tzu Chi Hospital, Buddhist Tzu Chi Medical Foundation, Taichung, Taiwan; Hearing and Speech Center, National Taiwan University Hospital, Taipei, Taiwan; School of Medicine, Tzu Chi University, Hualien, Taiwan; Department of Medical Genetics, National Taiwan University Hospital, Taipei, Taiwan; Department of Medical Research, National Taiwan University Hospital Hsin-Chu Branch, Hsin-Chu, Taiwan
Background: Recessive GJB2 variants, the most common genetic cause of hearing loss, may contribute to progressive sensorineural hearing loss (SNHL). The aim of this study is to build a realistic predictive model for G...
How phonotactics affect multilingual and zero-shot ASR performance
arXiv, 2020
Authors: Feng, Siyuan; Zelasko, Piotr; Moro-Velázquez, Laureano; Abavisani, Ali; Hasegawa-Johnson, Mark; Scharenborg, Odette; Dehak, Najim
Affiliations: Multimedia Computing Group, Delft University of Technology, Delft, Netherlands; Center for Language and Speech Processing, United States; Human Language Technology Center of Excellence, Johns Hopkins University, Baltimore, MD, United States; Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, IL, United States
The idea of combining multiple languages’ recordings to train a single automatic speech recognition (ASR) model brings the promise of the emergence of universal speech representation. Recently, a Transformer encoder-...
CN-Celeb: A Challenging Chinese Speaker Recognition Dataset
IEEE International Conference on Acoustics, Speech and Signal Processing
Authors: Y. Fan; J.W. Kang; L.T. Li; K.C. Li; H.L. Chen; S.T. Cheng; P.Y. Zhang; Z.Y. Zhou; Y.Q. Cai; D. Wang
Affiliations: Key Laboratory of Transient Physics, Nanjing University of Science and Technology, China; Center for Speech and Language Technologies, Tsinghua University, China; Department of Computer Science and Technology, Tsinghua University, China; Beijing National Research Center for Information Science and Technology, China
Recently, researchers set an ambitious goal of conducting speaker recognition in unconstrained conditions where the variations on ambient, channel and emotion could be arbitrary. However, most publicly available datas...