Search query: Institution = "Center for Language and Speech Processing and Department of Electrical and Computer Engineering"
164 records; showing 51-60
How phonotactics affect multilingual and zero-shot ASR performance
arXiv, 2020
Authors: Feng, Siyuan; Zelasko, Piotr; Moro-Velázquez, Laureano; Abavisani, Ali; Hasegawa-Johnson, Mark; Scharenborg, Odette; Dehak, Najim
Affiliations: Multimedia Computing Group, Delft University of Technology, Delft, Netherlands; Center for Language and Speech Processing, United States; Human Language Technology Center of Excellence, Johns Hopkins University, Baltimore, MD, United States; Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, IL, United States
The idea of combining multiple languages’ recordings to train a single automatic speech recognition (ASR) model brings the promise of the emergence of universal speech representation. Recently, a Transformer encoder-...
End-to-end multi-speaker speech recognition with transformer
arXiv, 2020
Authors: Chang, Xuankai; Zhang, Wangyou; Qian, Yanmin; Le Roux, Jonathan; Watanabe, Shinji
Affiliations: Center for Language and Speech Processing, Johns Hopkins University, United States; MoE Key Lab of Artificial Intelligence & SpeechLab, Department of Computer Science and Engineering, Shanghai Jiao Tong University, China; United States
Recently, fully recurrent neural network (RNN) based end-to-end models have been proven to be effective for multi-speaker speech recognition in both the single-channel and multi-channel scenarios. In this work, we exp...
Using ASR methods for OCR
15th IAPR International Conference on Document Analysis and Recognition, ICDAR 2019
Authors: Arora, Ashish; Garcia, Paola; Watanabe, Shinji; Manohar, Vimal; Shao, Yiwen; Khudanpur, Sanjeev; Chang, Chun Chieh; Rekabdar, Babak; Babaali, Bagher; Povey, Daniel; Etter, David; Raj, Desh; Hadian, Hossein; Trmal, Jan
Affiliations: Center for Language and Speech Processing, Johns Hopkins University, Baltimore, United States; Human Language Technology Center of Excellence, Johns Hopkins University, Baltimore, United States; Department of Computer Engineering, Sharif University of Technology, Iran; School of Mathematics, Statistics and Computer Sciences, College of Science, University of Tehran, Iran
Hybrid deep neural network hidden Markov models (DNN-HMM) have achieved impressive results on large vocabulary continuous speech recognition (LVCSR) tasks. However, the recent approaches using DNN-HMM models are not e...
End-to-end far-field speech recognition with unified dereverberation and beamforming
arXiv, 2020
Authors: Zhang, Wangyou; Subramanian, Aswin Shanmugam; Chang, Xuankai; Watanabe, Shinji; Qian, Yanmin
Affiliations: MoE Key Lab of Artificial Intelligence & SpeechLab, Department of Computer Science and Engineering, AI Institute, Shanghai Jiao Tong University, Shanghai, China; Center for Language and Speech Processing, Johns Hopkins University, United States
Despite successful applications of end-to-end approaches in multi-channel speech recognition, the performance still degrades severely when the speech is corrupted by reverberation. In this paper, we integrate the dere...
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis
arXiv, 2020
Authors: Raj, Desh; Denisov, Pavel; Chen, Zhuo; Erdogan, Hakan; Huang, Zili; He, Maokui; Watanabe, Shinji; Du, Jun; Yoshioka, Takuya; Luo, Yi; Kanda, Naoyuki; Li, Jinyu; Wisdom, Scott; Hershey, John R.
Affiliations: Center for Language and Speech Processing, Johns Hopkins University, Baltimore, MD, United States; Institute for Natural Language Processing, University of Stuttgart, Germany; Microsoft Corp, Redmond, WA, United States; Google Research, Cambridge, MA, United States; University of Science and Technology of China, Hefei, China; Department of Electrical Engineering, Columbia University, NY, United States
Multi-speaker speech recognition of unsegmented recordings has diverse applications such as meeting transcription and automatic subtitle generation. With technical advances in systems dealing with speech separation, s...
A continual learning survey: Defying forgetting in classification tasks
arXiv, 2019
Authors: de Lange, Matthias; Aljundi, Rahaf; Masana, Marc; Parisot, Sarah; Jia, Xu; Leonardis, Aleš; Slabaugh, Gregory; Tuytelaars, Tinne
Affiliations: Center for Processing Speech and Images, Department Electrical Engineering, KU Leuven; Computer Vision Center, UAB; Huawei
Artificial neural networks thrive in solving the classification problem for a particular rigid task, acquiring knowledge through generalized learning behaviour from a distinct training phase. The resulting network res...
Speech enhancement via deep spectrum image translation network
arXiv, 2019
Authors: Kashani, Hamidreza Baradaran; Goodarzi, Mohammad Mohsen; Jodeiri, Ata; Rezaei, Iman Sarraf
Affiliations: Electrical Engineering Faculty, Amirkabir University of Technology, Tehran, Iran; Department of Electrical and Computer Engineering, Buein Zahra Technical University, Qazvin, Iran; School of Electrical & Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran; Speech and Language Processing Group, Research Center for Development of Advanced Technologies, Tehran, Iran
Quality and intelligibility of speech signals are degraded under additive background noise which is a critical problem for hearing aid and cochlear implant users. Motivated to address this problem, we propose a novel ...
Speech Enhancement via Deep Spectrum Image Translation Network
Iranian Conference of Biomedical Engineering (ICBME)
Authors: Hamidreza Baradaran Kashani; Ata Jodeiri; Mohammad Mohsen Goodarzi; Iman Sarraf Rezaei
Affiliations: Electrical Engineering Faculty, Amirkabir University of Technology, Tehran, Iran; School of Electrical & Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran; Department of Electrical and Computer Engineering, Buein Zahra Technical University, Qazvin, Iran; Research Center for Development of Advanced Technologies, Speech and Language Processing Group, Tehran, Iran
Quality and intelligibility of speech signals are degraded under additive background noise which is a critical problem for hearing aid and cochlear implant users. Motivated to address this problem, we propose a novel ...
MIMO-speech: End-to-end multi-channel multi-speaker speech recognition
arXiv, 2019
Authors: Chang, Xuankai; Zhang, Wangyou; Qian, Yanmin; Le Roux, Jonathan; Watanabe, Shinji
Affiliations: Center for Language and Speech Processing, Johns Hopkins University, United States; SpeechLab, Department of Computer Science and Engineering, Shanghai Jiao Tong University, China; United States
Recently, the end-to-end approach has proven its efficacy in monaural multi-speaker speech recognition. However, high word error rates (WERs) still prevent these systems from being used in practical applications. On t...
Speaker Embedding Extraction with Virtual Phonetic Information
IEEE Global Conference on Signal and Information Processing (GlobalSIP)
Authors: S. Sreekanth; Shaik Mohammad Rafi B; K Sri Rama Murty; Saurabhchand Bhati
Affiliations: Department of Electronics and Communications Engineering, IIIT RK Valley, RGUKT-AP; Department of Electrical Engineering, Indian Institute of Technology Hyderabad, India; Center for Language and Speech Processing, The Johns Hopkins University, USA
In the recent past, deep neural networks have been successfully employed to extract fixed-dimensional speaker embeddings from the speech signal. The commonly used x-vectors are extracted by projecting the magnitude sp...