咨询与建议

限定检索结果

文献类型

  • 48 篇 会议
  • 14 篇 期刊文献

馆藏范围

  • 62 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 58 篇 工学
    • 46 篇 计算机科学与技术...
    • 24 篇 电气工程
    • 16 篇 软件工程
    • 8 篇 信息与通信工程
    • 5 篇 电子科学与技术(可...
    • 4 篇 控制科学与工程
    • 2 篇 网络空间安全
    • 1 篇 土木工程
    • 1 篇 水利工程
    • 1 篇 石油与天然气工程
    • 1 篇 航空宇航科学与技...
    • 1 篇 生物医学工程(可授...
  • 14 篇 医学
    • 13 篇 临床医学
    • 1 篇 基础医学(可授医学...
    • 1 篇 特种医学
  • 10 篇 理学
    • 9 篇 物理学
    • 1 篇 地球物理学
  • 7 篇 文学
    • 5 篇 外国语言文学
    • 2 篇 中国语言文学
    • 1 篇 新闻传播学
  • 3 篇 管理学
    • 3 篇 图书情报与档案管...
  • 2 篇 教育学
    • 2 篇 教育学
    • 2 篇 心理学(可授教育学...
  • 1 篇 法学
    • 1 篇 社会学

主题

  • 62 篇 sequence-to-sequ...
  • 10 篇 speech recogniti...
  • 4 篇 deep learning
  • 4 篇 natural language...
  • 4 篇 attention
  • 3 篇 text summarizati...
  • 3 篇 end-to-end
  • 3 篇 attention models
  • 3 篇 speech synthesis
  • 3 篇 recurrent neural...
  • 3 篇 neural machine t...
  • 2 篇 medical staff mo...
  • 2 篇 event simulation
  • 2 篇 lstm
  • 2 篇 keyword spotting
  • 2 篇 arabic translite...
  • 2 篇 minimum bayes ri...
  • 2 篇 text-to-text tra...
  • 2 篇 low-resource lan...
  • 2 篇 arabic language

机构

  • 5 篇 google inc mount...
  • 2 篇 univ waterloo da...
  • 2 篇 univ edinburgh c...
  • 2 篇 univ fed rio gra...
  • 1 篇 cas ctr excellen...
  • 1 篇 inria
  • 1 篇 univ edinburgh c...
  • 1 篇 google res mount...
  • 1 篇 south china univ...
  • 1 篇 florida atlantic...
  • 1 篇 univ essex sch c...
  • 1 篇 avignon univ lia...
  • 1 篇 google inc 1600 ...
  • 1 篇 vignans fdn sci ...
  • 1 篇 univ sheffield d...
  • 1 篇 niger volta lang...
  • 1 篇 aix-marseille un...
  • 1 篇 karlsruhe inst t...
  • 1 篇 northwestern pol...
  • 1 篇 indian inst tech...

作者

  • 5 篇 prabhavalkar roh...
  • 4 篇 sainath tara n.
  • 3 篇 besacier laurent
  • 3 篇 rao kanishka
  • 2 篇 pundak golan
  • 2 篇 lowe michael
  • 2 篇 skerry-ryan r. j...
  • 2 篇 pradeep ronak
  • 2 篇 leevy joffrey l.
  • 2 篇 khoshgoftaar tag...
  • 2 篇 lin jimmy
  • 2 篇 villavicencio al...
  • 2 篇 stanton daisy
  • 2 篇 gallegos pilar o...
  • 2 篇 boito marcely za...
  • 2 篇 niehues jan
  • 2 篇 prusa joseph d.
  • 2 篇 kannan anjuli
  • 2 篇 king simon
  • 1 篇 mariooryad soroo...

语言

  • 60 篇 英文
  • 2 篇 其他
检索条件"主题词=sequence-to-sequence models"
62 条 记 录,以下是41-50 订阅
排序:
LOCATION-RELATIVE ATTENTION MECHANISMS FOR ROBUST LONG-FORM SPEECH SYNTHESIS
LOCATION-RELATIVE ATTENTION MECHANISMS FOR ROBUST LONG-FORM ...
收藏 引用
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
作者: Battenberg, Eric Skerry-Ryan, R. J. Mariooryad, Soroosh Stanton, Daisy Kao, David Shannon, Matt Bagby, Tom Google Res Mountain View CA 94043 USA
Despite the ability to produce human-level speech for in-domain text, attention-based end-to-end text-to-speech (TTS) systems suffer from text alignment failures that increase in frequency for out-of-domain text. We s... 详细信息
来源: 评论
A DATA EFFICIENT END-TO-END SPOKEN LANGUAGE UNDERSTANDING ARCHITECTURE
A DATA EFFICIENT END-TO-END SPOKEN LANGUAGE UNDERSTANDING AR...
收藏 引用
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
作者: Dinarelli, Marco Kapoor, Nikita Jabaian, Bassam Besacier, Laurent Univ Grenoble Alpes LIG Grenoble France Avignon Univ LIA Avignon France
End-to-end architectures have been recently proposed for spoken language understanding (SLU) and semantic parsing. Based on a large amount of data, those models learn jointly acoustic and linguistic-sequential feature... 详细信息
来源: 评论
Efficient neural speech synthesis for low-resource languages through multilingual modeling  21
Efficient neural speech synthesis for low-resource languages...
收藏 引用
Interspeech Conference
作者: de Korte, Marcel Kim, Jaebok Klabbers, Esther ReadSpeaker Huis Ter Heide Netherlands
Recent advances in neural TTS have led to models that can produce high-quality synthetic speech. However, these models typically require large amounts of training data, which can make it costly to produce a new voice ... 详细信息
来源: 评论
An Unsupervised Method to Select a Speaker Subset from Large Multi-Speaker Speech Synthesis Datasets  21
An Unsupervised Method to Select a Speaker Subset from Large...
收藏 引用
Interspeech Conference
作者: Gallegos, Pilar Oplustil Williams, Jennifer Rownicka, Joanna King, Simon Univ Edinburgh Ctr Speech Technol Res Edinburgh Midlothian Scotland
Large multi-speaker datasets for TTS typically contain diverse speakers, recording conditions, styles and quality of data. Although one might generally presume that more data is better, in this paper we show that a mo... 详细信息
来源: 评论
sequence-to-sequence Speech Recognition for Air Traffic Control Communication  32
Sequence-to-Sequence Speech Recognition for Air Traffic Cont...
收藏 引用
32nd Benelux Conference on Artificial Intelligence, BNAIC 2020 and 29th Annual Belgian-Dutch Conference on Machine Learning, BeneLearn 2020
作者: Rozenbroek, Tijs Radboud University Nijmegen Netherlands
来源: 评论
Controllable sentence simplification  12
Controllable sentence simplification
收藏 引用
12th International Conference on Language Resources and Evaluation, LREC 2020
作者: Martin, Louis de la Clergerie, Éric Villemonte Sagot, Benoît Bordes, Antoine Facebook AI Research 6 Rue Ménars Paris75002 France Inria Sorbonne Université 2 rue Simone Iff Paris275012 France
Text simplification aims at making a text easier to read and understand by simplifying grammar and structure while keeping the underlying information identical. It is often considered an all-purpose generic task where... 详细信息
来源: 评论
A multi-encoder neural conversation model
收藏 引用
NEUROCOMPUTING 2019年 358卷 344-354页
作者: Ren, Da Cai, Yi Lei, Xue Xu, Jingyun Li, Qing Leung, Ho-fung South China Univ Technol Sch Software Engn Guangzhou Guangdong Peoples R China Hong Kong Polytech Univ Dept Comp Hung Hom Kowloon Hong Kong Peoples R China Chinese Univ Hong Kong Dept Comp Sci & Engn Hong Kong Peoples R China
With the development of deep neural networks, sequence to sequence (Seq2Seq) models become a popular technique of conversation models. Current Seq2Seq models with single encoder-decoder structures tend to generate res... 详细信息
来源: 评论
sequence-TO-sequence MODELLING OF F0 FOR SPEECH EMOTION CONVERSION  44
SEQUENCE-TO-SEQUENCE MODELLING OF F0 FOR SPEECH EMOTION CONV...
收藏 引用
44th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
作者: Robinson, Carl Obin, Nicolas Roebel, Axel Sorbonne Univ CNRS IRCAM Paris France
Voice interfaces are becoming wildly popular and driving demand for more advanced speech synthesis and voice transformation systems. Current text-to-speech methods produce realistic sounding voices, but they lack the ... 详细信息
来源: 评论
CONTEXTUAL SPEECH RECOGNITION WITH DIFFICULT NEGATIVE TRAINING EXAMPLES  44
CONTEXTUAL SPEECH RECOGNITION WITH DIFFICULT NEGATIVE TRAINI...
收藏 引用
44th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
作者: Alon, Uri Pundak, Golan Sainath, Tara N. Technion Haifa Israel Google Inc Mountain View CA USA
Improving the representation of contextual information is key to unlocking the potential of end-to-end (E2E) automatic speech recognition (ASR). In this work, we present a novel and simple approach for training an ASR... 详细信息
来源: 评论
Very Deep Self-Attention Networks for End-to-End Speech Recognition  20
Very Deep Self-Attention Networks for End-to-End Speech Reco...
收藏 引用
Interspeech Conference
作者: Ngoc-Quan Pham Thai-Son Nguyen Niehues, Jan Mueller, Markus Waibel, Alex Karlsruhe Inst Technol Interact Syst Lab Karlsruhe Germany Carnegie Mellon Univ Pittsburgh PA 15213 USA
Recently, end-to-end sequence-to-sequence models for speech recognition have gained significant interest in the research community. While previous architecture choices revolve around time-delay neural networks (TDNN) ... 详细信息
来源: 评论