Search query: "Subject term = sequence-to-sequence models"
62 records; results 11-20 are listed below.

A Comparison of Sequence-to-Sequence Models for Speech Recognition
18th Annual Conference of the International Speech Communication Association (INTERSPEECH 2017)
Authors: Prabhavalkar, Rohit; Rao, Kanishka; Sainath, Tara N.; Li, Bo; Johnson, Leif; Jaitly, Navdeep
Affiliations: Google Inc, Mountain View, CA 94043, USA; NVIDIA, Santa Clara, CA, USA
In this work, we conduct a detailed evaluation of various all neural, end-to-end trained, sequence-to-sequence models applied to the task of speech recognition. Notably, each of these systems directly predicts graphem...

Exploring Sequence-to-Sequence Transformer-Transducer Models for Keyword Spotting
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
Authors: Labrador, Beltrán; Zhao, Guanlong; López Moreno, Ignacio; Scorza Scarpati, Angelo; Fowl, Liam; Wang, Quan
Affiliations: Spain; Google LLC, United States
In this paper, we present a novel approach to adapt a sequence-to-sequence Transformer-Transducer ASR system to the keyword spotting (KWS) task. We achieve this by replacing the keyword in the text transcription with ...

Prosody recognition in Persian poetry
Speech Communication, 2025, vol. 170
Authors: Shahrestani, Mohammadreza; Chehreghani, Mostafa Haghir
Affiliation: Amirkabir Univ Technol (Tehran Polytech), Dept Comp Engn, Tehran, Iran
Classical Persian poetry, like traditional poetry from other cultures, follows set metrical patterns, known as prosody. Recognizing prosody of a given poetry is very useful in understanding and analyzing Persian langu...

Advancing machine learning with OCR2SEQ: an innovative approach to multi-modal data augmentation
Journal of Big Data, 2024, vol. 11, no. 1, p. 86
Authors: Lowe, Michael; Prusa, Joseph D.; Leevy, Joffrey L.; Khoshgoftaar, Taghi M.
Affiliation: Florida Atlantic Univ, 777 Glades Rd, Boca Raton, FL 33431, USA
OCR2SEQ represents an innovative advancement in Optical Character Recognition (OCR) technology, leveraging a multi-modal generative augmentation strategy to overcome traditional limitations in OCR systems. This paper ...

Context-Relevant Denoising for Unsupervised Domain-Adapted Sentence Embeddings
25th IEEE International Conference on Information Reuse and Integration for Data Science (IEEE IRI)
Authors: Lowe, Michael; Prusa, Joseph D.; Leevy, Joffrey L.; Khoshgoftaar, Taghi M.
Affiliation: Florida Atlantic Univ, Boca Raton, FL 33431, USA
In closed-system domains, such as healthcare databases, record scarcity and data quality often act as barriers to applying state-of-the-art language processing techniques. Addressing these challenges requires the adju...

Feature Extraction Approach for Predicting Protein-DNA Binding Residues Using Transformer Encoder-Decoder Architecture
20th International Conference on Intelligent Computing (ICIC)
Authors: Qiu, Yi; Cheng, Long; Xu, Man; Chen, Jing; Wu, Hongjie
Affiliation: Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou 215009, Jiangsu, Peoples R China
In the realm of biology, the effects of protein binding with other molecules are of paramount importance, especially in the context of DNA binding. Precisely identifying the residues implicated in protein-DNA binding ...

Sequence-to-Sequence Multi-Modal Speech In-Painting
24th Interspeech Conference
Authors: Elyaderani, Mahsa Kadkhodaei; Shirani, Shahram
Affiliation: McMaster Univ, Dept Computat Sci & Engn, Hamilton, ON, Canada
Speech in-painting is the task of regenerating missing audio contents using reliable context information. Despite various recent studies in multi-modal perception of audio in-painting, there is still a need for an eff...

Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition
20th Interspeech Conference
Authors: Bai, Ye; Yi, Jiangyan; Tao, Jianhua; Tian, Zhengkun; Wen, Zhengqi
Affiliations: Chinese Acad Sci, Inst Automat, NLPR, Beijing, Peoples R China; Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China; CAS Ctr Excellence Brain Sci & Intelligence Techn, Shanghai, Peoples R China
Integrating an external language model into a sequence-to-sequence speech recognition system is non-trivial. Previous works utilize linear interpolation or a fusion network to integrate external language models. Howev...

Investigating the robustness of sequence-to-sequence text-to-speech models to imperfectly-transcribed training data
20th Interspeech Conference
Authors: Fong, Jason; Gallegos, Pilar Oplustil; Hodari, Zack; King, Simon
Affiliation: Univ Edinburgh, Ctr Speech Technol Res, Edinburgh, Midlothian, Scotland
Sequence-to-sequence (S2S) text-to-speech (TTS) models can synthesise high quality speech when large amounts of annotated training data are available. Transcription errors exist in all data and are especially prevalen...

Abstract Representation for Multi-Intent Spoken Language Understanding
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
Authors: Abrougui, Rim; Damnati, Géraldine; Heinecke, Johannes; Béchet, Frédéric
Affiliations: Orange Innovation, Lannion, France; Aix-Marseille University, CNRS, Marseille, France
Current sequence tagging models based on Deep Neural Network models with pretrained language models achieve almost perfect results on many SLU benchmarks with a flat semantic annotation at the token level such as ATIS...