咨询与建议

限定检索结果

文献类型

  • 167 篇 会议
  • 125 篇 期刊文献
  • 4 篇 学位论文

馆藏范围

  • 296 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 281 篇 工学
    • 208 篇 计算机科学与技术...
    • 115 篇 电气工程
    • 50 篇 软件工程
    • 37 篇 信息与通信工程
    • 20 篇 控制科学与工程
    • 7 篇 机械工程
    • 7 篇 仪器科学与技术
    • 7 篇 石油与天然气工程
    • 6 篇 土木工程
    • 5 篇 水利工程
    • 5 篇 生物医学工程(可授...
    • 4 篇 环境科学与工程(可...
    • 3 篇 动力工程及工程热...
    • 2 篇 电子科学与技术(可...
    • 2 篇 建筑学
  • 64 篇 理学
    • 47 篇 物理学
    • 8 篇 生物学
    • 4 篇 化学
    • 4 篇 地球物理学
    • 3 篇 数学
    • 2 篇 大气科学
  • 56 篇 医学
    • 52 篇 临床医学
    • 2 篇 基础医学(可授医学...
  • 17 篇 管理学
    • 14 篇 管理科学与工程(可...
    • 2 篇 图书情报与档案管...
  • 4 篇 文学
    • 3 篇 外国语言文学
    • 1 篇 中国语言文学
  • 2 篇 农学
    • 2 篇 作物学
  • 1 篇 经济学
    • 1 篇 理论经济学
  • 1 篇 法学
    • 1 篇 社会学

主题

  • 296 篇 sequence-to-sequ...
  • 42 篇 deep learning
  • 23 篇 transformer
  • 21 篇 speech recogniti...
  • 17 篇 lstm
  • 16 篇 encoder-decoder
  • 16 篇 end-to-end
  • 14 篇 attention mechan...
  • 14 篇 attention
  • 11 篇 neural networks
  • 11 篇 recurrent neural...
  • 10 篇 task analysis
  • 10 篇 long short-term ...
  • 10 篇 speech synthesis
  • 9 篇 training
  • 8 篇 voice conversion
  • 8 篇 neural network
  • 7 篇 automatic sleep ...
  • 7 篇 self-attention
  • 7 篇 natural language...

机构

  • 5 篇 univ chinese aca...
  • 5 篇 chinese acad sci...
  • 5 篇 univ sci & techn...
  • 4 篇 alibaba grp peop...
  • 4 篇 amazon alexa mac...
  • 3 篇 tech univ cluj n...
  • 3 篇 google mountain ...
  • 3 篇 brno univ techno...
  • 3 篇 google inc mount...
  • 3 篇 karlsruhe inst t...
  • 3 篇 univ chinese aca...
  • 3 篇 johns hopkins un...
  • 2 篇 singapore manage...
  • 2 篇 univ southern qu...
  • 2 篇 tsinghua univ de...
  • 2 篇 indiana univ blo...
  • 2 篇 univ augsburg ch...
  • 2 篇 univ sci & techn...
  • 2 篇 univ alberta edm...
  • 2 篇 natl univ singap...

作者

  • 5 篇 mouchtaris athan...
  • 5 篇 xu bo
  • 5 篇 ling zhen-hua
  • 5 篇 prabhavalkar roh...
  • 4 篇 sainath tara n.
  • 4 篇 de vos maarten
  • 4 篇 chen oliver y.
  • 4 篇 radfar martin
  • 4 篇 watanabe shinji
  • 4 篇 dai li-rong
  • 3 篇 bruguier antoine
  • 3 篇 wang jian
  • 3 篇 hayashi tomoki
  • 3 篇 andres-ferrer je...
  • 3 篇 waibel alex
  • 3 篇 mertins alfred
  • 3 篇 rybach david
  • 3 篇 xu shuang
  • 3 篇 zhou shiyu
  • 3 篇 li haizhou

语言

  • 291 篇 英文
  • 3 篇 其他
  • 1 篇 中文
检索条件"主题词=sequence-to-sequence"
296 条 记 录,以下是21-30 订阅
排序:
SMILE: sequence-to-sequence DOMAIN ADAPTATION WITH MINIMIZING LATENT ENTROPY FOR TEXT IMAGE RECOGNITION  29
SMILE: SEQUENCE-TO-SEQUENCE DOMAIN ADAPTATION WITH MINIMIZIN...
收藏 引用
IEEE International Conference on Image Processing (ICIP)
作者: Chang, Yen-Cheng Chen, Yi-Chang Chang, Yu-Chuan Yeh, Yi-Ren E SUN Financial Holding Co Ltd Taipei Taiwan Natl Kaohsiung Normal Univ Dept Math Kaohsiung Taiwan
Excellent text recognition results have been obtained by training recognition models with synthetic images. However, recognizing text from real-world images still faces challenges due to the domain shift between synth... 详细信息
来源: 评论
Rescoring sequence-to-sequence Models for Text Line Recognition with CTC-Prefixes  15th
Rescoring Sequence-to-Sequence Models for Text Line Recognit...
收藏 引用
15th IAPR International Workshop on Document Analysis Systems (DAS)
作者: Wick, Christoph Zollner, Jochen Gruning, Tobias Planet AI GmbH Warnowufer 60 D-18057 Rostock Germany Univ Rostock Computat Intelligence Technol Lab Dept Math D-18051 Rostock Germany
In contrast to Connectionist Temporal Classification (CTC) approaches, sequence-to-sequence (S2S) models for Handwritten Text Recognition (HTR) suffer from errors such as skipped or repeated words which often occur at... 详细信息
来源: 评论
An Overview & Analysis of sequence-to-sequence Emotional Voice Conversion  23
An Overview & Analysis of Sequence-to-Sequence Emotional Voi...
收藏 引用
Interspeech Conference
作者: Yang, Zijiang Jing, Xin Triantafyllopoulos, Andreas Song, Meishu Aslan, Ilhan Schuller, Bjoern W. Univ Augsburg Chair Embedded Intelligence Hlth Care & Wellbeing Augsburg Germany Univ Tokyo Educ Physiol Lab Tokyo Japan Huawei Technol Device Software Lab Munich Res Ctr Munich Germany Imperial Coll London GLAM Grp Language Audio & Mus London England
Emotional voice conversion (EVC) focuses on converting a speech utterance from a source to a target emotion;it can thus be a key enabling technology for human-computer interaction applications and beyond. However, EVC... 详细信息
来源: 评论
UnitNet: A sequence-to-sequence Acoustic Model for Concatenative Speech Synthesis
收藏 引用
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING 2021年 29卷 2643-2655页
作者: Zhou, Xiao Ling, Zhen-Hua Dai, Li-Rong Univ Sci & Technol China Natl Engn Lab Speech & Language Informat Proc Hefei 230027 Peoples R China
This paper presents UnitNet, a sequence-to-sequence (Seq2Seq) acoustic model for concatenative speech synthesis. Comparing with the Tacotron2 model for Seq2Seq speech synthesis, UnitNet utilizes the phone boundaries o... 详细信息
来源: 评论
SSS-AE: Anomaly Detection Using Self-Attention Based sequence-to-sequence Auto-Encoder in SMD Assembly Machine Sound
收藏 引用
IEEE ACCESS 2021年 9卷 131191-131202页
作者: Nam, Ki Hyun Song, Young Jong Yun, Il Dong Hankuk Univ Foreign Studies Dept Comp Engn Yongin 17035 South Korea
A Surface-Mounted Device (SMD) assembly machine continuously assembles various products in real field. Unwanted situations such as assembly failure and device breakdown can occur at any time during the assembly proces... 详细信息
来源: 评论
Multi-Step Prediction of Wind Power Based on Hybrid Model with Improved Variational Mode Decomposition and sequence-to-sequence Network
收藏 引用
PROCESSES 2024年 第1期12卷 191页
作者: Bai, Wangwang Jin, Mengxue Li, Wanwei Zhao, Juan Feng, Bin Xie, Tuo Li, Siyao Li, Hui Econ & Tech Res Inst State Grid Gansu Power Co Lanzhou 730050 Peoples R China State Grid Changzhi Power Supply Co Changzhi 046011 Peoples R China Northwest Power Design Inst Co Ltd China Power Engn Consultant Grp Xian 710075 Peoples R China Xian Univ Technol Sch Elect Engn Xian 710048 Peoples R China
Due to the complexity of wind power, traditional prediction models are incapable of fully extracting the hidden features of multidimensional strong fluctuation data, which results in poor multi-step prediction perform... 详细信息
来源: 评论
AN INVESTIGATION OF STREAMING NON-AUTOREGRESSIVE sequence-to-sequence VOICE CONVERSION  47
AN INVESTIGATION OF STREAMING NON-AUTOREGRESSIVE SEQUENCE-TO...
收藏 引用
47th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
作者: Hayashi, Tomoki Kobayashi, Kazuhiro Toda, Tomoki TARVO Inc Nagoya Aichi Japan Nagoya Univ Nagoya Aichi Japan
Recent advances in sequence-to-sequence (S2S) models have improved the quality of voice conversion (VC), but it requires the entire sequence to perform inference, which prevents using it in real-time applications. To ... 详细信息
来源: 评论
A Realistic Drum Accompaniment Generator Using sequence-to-sequence Model and MIDI Music Database  30
A Realistic Drum Accompaniment Generator Using Sequence-to-S...
收藏 引用
30th IEEE Signal Processing and Communications Applications Conference (SIU)
作者: Akyuz, Yavuz Batuhan Gumustekin, Sevket Izmir Yuksek Teknol Enstitusu Elekt Elekt Muhendisligi TR-35430 Urla Izmir Turkiye
In this work, artificial intelligence reinterpretation and/or addition of drum parts for musical pieces supplied in Musical Instruments Digital Interface (MIDI) format, have been carried out. To achieve this, sequence... 详细信息
来源: 评论
A Hierarchical sequence-to-sequence Model for Korean POS Tagging
收藏 引用
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING 2021年 第2期20卷 1–13页
作者: Jin, Guozhe Yu, Zhezhou Jilin Univ Coll Comp Sci & Technol Qianjin St 2699 Changchun Jilin Peoples R China
Part-of-speech (POS) tagging is a fundamental task in natural language processing. Korean POS tagging consists of two subtasks: morphological analysis and POS tagging. In recent years, scholars have tended to use the ... 详细信息
来源: 评论
Pretraining Techniques for sequence-to-sequence Voice Conversion
收藏 引用
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING 2021年 29卷 745-755页
作者: Huang, Wen-Chin Hayashi, Tomoki Wu, Yi-Chiao Kameoka, Hirokazu Toda, Tomoki Nagoya Univ Grad Sch Informat Nagoya Aichi 4648601 Japan Nagoya Univ Human Dataware Lab Co Ltd Nagoya Aichi 4648601 Japan Nagoya Univ Grad Sch Informat Sci Nagoya Aichi 4648601 Japan NTT Corp NTT Commun Sci Labs Atsugi Kanagawa 2430198 Japan Nagoya Univ Informat Technol Ctr Nagoya Aichi 4648601 Japan
sequence-to-sequence (seq2seq) voice conversion (VC) models are attractive owing to their ability to convert prosody. Nonetheless, without sufficient data, seq2seq VC models can suffer from unstable training and mispr... 详细信息
来源: 评论