咨询与建议

限定检索结果

文献类型

  • 267 篇 会议
  • 155 篇 期刊文献

馆藏范围

  • 422 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 282 篇 工学
    • 184 篇 计算机科学与技术...
    • 164 篇 软件工程
    • 111 篇 信息与通信工程
    • 28 篇 生物工程
    • 27 篇 电子科学与技术(可...
    • 24 篇 电气工程
    • 23 篇 控制科学与工程
    • 21 篇 仪器科学与技术
    • 19 篇 化学工程与技术
    • 11 篇 机械工程
    • 8 篇 生物医学工程(可授...
    • 6 篇 光学工程
    • 5 篇 建筑学
    • 4 篇 土木工程
    • 3 篇 材料科学与工程(可...
  • 176 篇 理学
    • 137 篇 物理学
    • 56 篇 数学
    • 31 篇 生物学
    • 19 篇 化学
    • 16 篇 统计学(可授理学、...
    • 8 篇 系统科学
  • 44 篇 管理学
    • 37 篇 图书情报与档案管...
    • 7 篇 管理科学与工程(可...
  • 11 篇 法学
    • 11 篇 社会学
  • 8 篇 医学
    • 7 篇 临床医学
    • 6 篇 基础医学(可授医学...
    • 5 篇 药学(可授医学、理...
  • 7 篇 文学
    • 6 篇 中国语言文学
    • 5 篇 外国语言文学
  • 4 篇 教育学
    • 4 篇 教育学
  • 3 篇 农学
  • 2 篇 艺术学

主题

  • 59 篇 speech recogniti...
  • 52 篇 training
  • 33 篇 acoustics
  • 31 篇 speech
  • 20 篇 speech processin...
  • 19 篇 feature extracti...
  • 18 篇 hidden markov mo...
  • 18 篇 signal processin...
  • 16 篇 computational mo...
  • 15 篇 conferences
  • 14 篇 speech enhanceme...
  • 13 篇 predictive model...
  • 13 篇 decoding
  • 12 篇 machine translat...
  • 11 篇 speech synthesis
  • 10 篇 training data
  • 10 篇 neural networks
  • 10 篇 data models
  • 9 篇 transformers
  • 9 篇 self-supervised ...

机构

  • 71 篇 national enginee...
  • 51 篇 human language t...
  • 46 篇 center for langu...
  • 31 篇 human language t...
  • 21 篇 center for langu...
  • 21 篇 center for langu...
  • 13 篇 center for langu...
  • 11 篇 iflytek research
  • 10 篇 center for langu...
  • 9 篇 ict cluster sing...
  • 9 篇 human language t...
  • 8 篇 national enginee...
  • 8 篇 center for langu...
  • 8 篇 human language t...
  • 7 篇 center for langu...
  • 7 篇 human language t...
  • 7 篇 university of sc...
  • 7 篇 xiaomi corp.
  • 6 篇 university of sc...
  • 6 篇 state key labora...

作者

  • 49 篇 ling zhen-hua
  • 47 篇 khudanpur sanjee...
  • 35 篇 dehak najim
  • 32 篇 ai yang
  • 29 篇 sanjeev khudanpu...
  • 23 篇 zhen-hua ling
  • 23 篇 dredze mark
  • 19 篇 povey daniel
  • 19 篇 yang ai
  • 18 篇 villalba jesús
  • 18 篇 van durme benjam...
  • 18 篇 daniel povey
  • 17 篇 post matt
  • 16 篇 hermansky hynek
  • 16 篇 lu ye-xin
  • 15 篇 zelasko piotr
  • 14 篇 du hui-peng
  • 13 篇 raj desh
  • 13 篇 gu jia-chen
  • 13 篇 watanabe shinji

语言

  • 344 篇 英文
  • 78 篇 其他
  • 2 篇 中文
检索条件"机构=Center for Language and Speech Processing & Human Language Technology"
422 条 记 录,以下是171-180 订阅
排序:
Enabling Local Editing in Diffusion Models by Joint and Individual Component Analysis
arXiv
收藏 引用
arXiv 2024年
作者: Kouzelis, Theodoros Plitsis, Manos Nicolaou, Mihalis A. Panagakis, Yannis National Technical University of Athens Athens Greece Archimedes AI Athena RC Athens Greece Institute for Language and Speech Athena RC Processing Athens Greece Department of Informatics and Telecommunications National and Kapodistrian University of Athens Athens Greece Computation-based Science and Technology Research Center The Cyprus Institute Nicosia Cyprus
Recent advances in Diffusion Models (DMs) have led to significant progress in visual synthesis and editing tasks, establishing them as a strong competitor to Generative Adversarial Networks (GANs). However, the latent... 详细信息
来源: 评论
An Asynchronous WFST-Based Decoder for Automatic speech Recognition
An Asynchronous WFST-Based Decoder for Automatic Speech Reco...
收藏 引用
IEEE International Conference on Acoustics, speech and Signal processing
作者: Hang Lv Zhehuai Chen Hainan Xu Daniel Povey Lei Xie Sanjeev Khudanpur Audio Speech and Language Processing Lab (ASLP@NPU) School of Computer Science Northwestern Polytechnical University Xi’an China Center of Language and Speech Processing Johns Hopkins University Baltimore MD USA Shanghai Jiao Tong University Xiaomi Corporation Beijing China Human Language Technology Center of Excellence Johns Hopkins University Baltimore MD USA
We introduce asynchronous dynamic decoder, which adopts an efficient A~* algorithm to incorporate big language models in the one-pass decoding for large vocabulary continuous speech recognition. Unlike standard one-pa... 详细信息
来源: 评论
Unsupervised acoustic unit discovery by leveraging a language-independent subword discriminative feature representation
arXiv
收藏 引用
arXiv 2021年
作者: Feng, Siyuan Zelasko, Piotr Moro-Velázquez, Laureano Scharenborg, Odette Multimedia Computing Group Delft University of Technology Delft Netherlands Center for Language and Speech Processing Johns Hopkins University BaltimoreMD United States Human Language Technology Center of Excellence Johns Hopkins University BaltimoreMD United States
This paper tackles automatically discovering phone-like acoustic units (AUD) from unlabeled speech data. Past studies usually proposed single-step approaches. We propose a two-stage approach: the first stage learns a ... 详细信息
来源: 评论
Unsupervised Domain-Adaptive Semantic Segmentation for Surgical Instruments Leveraging Dropout-Enhanced Dual Heads and Coarse-Grained Classification Branch
IEEE Transactions on Medical Robotics and Bionics
收藏 引用
IEEE Transactions on Medical Robotics and Bionics 2025年
作者: Li, Ziqian Wang, Zhengyu Xu, Xinzhou Chen, Yongfa Schuller, Bjorn W. Hefei University of Technology School of Mechanical Engineering Hefei China Nanjing University of Posts and Telecommunications School of Internet of Things Nanjing China Graz University of Technology Signal Processing and Speech Communication Laboratory Graz Austria Chair of Health Informatics Munich Germany Munich Data Science Institute Munich Germany Munich Center for Machine Learning Munich Germany Imperial College London GLAM – the Group on Language Audio and Music London United Kingdom
Accurate semantic segmentation for surgical instruments is crucial in robot-assisted minimally invasive surgery, mainly regarded as a core module in surgical-instrument tracking and operation guidance. Nevertheless, i... 详细信息
来源: 评论
A parallelizable lattice rescoring strategy with neural language models
arXiv
收藏 引用
arXiv 2021年
作者: Li, Ke Povey, Daniel Khudanpur, Sanjeev Center for Language and Speech Processing The Johns Hopkins University BaltimoreMD21218 United States Human Language Technology Center of Excellence The Johns Hopkins University BaltimoreMD21218 United States Xiaomi Corp. Beijing China
This paper proposes a parallel computation strategy and a posterior-based lattice expansion algorithm for efficient lattice rescoring with neural language models (LMs) for automatic speech recognition. First, lattices... 详细信息
来源: 评论
Sources of Transfer in Multilingual Named Entity Recognition
arXiv
收藏 引用
arXiv 2020年
作者: Mueller, David Andrews, Nicholas Dredze, Mark Center for Language and Speech Processing Johns Hopkins University Human Language Technology Center of Excellence Johns Hopkins University
Named-entities are inherently multilingual, and annotations in any given language may be limited. This motivates us to consider polyglot named-entity recognition (NER), where one model is trained using annotated data ...
来源: 评论
Modelling Collocations in OntoLex-FrAC
Modelling Collocations in OntoLex-FrAC
收藏 引用
2022 Globalex Workshop on Linked Lexicography, GWLL 2022
作者: Chiarcos, Christian Gkirtzou, Katerina Ionov, Maxim Kabashi, Besim Khan, Anas Fahad Truică, Ciprian-Octavian Applied Computational Linguistics Goethe University Frankfurt Frankfurt am Main Germany Institute for Digital Humanities University of Cologne Germany Institute of Language and Speech Processing Athena Research Center Athens Greece Computational and Corpus Linguistics Friedrich-Alexander University of Erlangen-Nuremberg Germany Istituto di Linguistica Computazionale A. Zampolli Consiglio Nazionale delle Ricerche Italy Department of Information Technology Uppsala University Sweden
Following presentations of frequency and attestations, and embeddings and distributional similarity, this paper introduces the third cornerstone of the emerging OntoLex module for Frequency, Attestation and Corpus-bas... 详细信息
来源: 评论
DRAWING ORDER RECOVERY FOR HANDWRITING CHINESE CHARACTERS  44
DRAWING ORDER RECOVERY FOR HANDWRITING CHINESE CHARACTERS
收藏 引用
44th IEEE International Conference on Acoustics, speech and Signal processing (ICASSP)
作者: Zhao, Bocheng Yang, Minghao Tao, Jianhua Center for Language and Speech Processing The Johns Hopkins University Baltimore USA Human Language Technology Center of Excellence The Johns Hopkins University Baltimore USA
Recover drawing orders from a Chinese handwriting image is a challenge issue. Most of English drawing order recovery( DOR) methods perform unsatisfactorily in Chinese. This paper proposes a novel image-to-sequence alg... 详细信息
来源: 评论
Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery
arXiv
收藏 引用
arXiv 2020年
作者: Bhati, Saurabhchand Villalba, Jesús Żelasko, Piotr Dehak, Najim Center for Language and Speech Processing Human Language Technology Center of Excellence Johns Hopkins University BaltimoreMD United States
Unsupervised spoken term discovery consists of two tasks: finding the acoustic segment boundaries and labeling acoustically similar segments with the same labels. We perform segmentation based on the assumption that t... 详细信息
来源: 评论
Single channel far field feature enhancement for speaker verification in the wild
arXiv
收藏 引用
arXiv 2020年
作者: Nidadavolu, Phani Sankar Kataria, Saurabh Perera, Paola Garcia Villalba, Jesus Dehak, Najim Center for Language and Speech Processing Human Language Technology Center of Excellence Johns Hopkins University BaltimoreMD United States
We investigated an enhancement and a domain adaptation approach to make speaker verification systems robust to perturbations of far-field speech. In the enhancement approach, using paired (parallel) reverberant-clean ... 详细信息
来源: 评论