Search query: "Institution=Center for Language and Speech Processing and Computer Science"
828 records; showing 171-180

Development of Rule-Based Chunker for Sindhi
1st International Conference on Computation of Artificial Intelligence and Machine Learning, ICCAIML 2024
Authors: Arora, Palak; Nathani, Bharti; Joshi, Nisheeth; Katyayan, Pragya
Affiliations: Speech and Language Processing Lab Center for Artificial Intelligence Banasthali Vidyapith Rajasthan Radha Kishnpura India; Department of Computer Science Banasthali Vidyapith Rajasthan Radha Kishnpura India; Department of Mathematics and Statistics Banasthali Vidyapith Rajasthan Radha Kishnpura India
Language is a primary means of communication. It is a medium through which we can interact with society. Recognizing it, each language has its own set of grammatical rules. This study focused on the development of a r...

Augmenting Context Representation with Triggers Knowledge for Relation Extraction
12th IFIP TC 12 International Conference on Intelligent Information Processing, IIP 2022
Authors: Li, En; Shi, Shumin; Yang, Zhikun; Huang, He Yan
Affiliations: School of Computer Science and Technology Beijing Institute of Technology Beijing China; Beijing Engineering Research Center of High Volume Language Information Processing and Cloud Computing Applications Beijing China
Relation Extraction (RE) requires the model to classify the correct relation from a set of relation candidates given the corresponding sentence and two entities. Recent work mainly studies how to utilize more data or ...

Decoding Speech Categorization using Microstate Cortical EEG Signals and Machine Learning
2024 IEEE Signal Processing in Medicine and Biology Symposium, SPMB 2024
Authors: Mahmud, M.; Hasan, M.; Yeasin, M.; Bidelman, G.
Affiliations: Univ. of Tennessee Health Science Center Div. of General Internal Medicine Memphis, TN United States; Middle Tennessee State University Computational and Data Science Murfreesboro, TN United States; University of Memphis Department of Electrical and Computer Engineering TN United States; Indiana University Language and Hearing Sciences Department of Speech Bloomington, IN United States
Categorical perception (CP) is a perceptual phenomenon that refers to the tendency of humans to group speech sounds into discrete units. In this work, we used cortical event-related potential (ERP) signals, recorded d...

MAT-SED: A Masked Audio Transformer with Masked-Reconstruction Based Pre-training for Sound Event Detection
arXiv, 2024
Authors: Cai, Pengfei; Song, Yan; Li, Kang; Song, Haoyu; McLoughlin, Ian
Affiliations: National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China China; ICT Cluster Singapore Institute of Technology Singapore; The Australian National University Australia
Sound event detection (SED) methods that leverage a large pre-trained Transformer encoder network have shown promising performance in recent DCASE challenges. However, they still rely on an RNN-based context network t...

Greek Sign Language Recognition for the SL-ReDu Learning Platform
7th Workshop on Sign Language Translation and Avatar Technology: The Junction of the Visual and the Textual Challenges and Perspectives, SLTAT 2022
Authors: Papadimitriou, Katerina; Potamianos, Gerasimos; Sapountzaki, Galini; Goulas, Theodor; Efthimiou, Eleni; Fotinea, Stavroula-Evita; Maragos, Petros
Affiliations: Department of Electrical & Computer Engineering University of Thessaly Volos Greece; Department of Special Education University of Thessaly Volos Greece; Institute for Language & Speech Processing Athena Research & Innovation Center Athens Greece; School of Electrical & Computer Engineering National Technical University of Athens Greece
There has been increasing interest lately in developing education tools for sign language (SL) learning that enable self-assessment and objective evaluation of learners' SL productions, assisting both students and...

Quality-Aware End-to-End Audio-Visual Neural Speaker Diarization
arXiv, 2024
Authors: He, Mao-Kui; Du, Jun; Niu, Shu-Tong; Liu, Qing-Feng; Lee, Chin-Hui
Affiliations: National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Anhui Hefei China; IFlytek Hefei Anhui China; School of Electrical and Computer Engineering Georgia Institute of Technology Atlanta, GA United States
In this paper, we propose a quality-aware end-to-end audio-visual neural speaker diarization framework, which comprises three key techniques. First, our audio-visual model takes both audio and visual features as input...

Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German Varieties
arXiv, 2024
Authors: Artemova, Ekaterina; Blaschke, Verena; Plank, Barbara
Affiliations: MaiNLP Center for Information and Language Processing LMU Munich Germany; Munich Germany; Department of Computer Science IT University of Copenhagen Denmark; Toloka.AI
Mainstream cross-lingual task-oriented dialogue (ToD) systems leverage the transfer learning paradigm by training a joint model for intent recognition and slot-filling in English and applying it, zero-shot, to other l...

DSPGAN: A GAN-Based Universal Vocoder for High-Fidelity TTS by Time-Frequency Domain Supervision from DSP
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Kun Song; Yongmao Zhang; Yi Lei; Jian Cong; Hanzhao Li; Lei Xie; Gang He; Jinfeng Bai
Affiliations: Audio Speech and Language Processing Group (ASLP@NPU) School of Computer Science Northwestern Polytechnical University Xi’an China; TAL Education Group Beijing China
Recent development of neural vocoders based on the generative adversarial neural network (GAN) has shown obvious advantages of generating raw waveform conditioned on mel-spectrogram with fast inference speed and light...

Zero-Shot Emotion Transfer for Cross-Lingual Speech Synthesis
IEEE Workshop on Automatic Speech Recognition and Understanding
Authors: Yuke Li; Xinfa Zhu; Yi Lei; Hai Li; Junhui Liu; Danming Xie; Lei Xie
Affiliations: Audio Speech and Language Processing Group (ASLP@NPU) School of Computer Science Northwestern Polytechnical University Xi’an China; iQIYI Inc. Chengdu China
Zero-shot emotion transfer in cross-lingual speech synthesis aims to transfer emotion from an arbitrary speech reference in the source language to the synthetic speech in the target language. Building such a system fa...

To Know or Not To Know? Analyzing Self-Consistency of Large Language Models under Ambiguity
arXiv, 2024
Authors: Sedova, Anastasiia; Litschko, Robert; Frassinelli, Diego; Roth, Benjamin; Plank, Barbara
Affiliations: Faculty of Computer Science UniVie Doctoral School Computer Science Austria; Faculty of Philological and Cultural Studies University of Vienna Austria; MaiNLP Center for Information and Language Processing LMU Munich Germany; Germany
One of the major aspects contributing to the striking performance of large language models (LLMs) is the vast amount of factual knowledge accumulated during pre-training. Yet, many LLMs suffer from self-inconsistency,...