Search query: Institution = "Center for Language and Speech Processing and Computer Science"
831 records; showing results 241-250
The ISCSLP 2024 Conversational Voice Clone (CoVoC) Challenge: Tasks, Results and Findings
International Symposium on Chinese Spoken Language Processing
Authors: Kangxiang Xia, Dake Guo, Jixun Yao, Liumeng Xue, Hanzhao Li, Shuai Wang, Zhao Guo, Lei Xie, Qingqing Zhang, Lei Luo, Minghui Dong, Peng Sun
Affiliations: Audio Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University, Xi'an; School of Data Science, The Chinese University of Hong Kong, Shenzhen; Shenzhen Research Institute of Big Data (SRIBD); Magic Data; Institute for Infocomm Research (I2R); China Computer Federation
The ISCSLP 2024 Conversational Voice Clone (CoVoC) Challenge aims to benchmark and advance zero-shot spontaneous style voice cloning, particularly focusing on generating spontaneous behaviors in conversational speech....
Capturing Global Structural Information in Long Document Question Answering with Compressive Graph Selector Network
2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022
Authors: Nie, Yuxiang; Huang, Heyan; Wei, Wei; Mao, Xian-Ling
Affiliations: School of Computer Science and Technology, Beijing Institute of Technology, China; Beijing Engineering Research Center of High Volume Language Information Processing and Cloud Computing Applications, China; Beijing Institute of Technology Southeast Academy of Information Technology, China; Huazhong University of Science and Technology, China
Long document question answering is a challenging task due to its demands for complex reasoning over long text. Previous works usually take long documents as non-structured flat texts or only consider the local struct...
USTC-NELSLIP at SemEval-2023 Task 2: Statistical Construction and Dual Adaptation of Gazetteer for Multilingual Complex NER
arXiv, 2023
Authors: Ma, Jun-Yu; Gu, Jia-Chen; Qi, Jiajun; Ling, Zhen-Hua; Liu, Quan; Zhao, Xiaoyi
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, China; State Key Laboratory of Cognitive Intelligence, iFLYTEK Research, China; Communication University of China, China
This paper describes the system developed by the USTC-NELSLIP team for SemEval-2023 Task 2 Multilingual Complex Named Entity Recognition (MultiCoNER II). A method named Statistical Construction and Dual Adaptation of ...
Alternating Objectives Generates Stronger PGD-Based Adversarial Attacks
arXiv, 2022
Authors: Nikolaos, Antoniou; Georgiou, Efthymios; Potamianos, Alexandros
Affiliations: School of Electrical and Computer Engineering, National Technical University of Athens, Athens, Greece; Institute for Language and Speech Processing, Athena Research Center, Athens, Greece
Designing powerful adversarial attacks is of paramount importance for the evaluation of ℓp-bounded adversarial defenses. Projected Gradient Descent (PGD) is one of the most effective and conceptually simple algorithms ...
End-to-End Voice Conversion with Information Perturbation
International Symposium on Chinese Spoken Language Processing
Authors: Qicong Xie, Shan Yang, Yi Lei, Lei Xie, Dan Su
Affiliations: Audio Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University, Xi’an, China; Tencent AI Lab, China
The ideal goal of voice conversion is to convert the source speaker’s speech to sound naturally like the target speaker while maintaining the linguistic content and the prosody of the source speech. However, current ...
IQDUBBING: PROSODY MODELING BASED ON DISCRETE SELF-SUPERVISED SPEECH REPRESENTATION FOR EXPRESSIVE VOICE CONVERSION
arXiv, 2022
Authors: Gan, Wendong; Wen, Bolong; Yan, Ying; Chen, Haitao; Wang, Zhichao; Du, Hongqiang; Xie, Lei; Guo, Kaixuan; Li, Hai
Affiliations: IQIYI Inc, Chengdu, China; Audio Speech and Language Processing Group, ASLP@NPU, School of Computer Science, Northwestern Polytechnical University, Xi'an, China
Prosody modeling is important, but still challenging in expressive voice conversion. As prosody is difficult to model, and other factors, e.g., speaker, environment and content, which are entangled with prosody in spe...
Multimodal Tree Decoder for Table of Contents Extraction in Document Images
arXiv, 2022
Authors: Hu, Pengfei; Zhang, Zhenrong; Zhang, Jianshu; Du, Jun; Wu, Jiajia
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Anhui, Hefei, China; iFLYTEK Research, China
Table of contents (ToC) extraction aims to extract headings of different levels in documents to better understand the outline of the contents, which can be widely used for document understanding and information retrie...
Thought-Path Contrastive Learning via Premise-Oriented Data Augmentation for Logical Reading Comprehension
arXiv, 2024
Authors: Wang, Chenxu; Jian, Ping; Yang, Zhen
Affiliations: School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China; Beijing Engineering Research Center of High Volume Language Information Processing and Cloud Computing Applications, Beijing Institute of Technology, Beijing, China
Logical reading comprehension is a challenging task that involves understanding the underlying semantics of text and applying reasoning to deduce the correct answer. Prior researches have primarily focused on enhancin...
Anchored Monotonic Alignment and Representation Substitution for Rare Spontaneous Behaviors in Spontaneous Speech Synthesis
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Ning-Qian Wu, Ya-Jun Hu, Liping Chen, Zhen-Hua Ling
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, P.R. China; iFLYTEK Research, iFLYTEK Co. Ltd., China; MoE Key Laboratory of Brain-Inspired Intelligent Perception and Cognition, University of Science and Technology of China
Spontaneous behaviors in speech pose significant challenges for speech synthesis. Existing research has not adequately addressed these behaviors, with most studies relying on specially recorded datasets. In contrast, ...
KnowLogic: A Benchmark for Commonsense Reasoning via Knowledge-Driven Data Synthesis
arXiv, 2025
Authors: Zhan, Weidong; Wang, Yue; Hu, Nan; Xiao, Liming; Ma, Jingyuan; Qin, Yuhang; Li, Zheng; Yang, Yixin; Deng, Sirui; Ding, Jinkun; Ma, Wenhan; Li, Rui; Luo, Weilin; Liu, Qun; Sui, Zhifang
Affiliations: Center for Chinese Linguistics, Department of Chinese Language and Literature, Peking University, China; School of Computer Science, State Key Laboratory of Multimedia Information Processing, Peking University, China; Huawei Noah’s Ark Lab, China
Current evaluations of commonsense reasoning in LLMs are hindered by the scarcity of natural language corpora with structured annotations for reasoning tasks. To address this, we introduce KnowLogic, a benchmark gener...