Search Results - Inner Mongolia University Library


Search query: "Institution=Human Language Technology and Pattern Recognition Group RWTH Aachen University Aachen"

354 records in total; records 1-10 are shown below.


Document-Level Language Models for Machine Translation

8th Conference on Machine Translation, WMT 2023

Authors: Petrick, Frithjof; Herold, Christian; Petrushkov, Pavel; Khadivi, Shahram; Ney, Hermann
Affiliations: eBay Inc., Aachen, Germany; Human Language Technology and Pattern Recognition Group, RWTH Aachen University, Aachen, Germany

ISBN: (print) 9798891760417

Despite the known limitations, most machine translation systems today still operate on the sentence level. One reason for this is that most parallel training data is only sentence-level aligned, without document-level meta information available. In this work, we set out to build context-aware translation systems utilizing document-level monolingual data instead. This can be achieved by combining any existing sentence-level translation model with a document-level language model. We improve existing approaches by leveraging recent advancements in model combination. Additionally, we propose novel weighting techniques that make the system combination more flexible and significantly reduce computational overhead. In a comprehensive evaluation on four diverse translation tasks, we show that our extensions improve document-targeted scores substantially and are also computationally more efficient. However, we also find that in most scenarios, back-translation gives even better results, at the cost of having to re-train the translation system. Finally, we explore language model fusion in the light of recent advancements in large language models. Our findings suggest that there might be strong potential in utilizing large language models via model combination. © 2023 Association for Computational Linguistics.

Keywords: Machine translation
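The abstract above describes combining a sentence-level translation model with a document-level language model. As a rough illustration only, the sketch below shows the simplest form of such a combination: a log-linear sum of the two log-probabilities with a fixed weight. It is not the weighting scheme proposed in the paper; the function names, the weight value, and the toy probabilities are all hypothetical.

```python
import math
from typing import List, Tuple

# One candidate next token: (token, translation-model log-prob, language-model log-prob).
Candidate = Tuple[str, float, float]


def fused_score(tm_logprob: float, lm_logprob: float, lm_weight: float) -> float:
    """Log-linear combination of a translation-model and a language-model score."""
    return tm_logprob + lm_weight * lm_logprob


def pick_best(candidates: List[Candidate], lm_weight: float) -> str:
    """Return the token with the highest fused score."""
    best = max(candidates, key=lambda c: fused_score(c[1], c[2], lm_weight))
    return best[0]


# Toy example (made-up numbers): the sentence-level model slightly prefers "he",
# while the document-level language model, which sees earlier context, prefers "she".
candidates = [
    ("he", math.log(0.5), math.log(0.1)),
    ("she", math.log(0.4), math.log(0.8)),
]

without_lm = pick_best(candidates, lm_weight=0.0)
with_lm = pick_best(candidates, lm_weight=0.3)
```

With the weight set to zero the choice falls back to the translation model alone; a positive weight lets document context change the decision.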


Comparison of Different Neural Network Architectures for Spoken Language Identification

15th ITG Conference on Speech Communication

Authors: Bazazo, Tala; Zeineldeen, Mohammad; Plahl, Christian; Schlüter, Ralf; Ney, Hermann
Affiliations: Human Language Technology and Pattern Recognition, RWTH Aachen University, Germany; eBay, Aachen, Germany

ISBN: (print) 9783800761654

This paper compares different neural-network-based architectures on the spoken language identification task. To the best of our knowledge, such a comparison of different models on the same dataset and the same set of languages does not yet exist. We incorporate 7 different models, which include the latest architectures: a spectral-image-based ResNet model, a Convolutional Neural Network, a Bi-directional Long Short-Term Memory, a Convolutional Recurrent Neural Network, Wav2Vec 2.0, a transformer and a conformer. We also tackle audio with background noise and music by training on data with similar acoustics. Finally, we show that our models generalize well on third-party data. © VDE VERLAG GMBH Berlin Offenbach.

Keywords: Recurrent neural networks


Improving Long Context Document-Level Machine Translation

4th Workshop on Computational Approaches to Discourse, CODI 2023

Authors: Herold, Christian; Ney, Hermann
Affiliations: Human Language Technology and Pattern Recognition Group, Computer Science Department, RWTH Aachen University, Aachen D-52056, Germany

ISBN: (print) 9781959429890

Document-level context for neural machine translation (NMT) is crucial to improve the translation consistency and cohesion, the translation of ambiguous inputs, as well as several other linguistic phenomena. Many works have been published on the topic of document-level NMT, but most restrict the system to only local context, typically including just the one or two preceding sentences as additional information. This might be enough to resolve some ambiguous inputs, but it is probably not sufficient to capture some document-level information like the topic or style of a conversation. When increasing the context size beyond just the local context, there are two challenges: (i) the memory usage increases exponentially, and (ii) the translation performance starts to degrade. We argue that the widely-used attention mechanism is responsible for both issues. Therefore, we propose a constrained attention variant that focuses the attention on the most relevant parts of the sequence, while simultaneously reducing the memory consumption. For evaluation, we utilize targeted test sets in combination with novel evaluation techniques to analyze the translations with regard to specific discourse-related phenomena. We find that our approach is a good compromise between sentence-level NMT and attending to the full context, especially in low-resource scenarios. © 2023 Association for Computational Linguistics.

Keywords: Neural machine translation


Enhancing and Adversarial: Improve ASR with Speaker Labels

48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023

Authors: Zhou, Wei; Wu, Haotian; Xu, Jingjing; Zeineldeen, Mohammad; Lüscher, Christoph; Schlüter, Ralf; Ney, Hermann
Affiliations: RWTH Aachen University, Human Language Technology and Pattern Recognition, Computer Science Department, Aachen 52074, Germany; AppTek GmbH, Aachen 52062, Germany

ISBN: (print) 9781728163277

ASR can be improved by multi-task learning (MTL) with domain enhancing or domain adversarial training, which are two opposite objectives with the aim to increase/decrease domain variance towards domain-aware/agnostic ASR, respectively. In this work, we study how to best apply these two opposite objectives with speaker labels to improve conformer-based ASR. We also propose a novel adaptive gradient reversal layer for stable and effective adversarial training without tuning effort. Detailed analysis and experimental verification are conducted to show the optimal positions in the ASR neural network (NN) to apply speaker enhancing and adversarial training. We also explore their combination for further improvement, achieving the same performance as i-vectors plus adversarial training. Our best speaker-based MTL achieves 7% relative improvement on the Switchboard Hub5'00 set. We also investigate the effect of such speaker-based MTL w.r.t. cleaner dataset and weaker ASR NN. © 2023 IEEE.

Keywords: Linearization


Lattice-Free Sequence Discriminative Training for Phoneme-Based Neural Transducers

48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023

Authors: Yang, Zijian; Zhou, Wei; Schlüter, Ralf; Ney, Hermann
Affiliations: RWTH Aachen University, Human Language Technology and Pattern Recognition, Computer Science Department, Aachen 52074, Germany; AppTek GmbH, Aachen 52062, Germany

ISBN: (print) 9781728163277

Recently, RNN-Transducers have achieved remarkable results on various automatic speech recognition tasks. However, lattice-free sequence discriminative training methods, which obtain superior performance in hybrid models, are rarely investigated in RNN-Transducers. In this work, we propose three lattice-free training objectives, namely lattice-free maximum mutual information, lattice-free segment-level minimum Bayes risk, and lattice-free minimum Bayes risk, which are used for the final posterior output of the phoneme-based neural transducer with a limited context dependency. Compared to criteria using N-best lists, lattice-free methods eliminate the decoding step for hypotheses generation during training, which leads to more efficient training. Experimental results show that lattice-free methods gain up to 6.5% relative improvement in word error rate compared to a sequence-level cross-entropy trained model. Compared to the N-best-list based minimum Bayes risk objectives, lattice-free methods gain 40% - 70% relative training time speedup with a small degradation in performance. © 2023 IEEE.

Keywords: neural transducer; sequence discriminative training; speech recognition


Prompting and Fine-Tuning of Small LLMs for Length-Controllable Telephone Call Summarization

2nd International Conference on Foundation and Large Language Models, FLLM 2024

Authors: Thulke, David; Gao, Yingbo; Jalota, Rricha; Dugast, Christian; Ney, Hermann
Affiliations: AppTek GmbH, Aachen, Germany; RWTH Aachen University, Machine Learning and Human Language Technology Group, Germany

ISBN: (print) 9798350354799

This paper explores the rapid development of a telephone call summarization system utilizing large language models (LLMs). Our approach involves initial experiments with prompting existing LLMs to generate summaries of telephone conversations, followed by the creation of a tailored synthetic training dataset utilizing stronger frontier models. We place special focus on the diversity of the generated data and on the ability to control the length of the generated summaries to meet various use-case-specific requirements. The effectiveness of our method is evaluated using two state-of-the-art LLM-as-a-judge-based evaluation techniques to ensure the quality and relevance of the summaries. Our results show that a fine-tuned Llama-2-7B-based summarization model performs on par with GPT-4 in terms of factual accuracy, completeness, and conciseness. Our findings demonstrate the potential for quickly bootstrapping a practical and efficient call summarization system. © 2024 IEEE.

Keywords: Modeling languages


Robust Knowledge Distillation from RNN-T Models with Noisy Training Labels Using Full-Sum Loss

48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023

Authors: Zeineldeen, Mohammad; Audhkhasi, Kartik; Baskar, Murali Karthick; Ramabhadran, Bhuvana
Affiliations: RWTH Aachen University, Human Language Technology and Pattern Recognition, Computer Science Department, Aachen 52074, Germany; Google LLC, New York, United States

ISBN: (print) 9781728163277

This work studies knowledge distillation (KD) and addresses its constraints for recurrent neural network transducer (RNN-T) models. In hard distillation, a teacher model transcribes large amounts of unlabelled speech to train a student model. Soft distillation is another popular KD method that distills the output logits of the teacher model. Due to the nature of RNN-T alignments, applying soft distillation between RNN-T architectures having different posterior distributions is challenging. In addition, bad teachers having a high word error rate (WER) reduce the efficacy of KD. We investigate how to effectively distill knowledge from variable-quality ASR teachers, which has not been studied before to the best of our knowledge. We show that a sequence-level KD, full-sum distillation, outperforms other distillation methods for RNN-T models, especially for bad teachers. We also propose a variant of full-sum distillation that distills the sequence discriminative knowledge of the teacher, leading to further improvement in WER. We conduct experiments on public datasets, namely SpeechStew and LibriSpeech, and on in-house production data. © 2023 IEEE.

Keywords: Recurrent neural networks


Revisiting Checkpoint Averaging for Neural Machine Translation

2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, AACL-IJCNLP 2022

Authors: Gao, Yingbo; Herold, Christian; Yang, Zijian; Ney, Hermann
Affiliations: Human Language Technology and Pattern Recognition Group, Computer Science Department, RWTH Aachen University, Aachen D-52056, Germany

ISBN: (print) 9781959429043

Checkpoint averaging is a simple and effective method to boost the performance of converged neural machine translation models. The calculation is cheap to perform and the fact that the translation improvement almost comes for free, makes it widely adopted in neural machine translation research. Despite the popularity, the method itself simply takes the mean of the model parameters from several checkpoints, the selection of which is mostly based on empirical recipes without many justifications. In this work, we revisit the concept of checkpoint averaging and consider several extensions. Specifically, we experiment with ideas such as using different checkpoint selection strategies, calculating weighted average instead of simple mean, making use of gradient information and fine-tuning the interpolation weights on development data. Our results confirm the necessity of applying checkpoint averaging for optimal performance, but also suggest that the landscape between the converged checkpoints is rather flat and not much further improvement compared to simple averaging is to be obtained. © AACL-IJCNLP *** rights reserved

Keywords: Neural machine translation
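The abstract above describes checkpoint averaging as taking the mean of the model parameters from several checkpoints, with a weighted average as one of the extensions considered. The following is a minimal sketch of both, assuming each checkpoint is a plain dictionary mapping a parameter name to a flat list of floats. That representation and the function name are illustrative assumptions, not the paper's code; a real system would operate on framework tensors.

```python
from typing import Dict, List, Optional, Sequence

# Hypothetical checkpoint layout: parameter name -> flat list of values.
Checkpoint = Dict[str, List[float]]


def average_checkpoints(
    checkpoints: Sequence[Checkpoint],
    weights: Optional[Sequence[float]] = None,
) -> Checkpoint:
    """Average parameters element-wise; uniform weights give the simple mean."""
    if not checkpoints:
        raise ValueError("need at least one checkpoint")
    if weights is None:
        weights = [1.0] * len(checkpoints)
    if len(weights) != len(checkpoints):
        raise ValueError("one weight per checkpoint is required")
    total = sum(weights)
    if total == 0:
        raise ValueError("weights must not sum to zero")

    averaged: Checkpoint = {}
    for name, values in checkpoints[0].items():
        averaged[name] = [
            sum(w * ckpt[name][i] for w, ckpt in zip(weights, checkpoints)) / total
            for i in range(len(values))
        ]
    return averaged


# Toy checkpoints with a single two-element parameter.
ckpts = [{"w": [1.0, 2.0]}, {"w": [3.0, 4.0]}]
simple_mean = average_checkpoints(ckpts)
weighted = average_checkpoints(ckpts, weights=[1.0, 3.0])
```

Normalizing by the sum of the weights keeps the result a convex combination of the checkpoints when all weights are non-negative.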


Does Joint Training Really Help Cascaded Speech Translation?

2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022

Authors: Tran, Viet Anh Khoa; Thulke, David; Gao, Yingbo; Herold, Christian; Ney, Hermann
Affiliations: Human Language Technology and Pattern Recognition Group, Computer Science Department, RWTH Aachen University, Aachen D-52056, Germany

Currently, in speech translation, the straightforward approach - cascading a recognition system with a translation system - delivers state-of-the-art results. However, fundamental challenges such as error propagation from the automatic speech recognition system still remain. To mitigate these problems, recently, people turn their attention to direct data and propose various joint training methods. In this work, we seek to answer the question of whether joint training really helps cascaded speech translation. We review recent papers on the topic and also investigate a joint training criterion by marginalizing the transcription posterior probabilities. Our findings show that a strong cascaded baseline can diminish any improvements obtained using joint training, and we suggest alternatives to joint training. We hope this work can serve as a refresher of the current speech translation landscape, and motivate research in finding more efficient and creative ways to utilize the direct data for speech translation. © 2022 Association for Computational Linguistics.

Keywords: Speech recognition
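The abstract above mentions a joint training criterion obtained by marginalizing the transcription posterior probabilities. The abstract does not say how that sum is computed, so the sketch below makes an assumption: the sum over transcripts is approximated with a short list of recognizer hypotheses, each carrying a recognition log-probability and a translation log-probability. The function name and the toy numbers are hypothetical.

```python
import math
from typing import Sequence, Tuple

# One transcript hypothesis f: (log p(f | audio), log p(translation | f)).
Hypothesis = Tuple[float, float]


def marginal_log_prob(hypotheses: Sequence[Hypothesis]) -> float:
    """Return log of sum_f p(f | audio) * p(translation | f) over the given list."""
    if not hypotheses:
        raise ValueError("need at least one transcript hypothesis")
    joint = [asr_logprob + mt_logprob for asr_logprob, mt_logprob in hypotheses]
    peak = max(joint)  # subtract the maximum before exponentiating for stability
    return peak + math.log(sum(math.exp(value - peak) for value in joint))


# Toy two-hypothesis list (made-up numbers): 0.6 * 0.5 + 0.4 * 0.25 = 0.4.
n_best = [
    (math.log(0.6), math.log(0.5)),
    (math.log(0.4), math.log(0.25)),
]
log_p = marginal_log_prob(n_best)
```

A cascaded system that passes on only the single best transcript keeps just one term of this sum.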


Right Label Context in End-to-End Training of Time-Synchronous ASR Models

2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025

Authors: Raissi, Tina; Schlüter, Ralf; Ney, Hermann
Affiliations: Machine Learning and Human Language Technology Group, RWTH Aachen University, Germany; AppTek GmbH, Germany

ISBN: (print) 9798350368741

Current time-synchronous sequence-to-sequence automatic speech recognition (ASR) models are trained by using sequence level cross-entropy that sums over all alignments. Due to the discriminative formulation, incorporating the right label context into the training criterion's gradient causes normalization problems and is not mathematically well-defined. The classic hybrid neural network hidden Markov model (NN-HMM) with its inherent generative formulation enables conditioning on the right label context. However, due to the HMM state-tying the identity of the right label context is never modeled explicitly. In this work, we propose a factored loss with auxiliary left and right label contexts that sums over all alignments. We show that the inclusion of the right label context is particularly beneficial when training data resources are limited. Moreover, we also show that it is possible to build a factored hybrid HMM system by relying exclusively on the full-sum criterion. Experiments were conducted on Switchboard 300h and LibriSpeech 960h. © 2025 IEEE.

Keywords: CTC end-to-end factored hybrid HMM full-sum HMM

