检索结果-内蒙古大学图书馆

arXiv 2025年

作者： Gimeno-Gómez, David Martínez-Hinarejos, Carlos D. Pattern Recognition and Human Language Technologies Research Center Universitat Politècnica de València Camino de Vera s/n València Comunitat Valenciana46022 Spain

Visual speech recognition remains an open research problem where different challenges must be considered by dispensing with the auditory sense, such as visual ambiguities, the inter-personal variability among speakers, and the complex modeling of silence. Nonetheless, recent remarkable results have been achieved in the field thanks to the availability of large-scale databases and the use of powerful attention mechanisms. Besides, multiple languages apart from English are nowadays a focus of interest. This paper presents noticeable advances in automatic continuous lipreading for Spanish. First, an end-to-end system based on the hybrid CTC/Attention architecture is presented. Experiments are conducted on two corpora of disparate nature, reaching state-of-the-art results that significantly improve the best performance obtained to date for both databases. In addition, a thorough ablation study is carried out, where it is studied how the different components that form the architecture influence the quality of speech recognition. Then, a rigorous error analysis is carried out to investigate the different factors that could affect the learning of the automatic system. Finally, a new Spanish lipreading benchmark is consolidated. Code and trained models are available at https://***/david-gimeno/evaluating-end2end-spanish-lipreading. © 2025, CC BY-NC-ND.

关键词： Continuous speech recognition

来源：评论

学校读者我要写书评

暂无评论

A Deep Learning Approach to Machine Transliteration 4

A Deep Learning Approach to Machine Transliteration

引用

4th Workshop on Statistical Machine Translation, WMT 2009, immediately preceding the 12th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2009

作者： Deselaers, Thomas Hasan, Sǎsa Bender, Oliver Ney, Hermann Human Language Technology And Pattern Recognition Group RWTH Aachen University Germany

In this paper we present a novel transliteration technique which is based on deep belief networks. Common approaches use finite state machines or other methods similar to conventional machine translation. Instead of using conventional NLP techniques, the approach presented here builds on deep belief networks, a technique which was shown to work well for other machine learning problems. We show that deep belief networks have certain properties which are very interesting for transliteration and possibly also for translation and that a combination with conventional techniques leads to an improvement over both components on an Arabic-English transliteration task. ©2009 Association for Computational Linguistics.

关键词： Deep learning

来源：评论

学校读者我要写书评

暂无评论

Comparison of extended lexicon models in search and rescoring for SMT

Comparison of extended lexicon models in search and rescorin...

引用

2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics: human language technologies, NAACL-HLT 2009

作者： Hasan, Saša Ney, Hermann Human Language Technology and Pattern Recognition Group RWTH Aachen University Germany

ISBN: (纸本)9781932432428

We show how the integration of an extended lexicon model into the decoder can improve translation performance. The model is based on lexical triggers that capture long-distance dependencies on the sentence level. The results are compared to variants of the model that are applied in reranking of n-best lists. We present how a combined application of these models in search and rescoring gives promising results. Experiments are reported on the GALE Chinese-English task with improvements of up to +0.9% BLEU and -1.5% TER absolute on a competitive baseline. © 2009 Association for Computational Linguistics

关键词：

来源：评论

学校读者我要写书评

暂无评论

A source-side decoding sequence model for statistical machine translation

A source-side decoding sequence model for statistical machin...

引用

9th Biennial Conference of the Association for Machine Translation in the Americas, AMTA 2010

作者： Feng, Minwei Mauser, Arne Ney, Hermann Human Language Technology and Pattern Recognition Group RWTH Aachen University Germany

We propose a source-side decoding sequence language model for phrase-based statistical machine translation. This model is a reordering model in the sense that it helps the decoder find the correct decoding sequence. The model uses word-aligned bilingual training data. We show improved translation quality of up to 1.34% BLEU and 0.54% TER using this model compared to three other widely used reordering models.

关键词： Computer aided language translation

来源：评论

学校读者我要写书评

暂无评论

Unsupervised training for large vocabulary translation using sparse lexicon andword classes 15

Unsupervised training for large vocabulary translation using...

引用

15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017

作者： Kim, Yunsu Schamper, Julian Ney, Hermann Human Language Technology and Pattern Recognition Group RWTH Aachen University Germany

ISBN: (纸本)9781510838604

We address for the first time unsupervised training for a translation task with hundreds of thousands of vocabulary words. We scale up the expectation-maximization (EM) algorithm to learn a large translation table without any parallel text or seed lexicon. First, we solve the memory bottleneck and enforce the sparsity with a simple thresholding scheme for the lexicon. Second, we initialize the lexicon training with word classes, which efficiently boosts the performance. Our methods produced promising results on two large-scale unsupervised translation tasks. © 2017 Association for Computational Linguistics.

关键词： Maximum principle

来源：评论

学校读者我要写书评

暂无评论

Deciphering foreign language by combining language models and context vectors

Deciphering foreign language by combining language models an...

引用

50th Annual Meeting of the Association for Computational Linguistics, ACL 2012

作者： Nuhn, Malte Mauser, Arne Ney, Hermann Human Language Technology and Pattern Recognition Group RWTH Aachen University Germany

ISBN: (纸本)9781937284244

In this paper we show how to train statistical machine translation systems on reallife tasks using only non-parallel monolingual data from two languages. We present a modification of the method shown in (Ravi and Knight, 2011) that is scalable to vocabulary sizes of several thousand words. On the task shown in (Ravi and Knight, 2011) we obtain better results with only 5% of the computational effort when running our method with an n-gram language model. The efficiency improvement of our method allows us to run experiments with vocabulary sizes of around 5,000 words, such as a non-parallel version of the VERBMOBIL corpus. We also report results using data from the monolingual French and English GIGAWORD corpora. © 2012 Association for computational Linguistics.

关键词： Computational linguistics

来源：评论

学校读者我要写书评

暂无评论

A comparison of update strategies for large-scale maximum expected BLEU training

A comparison of update strategies for large-scale maximum ex...

引用

Conference of the North American Chapter of the Association for Computational Linguistics: human language technologies, NAACL HLT 2015

作者： Wuebker, Joern Muehr, Sebastian Lehnen, Patrick Peitz, Stephan Ney, Hermann Human Language Technology and Pattern Recognition Group RWTH Aachen University Aachen Germany

ISBN: (纸本)9781941643495

This work presents a flexible and efficient discriminative training approach for statistical machine translation. We propose to use the RPROP algorithm for optimizing a maximum expected BLEU objective and experimentally compare it to several other updating schemes. It proves to be more efficient and effective than the previously proposed growth transformation technique and also yields better results than stochastic gradient descent and AdaGrad. We also report strong empirical results on two large scale tasks, namely BOLT Chinese→English and WMT German→English, where our final systems outperform results reported by Setiawan and Zhou (2013) and on ***. On the WMT task, discriminative training is performed on the full training data of 4M sentence pairs, which is unsurpassed in the literature. © 2015 Association for Computational Linguistics.

关键词： Stochastic systems

来源：评论

学校读者我要写书评

暂无评论

When and Why is Unsupervised Neural Machine Translation Useless? 22

When and Why is Unsupervised Neural Machine Translation Usel...

引用

22nd Annual Conference of the European Association for Machine Translation, EAMT 2020

作者： Kim, Yunsu Graça, Miguel Ney, Hermann Human Language Technology and Pattern Recognition Group Rwth Aachen University Aachen Germany

ISBN: (纸本)9789893305898

This paper studies the practicality of the current state-of-the-art unsupervised methods in neural machine translation (NMT). In ten translation tasks with various data settings, we analyze the conditions under which the unsupervised methods fail to produce reasonable translations. We show that their performance is severely affected by linguistic dissimilarity and domain mismatch between source and target monolingual data. Such conditions are common for low-resource language pairs, where unsupervised learning works poorly. In all of our experiments, supervised and semi-supervised baselines with 50k-sentence bilingual data outperform the best unsupervised results. Our analyses pinpoint the limits of the current unsupervised NMT and also suggest immediate research directions. © 2020 The authors.

关键词： Neural machine translation

来源：评论

学校读者我要写书评

暂无评论

Phrase Model Training for Statistical Machine Translation with Word Lattices of Preprocessing Alternatives 7

Phrase Model Training for Statistical Machine Translation wi...

引用

7th Workshop on Statistical Machine Translation, WMT 2012, immediately following the Conference of the North-American Chapter of the Association for Computational Linguistics - human language technologies, NAACL HLT 2012

作者： Wuebker, Joern Ney, Hermann Human Language Technology And Pattern Recognition Group Rwth Aachen University Aachen Germany

ISBN: (纸本)9781937284206

In statistical machine translation, word lattices are used to represent the ambiguities in the preprocessing of the source sentence, such as word segmentation for Chinese or morphological analysis for German. Several approaches have been proposed to define the probability of different paths through the lattice with external tools like word segmenters, or by applying indicator features. We introduce a novel lattice design, which explicitly distinguishes between different preprocessing alternatives for the source sentence. It allows us to make use of specific features for each preprocessing type and to lexicalize the choice of lattice path directly in the phrase translation model. We argue that forced alignment training can be used to learn lattice path and phrase translation model simultaneously. On the newscommentary portion of the German!English WMT 2011 task we can show moderate improvements of up to 0.6% BLEU over a stateof- the-art baseline system. ©2012 Association for Computational Linguistics.

关键词： Computer aided language translation

来源：评论

学校读者我要写书评

暂无评论

A Combination of Hierarchical Systems with Forced Alignments from Phrase-Based Systems 7

A Combination of Hierarchical Systems with Forced Alignments...

引用

7th International Workshop on Spoken language Translation, IWSLT 2010

作者： Heger, Carmen Wuebker, Joern Vilar, David Ney, Hermann Human Language Technology and Pattern Recognition Group RWTH Aachen University Aachen Germany

Currently most state-of-the-art statistical machine translation systems present a mismatch between training and generation conditions. Word alignments are computed using the well known IBM models for single-word based translation. After-wards phrases are extracted using extraction heuristics, unrelated to the stochastic models applied for finding the word alignment. In the last years, several research groups have tried to overcome this mismatch, but only with limited success. Recently, the technique of forced alignments has shown to improve translation quality for a phrase-based system, applying a more statistically sound approach to phrase extraction. In this work we investigate the first steps to combine forced alignment with a hierarchical model. Experimental results on IWSLT and WMT data show improvements in translation quality of up to 0.7% BLEU and 1.0% TER. © IWSLT 2010. All rights reserved.

关键词： Alignment

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：