检索结果-内蒙古大学图书馆

7th International Workshop on Spoken language Translation, IWSLT 2010

作者： Mansour, Saab Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen Germany

In this paper, we investigate different methodologies of Arabic segmentation for statistical machine translation by comparing a rule-based segmenter to different statistically-based segmenters. We also present a new method for segmentation that serves the need for a real-time translation system without impairing the translation accuracy. © IWSLT 2010. All rights reserved.

关键词： computer aided language translation

来源：评论

学校读者我要写书评

暂无评论

The RWTH Aachen Machine Translation system for IWSLT 2010 7

The RWTH Aachen Machine Translation system for IWSLT 2010

引用

7th International Workshop on Spoken language Translation, IWSLT 2010

作者： Mansour, Saab Peitz, Stephan Vilar, David Wuebker, Joern Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen Germany

In this paper we describe the statistical machine translation system of the RWTH Aachen University developed for the translation task of the IWSLT 2010. This year, we participated in the BTEC translation task for the Arabic to English language direction. We experimented with two state-of-the-art decoders: phrase-based and hierarchical-based decoders. Extensions to the decoders included phrase training (as opposed to heuristic phrase extraction) for the phrase-based decoder, and soft syntactic features for the hierarchical decoder. Additionally, we experimented with various rule-based and statistical-based segmenters for Arabic. Due to the different decoders and the different methodologies that we apply for segmentation, we expect that there will be complimentary variation in the results achieved by each system. The next step would be to exploit these variations and achieve better results by combining the systems. We try different strategies for system combination and report significant improvements over the best single system. © IWSLT 2010. All rights reserved.

关键词： Computational linguistics

来源：评论

学校读者我要写书评

暂无评论

Direct construction of compact context-dependency transducers from data

Direct construction of compact context-dependency transducer...

引用

作者： Rybach, David Riley, Michael Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Germany Google Inc. 76 Ninth Avenue New York NY United States

This paper describes a new method for building compact context-dependency transducers for finite-state transducer-based ASR decoders. Instead of the conventional phonetic decision-tree growing followed by FST compilation, this approach incorporates the phonetic context splitting directly into the transducer construction. The objective function of the split optimization is augmented with a regularization term that measures the number of transducer states introduced by a split. We give results on a large spoken-query task for various n-phone orders and other phonetic features that show this method can greatly reduce the size of the resulting context-dependency transducer with no significant impact on recognition accuracy. This permits using context sizes and features that might otherwise be unmanageable. © 2010 ISCA.

关键词： Transducers

来源：评论

学校读者我要写书评

暂无评论

DISCRIMINATIVE HMMS, LOG-LINEAR MODELS, AND CRFS: WHAT IS THE DIFFERENCE?

DISCRIMINATIVE HMMS, LOG-LINEAR MODELS, AND CRFS: WHAT IS TH...

引用

IEEE International Conference on Acoustics, Speech, and Signal Processing

作者： G. Heigold S. Wiesler M. Nussbaum-Thom P. Lehnen R. Schluter H. Ney Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen Germany

ISBN: (纸本)9781424442959

Recently, there have been many papers studying discriminative acoustic modeling techniques like conditional random fields or discriminative training of conventional Gaussian HMMs. This paper will give an overview of the recent work and progress. We will strictly distinguish between the type of acoustic models on the one hand and the training criterion on the other hand. We will address two issues in more detail: the relation between conventional Gaussian HMMs and conditional random fields and the advantages of formulating the training criterion as a convex optimization problem. Experimental results for various speech tasks will be presented to carefully evaluate the different concepts and approaches, including both a digit string and large vocabulary continuous speech recognition tasks.

关键词： speech recognition hidden Markov model discriminative training log-linear model conditional random field

来源：评论

学校读者我要写书评

暂无评论

A comparative large scale study of MLP features for mandarin ASR

A comparative large scale study of MLP features for mandarin...

引用

作者： Valente, Fabio Doss, Mathew Magimai Plahl, Christian Ravuri, Suman Wang, Wen IDIAP Research Institute CH-1920 Martigny Switzerland Human Language Technology and Pattern Recognition RWTH Aachen University Germany International Computer Science Institute 1947 Center Street Berkeley CA 94704 United States Speech Technology and Research Laboratory SRI International Menlo Park CA United States

MLP based front-ends have shown significant complementary properties to conventional spectral features. As part of the DARPA GALE program, different MLP features were developed for Mandarin ASR. In this paper, all the proposed frontends are compared in systematic manner and we extensively investigate the scalability of these features in terms of the amount of training data (from 100 hours to 1600 hours) and system complexity (maximum likelihood training, SAT, lattice level combination, and discriminative training). Results on 5 hours of evaluation data from the GALE project reveal that the MLP features consistently produce relative improvements in the range of 15% - 23% at the different steps of a multipass system when compared to the conventional short-term spectral based features like MFCC and PLP. The largest improvement is obtained using a hierarchical MLP approach. © 2010 ISCA.

关键词： Maximum likelihood

来源：评论

学校读者我要写书评

暂无评论

Extending statistical machine translation with discriminative and trigger-based lexicon models

Extending statistical machine translation with discriminativ...

引用

2009 Conference on Empirical Methods in Natural language Processing, EMNLP 2009, Held in Conjunction with ACL-IJCNLP 2009

作者： Mauser, Arne Hasan, SǍa Ney, Hermann Human Language Technology and Pattern Recognition Group Department of Computer Science 6 RWTH Aachen University Germany

In this work, we propose two extensions of standard word lexicons in statistical machine translation: A discriminative word lexicon that uses sentence-level source information to predict the target words and a trigger-based lexicon model that extends IBM model 1 with a second trigger, allowing for a more fine-grained lexical choice of target words. The models capture dependencies that go beyond the scope of conventional SMT models such as phraseand language models. We show that the models improve translation quality by 1% in BLEU over a competitive baseline on a large-scale task. © 2009 ACL and AFNLP.

关键词： computer aided language translation

来源：评论

学校读者我要写书评

暂无评论

Are unaligned words important for machine translation?

Are unaligned words important for machine translation?

引用

13th Annual Conference of the European Association for Machine Translation, EAMT 2009

作者： Zhang, Yuqi Matusov, Evgeny Ney, Hermann Human Language Technology and Pattern Recognition Lehrstuhl für Informatik 6 - Computer Science Department RWTH Aachen University D-52056 Aachen Germany

In this paper, we deal with the problem of a large number of unaligned words in automatically learned word alignments for machine translation (MT). These unaligned words are the reason for ambiguous phrase pairs extracted by a statistical phrase-based MT system. In translation, this phrase ambiguity causes deletion and insertion errors. We present hard and optional deletion approaches to remove the unaligned words in the source language sentences. Improvements in translation quality are achieved both on large and small vocabulary tasks with the presented methods. © 2009 European Association for Machine Translation.

关键词： Machine translation

来源：评论

学校读者我要写书评

暂无评论

Audio segmentation for speech recognition using segment features

Audio segmentation for speech recognition using segment feat...

引用

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

作者： David Rybach Christian Gollan Ralf Schluter Hermann Ney Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Germany

Audio segmentation is an essential preprocessing step in several audio processing applications with a significant impact e.g. on speech recognition performance. We introduce a novel framework which combines the advantages of different well known segmentation methods. An automatically estimated log-linear segment model is used to determine the segmentation of an audio stream in a holistic way by a maximum a posteriori decoding strategy, instead of classifying change points locally. A comparison to other segmentation techniques in terms of speech recognition performance is presented, showing a promising segmentation quality of our approach.

关键词： Speech recognition Streaming media Decoding Broadcasting Loudspeakers Automatic speech recognition Bayesian methods humans Natural languages pattern recognition

来源：评论

学校读者我要写书评

暂无评论

Spoken language processing techniques for sign language recognition and translation

引用

technology and Disability 2008年第2期20卷 121-133页

作者： Dreuw, Philippe Stein, Daniel Deselaers, Thomas Rybach, David Zahedi, Morteza Bungeroth, Jan Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department 6 RWTH Aachen University Aachen Germany

We present an approach to automatically recognize sign language and translate it into a spoken language. A system to address these tasks is created based on state-of-the-art techniques from statistical machine translation, speech recognition, and image processing research. Such a system is necessary for communication between deaf and hearing people. The communication is otherwise nearly impossible due to missing sign language skills on the hearing side, and the low reading and writing skills on the deaf side. As opposed to most current approaches, which focus on the recognition of isolated signs only, we present a system that recognizes complete sentences in sign language. Similar to speech recognition, we have to deal with temporal sequences. Instead of the acoustic signal in speech recognition, we process a video signal as input. Therefore, we use a speech recognition system to obtain a textual representation of the signed sentences. This intermediate representation is then fed into a statistical machine translation system to create a translation into a spoken language. To achieve good results, some particularities of sign languages are considered in both systems. We use a publicly available corpus to show the performance of the proposed system and report very promising results. © 2008 IOS Press. All rights reserved.

关键词： computer input-output equipment SIGN language SYMBOLIC communication MEANS of communication for deaf people SPEECH perception AUTOMATIC speech recognition

来源：评论

学校读者我要写书评

暂无评论

The RWTH Machine Translation System for IWSLT 2008 5

The RWTH Machine Translation System for IWSLT 2008

引用

5th International Workshop on Spoken language Translation, IWSLT 2008

作者： Vilar, David Stein, Daniel Zhang, Yuqi Matusov, Evgeny Mauser, Arne Bender, Oliver Mansour, Saab Ney, Hermann Human Language Technology and Pattern Recognition Lehrstuhl für Informatik 6 Computer Science Department RWTH Aachen University AachenD-52056 Germany

RWTH's system for the 2008 IWSLT evaluation consists of a combination of different phrase-based and hierarchical statistical machine translation systems. We participated in the translation tasks for the Chinese-to-English and Arabic-to-English language pairs. We investigated different preprocessing techniques, reordering methods for the phrase-based system, including reordering of speech lattices, and syntax-based enhancements for the hierarchical systems. We also tried the combination of the Arabic-to-English and Chinese-to-English outputs as an additional submission. © 2008 International Workshop on Spoken language Translation, IWSLT 2008. All rights reserved.

关键词： Hierarchical systems

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：