Search results - Inner Mongolia University Library

i-CREATe 2007 - 1st International Convention on Rehabilitation Engineering and Assistive Technology in Conjunction with 1st Tan Tock Seng Hospital Neurorehabilitation Meeting

Authors: Manochiopinig, Sriwimon; Thubthong, Nuttakorn; Kayasith, Prakasit. Affiliations: Speech and Language Therapy Unit, Department of Rehabilitation Medicine, Mahidol University, Bangkoknoi, Bangkok 10700, Thailand; Acoustics and Speech Research Group, Department of Physics, Chulalongkorn University, Phathumwan, Bangkok 10300, Thailand; Assistive Technology Center, National Electronics and Computer Center, Thailand Science Park, Pathumthani 12120, Thailand

ISBN (print): 9781595938527

The dysarthric speech characteristics of 14 Thai stroke patients were assessed with the computerized Articulation Test [1]. Speech accuracy and error patterns were analyzed. Vowels and tonal characteristics were the most intact characteristics, while reduction of clusters was the most impaired feature. Both initial and final consonants were frequently substituted, followed by omission and distortion. Generally, low and mid tones, unaspirated consonants, final consonants, and monophthong vowels were produced more precisely than the other features. © ACM 2007.

Keywords: Linguistics

Segmentation and alignment of parallel text for statistical machine translation

Natural Language Engineering, 2007, vol. 13, no. 3, pp. 235-260

Authors: Deng, Yonggang; Kumar, Shankar; Byrne, William. Affiliations: Center for Language and Speech Processing, Department of Electrical and Computer Engineering, The Johns Hopkins University, 3400 N. Charles St., Baltimore, MD 21218, United States; Google Inc., 1600 Amphitheatre Parkway, Mountain View, CA 94043, United States; Department of Engineering, Cambridge University, Trumpington Street, Cambridge CB2 1PZ, United Kingdom

We address the problem of extracting bilingual chunk pairs from parallel text to create training sets for statistical machine translation. We formulate the problem in terms of a stochastic generative process over text translation pairs, and derive two *** alignment procedures based on the underlying alignment model. The first procedure is a now-standard dynamic programming alignment model which we use to generate an initial coarse alignment of the parallel text. The second procedure is a divisive clustering parallel text alignment procedure which we use to refine the first-pass alignments. This latter procedure is novel in that it permits the segmentation of the parallel text into sub-sentence units which are allowed to be reordered to improve the chunk alignment. The quality of chunk pairs is measured by the performance of machine translation systems trained from them. We show practical *** of divisive clustering as well as how system performance can be improved by exploiting portions of the parallel text that otherwise would have to be discarded. We also show that chunk alignment as a first step in word alignment can *** reduce word alignment error rate. © 2007 Cambridge University Press.

Keywords: computer aided language translation

Finding a Needle in a Haystack

Annual Conference on Information Sciences and Systems (CISS)

Authors: Bruno Jedynak; Damianos Karakos. Affiliations: Department of Applied Mathematics, Johns Hopkins University, MD, USA; Center for Language and Speech Processing and Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD, USA

Summary form only given. We study a simplified version of the problem of target detectability in the presence of clutter. The target (the needle) is a sample of size N from a discrete distribution p. The clutter (the haystack) is made up of M independent samples of size N from a distribution q (which is different from p, but with the same support). Two cases can be easily shown: (i) If M is fixed and N goes to infinity, the target can be detected with probability that approaches 1. (ii) If N is fixed and M goes to infinity, then, with probability approaching 1, the target cannot be detected. For the case where both N and M go to infinity, we show that the asymptotic behavior of the optimal detector (if p, q are known) and of a plug-in detector (which estimates p, q on the fly) is determined by the asymptotic behavior of the quantity M exp(-N D(p||q)): if it goes to zero (resp. infinity), then, with high probability, the target can (resp. cannot) be detected.

Keywords: Needles; H infinity control; World Wide Web; Detectors; Mathematics; Natural languages; Speech processing
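The threshold quantity in this abstract, M exp(-N D(p||q)), can be evaluated numerically. The sketch below is only an illustration and is not taken from the paper: it assumes D(p||q) is the Kullback-Leibler divergence measured in nats, and the function names and the example distributions are hypothetical.

```python
import math


def kl_divergence(p, q):
    """D(p || q) in nats for two discrete distributions on the same support.

    Assumption: the abstract's D is the Kullback-Leibler divergence.
    """
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def detectability_statistic(p, q, n, m):
    """The quantity M * exp(-N * D(p || q)) named in the abstract."""
    return m * math.exp(-n * kl_divergence(p, q))


# Hypothetical example distributions, chosen only for illustration.
p = [0.5, 0.5]
q = [0.9, 0.1]

# Large N relative to log M: the quantity is close to zero.
small = detectability_statistic(p, q, n=200, m=1000)
# Small N relative to log M: the quantity is much larger than 1.
large = detectability_statistic(p, q, n=5, m=1000)
```

According to the abstract, it is the limiting behavior of this quantity as N and M grow (toward zero or toward infinity) that decides detectability; the two finite values above only show which way it moves.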

Resolving and generating definite anaphora by modeling hypernymy using unlabeled corpora

10th Conference on Computational Natural Language Learning, CoNLL-X

Authors: Garera, Nikesh; Yarowsky, David. Affiliation: Department of Computer Science, Center for Language and Speech Processing, Johns Hopkins University, Baltimore, MD 21218, United States

We demonstrate an original and successful approach for both resolving and generating definite anaphora. We propose and evaluate unsupervised models for extracting hypernym relations by mining cooccurrence data of definite NPs and potential antecedents in an unlabeled corpus. The algorithm outperforms a standard WordNet-based approach to resolving and generating definite anaphora. It also substantially outperforms recent related work using pattern-based extraction of such hypernym relations for coreference resolution. © 2006 Association for Computational Linguistics.

Vine parsing and minimum risk reranking for speed and precision

10th Conference on Computational Natural Language Learning, CoNLL-X

Authors: Dreyer, Markus; Smith, David A.; Smith, Noah A. Affiliation: Department of Computer Science/Center for Language and Speech Processing, Johns Hopkins University, Baltimore, MD 21218, United States

We describe our entry in the CoNLL-X shared task. The system consists of three phases: a probabilistic vine parser (Eisner and N. Smith, 2005) that produces unlabeled dependency trees, a probabilistic relation-labeling model, and a discriminative minimum risk reranker (D. Smith and Eisner, 2006). The system is designed for fast training and decoding and for high precision. We describe sources of crosslingual error and ways to ameliorate them. We then provide a detailed error analysis of parses produced for sentences in German (much training data) and Arabic (little training data).

Keywords: Syntactics

Novel probabilistic finite-state transducers for cognate and transliteration modeling

7th Biennial Conference of the Association for Machine Translation in the Americas, AMTA 2006

Author: Schafer, Charles. Affiliation: Department of Computer Science, Center for Language and Speech Processing, Johns Hopkins University, Baltimore, MD 21218, United States

We present and empirically compare a range of novel probabilistic finite-state transducer (PFST) models targeted at two major natural language string transduction tasks, transliteration selection and cognate translation selection. Evaluation is performed on 10 distinct language pair data sets, and in each case novel models consistently and substantially outperform a well-established standard reference algorithm. © 2006 The Association for Machine Translation in the Americas.

Keywords: Transducers

A fast finite-state relaxation method for enforcing global constraints on sequence decoding

2006 Human Language Technology Conference - North American Chapter of the Association for Computational Linguistics Annual Meeting, HLT-NAACL 2006

Authors: Tromble, Roy W.; Eisner, Jason. Affiliation: Department of Computer Science, Center for Language and Speech Processing, Johns Hopkins University, Baltimore, MD 21218, United States

We describe finite-state constraint relaxation, a method for applying global constraints, expressed as automata, to sequence model decoding. We present algorithms for both hard constraints and binary soft constraints. On the CoNLL-2004 semantic role labeling task, we report a speedup of at least 16x over a previous method that used integer linear programming. © 2006 Association for Computational Linguistics.

Keywords: Decoding

Minimum risk annealing for training log-linear models

21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, COLING/ACL 2006

Authors: Smith, David A.; Eisner, Jason. Affiliation: Department of Computer Science, Center for Language and Speech Processing, Johns Hopkins University, Baltimore, MD 21218, United States

When training the parameters for a natural language system, one would prefer to minimize 1-best loss (error) on an evaluation set. Since the error surface for many natural language problems is piecewise constant and riddled with local minima, many systems instead optimize log-likelihood, which is conveniently differentiable and convex. We propose training instead to minimize the expected loss, or risk. We define this expectation using a probability distribution over hypotheses that we gradually sharpen (anneal) to focus on the 1-best hypothesis. Besides the linear loss functions used in previous work, we also describe techniques for optimizing nonlinear functions such as precision or the BLEU metric. We present experiments training log-linear combinations of models for dependency parsing and for machine translation. In machine translation, annealed minimum risk training achieves significant improvements in BLEU over standard minimum error training. We also show improvements in labeled dependency parsing. © 2006 Association for Computational Linguistics

Keywords: Machine translation
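The idea of an expected loss under a gradually sharpened distribution can be illustrated on a toy hypothesis list. This is a rough sketch, not the authors' implementation: it assumes the distribution is obtained by scaling model scores with a sharpening factor (here called `gamma`) before normalizing, and the scores, losses, and function names are hypothetical. No parameter training is shown.

```python
import math


def annealed_distribution(scores, gamma):
    """Distribution over hypotheses proportional to exp(gamma * score).

    Assumption: sharpening is done by scaling scores with gamma.
    The maximum is subtracted first for numerical stability.
    """
    top = max(scores)
    weights = [math.exp(gamma * (s - top)) for s in scores]
    total = sum(weights)
    return [w / total for w in weights]


def expected_loss(scores, losses, gamma):
    """Risk: the loss averaged under the annealed distribution."""
    probs = annealed_distribution(scores, gamma)
    return sum(p * l for p, l in zip(probs, losses))


# Hypothetical model scores and per-hypothesis losses for three candidates.
scores = [2.0, 1.5, 0.3]
losses = [0.4, 0.1, 0.9]

# gamma = 0 gives a uniform average; large gamma approaches the 1-best loss.
risks = [expected_loss(scores, losses, g) for g in (0.0, 1.0, 10.0, 100.0)]
```

With `gamma = 0` the risk is the plain mean of the losses, and as `gamma` grows it approaches the loss of the highest-scoring hypothesis, which is the "focus on the 1-best hypothesis" behavior the abstract describes.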

Quasi-synchronous grammars: Alignment by soft projection of syntactic dependencies

2006 Workshop on Statistical Machine Translation, WMT 2006, collocated with the HLT-NAACL 2006

Authors: Smith, David A.; Eisner, Jason. Affiliation: Department of Computer Science, Center for Language and Speech Processing, Johns Hopkins University, Baltimore, MD 21218, United States

Many syntactic models in machine translation are channels that transform one tree into another, or synchronous grammars that generate trees in parallel. We present a new model of the translation process: quasi-synchronous grammar (QG). Given a source-language parse tree T1, a QG defines a monolingual grammar that generates translations of T1. The trees T2 allowed by this monolingual grammar are inspired by pieces of substructure in T1 and aligned to T1 at those points. We describe experiments learning quasi-synchronous context-free grammars from bitext. As with other monolingual language models, we evaluate the cross-entropy of QGs on unseen text and show that a better fit to bilingual data is achieved by allowing greater syntactic divergence. When evaluated on a word alignment task, QG matches standard baselines. © HLT-NAACL *** right reserved.

Keywords: Syntactics

A weighted finite state transducer translation template model for statistical machine translation

Natural Language Engineering, 2006, vol. 12, no. 1, pp. 35-75

Authors: Kumar, Shankar; Deng, Yonggang; Byrne, William. Affiliation: Center for Language and Speech Processing, Department of Electrical and Computer Engineering, The Johns Hopkins University, 3400 N. Charles St., Baltimore, MD 21218, United States

We present a Weighted Finite State Transducer Translation Template Model for statistical machine translation. This is a source-channel model of translation inspired by the Alignment Template translation model. The model attempts to overcome the deficiencies of word-to-word translation models by considering phrases rather than words as units of translation. The approach we describe allows us to implement each constituent distribution of the model as a weighted finite state transducer or acceptor. We show that bitext word alignment and translation under the model can be performed with standard finite state machine operations involving these transducers. One of the benefits of using this framework is that it avoids the need to develop specialized search procedures, even for the generation of lattices or N-Best lists of bitext word alignments and translation hypotheses. We report and analyze bitext word alignment and translation performance on the Hansards French-English task and the FBIS Chinese-English task under the Alignment Error Rate, BLEU, NIST and Word Error-Rate metrics. These experiments identify the contribution of each of the model components to different aspects of alignment and translation performance. We finally discuss translation performance with large bitext training sets on the NIST 2004 Chinese-English and Arabic-English MT tasks. © 2005 Cambridge University Press.

Keywords: computer aided language translation
