检索结果-内蒙古大学图书馆

2004 Human language Technology Conference of the North American Chapter of the Association for Computational Linguistics - Student Research Workshop, HLT-NAACL 2004

作者： Cui, Jia Guthrie, David Center for Language and Speech Processing Johns Hopkins University BaltimoreMD21210 United States Department of Computer Science University of Sheffield SheffieldS1 4DP United Kingdom

In this work, we are concerned with a coarse grained semantic analysis over sparse data, which labels all nouns with a set of semantic categories. To get the benefit of unlabeled data, we propose a bootstrapping framework with Maximum Entropy modeling (MaxEnt) as the statistical learning component. During the iterative tagging process, unlabeled data is used not only for better statistical estimation, but also as a medium to integrate non-statistical knowledge into the model training. Two main issues are discussed in this paper. First, Association Rule principles are suggested to guide MaxEnt feature selections. Second, to guarantee the convergence of the bootstrapping process, three adjusting strategies are proposed to soft tag unlabeled data. © HLT-NAACL 2004 - Human language Technology Conference of the North American Chapter of the Association for Computational Linguistics, Proceedings of the Student Research Workshop.

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

Detection of voice onset time (VOT) for unvoived stops (/p/, /t/, /k/) using the teager energy operator (TEO) for automatic detection of accented english

Detection of voice onset time (VOT) for unvoived stops (/p/,...

引用

Proceedings of the 6th Nordic Signal processing Symposium, NORSIG 2004

作者： Das, Sharmistha Hansen, John H.L. Department of Speech Science University of Colorado Boulder CO 80309-0594 United States Robust Speech Processing Group Center for Spoken Language Research University of Colorado Boulder CO 80309-0594 United States

Voice Onset Time (VOT) is an important temporal feature in speech perception and speech recognition. It also benefits for accent detection[1,2]. Fixed length frame based speech processing inherently ignores VOT. In this paper we propose a more effective VOT detection scheme using the non-linear energy tracking algorithm (Teager Energy Operator (TEO)) across a sub-frequency band partition for unvoiced stops (p, t and k). The VOT detection algorithm is applied to the problem of accent classification. Three different language groups (Indian, Chinese and American English) are used from CU-Accent-Corpus to compare VOT's of both accented and native American English. Some pathological cases are considered where speakers have breathy voices or other issues in recording procedure. The VOT is detected with less than 10% error when compared to the manual detected VOT. Also, pairwise English accent classification are 87% for Chinese accent, 80% for English accent, and 47% for Indian accent (includes atypical cases for Indian case).

关键词： speech recognition

来源：评论

学校读者我要写书评

暂无评论

Using random forests in the structured language model 04

Using random forests in the structured language model

引用

Proceedings of the 18th International Conference on Neural Information processing Systems

作者： Peng Xu Frederick Jelinek Center for Language and Speech Processing Department of Electrical and Computer Engineering The Johns Hopkins University

In this paper, we explore the use of Random Forests (RFs) in the structured language model (SLM), which uses rich syntactic information in predicting the next word based on words already seen. The goal in this work is to construct RFs by randomly growing Decision Trees (DTs) using syntactic information and investigate the performance of the SLM modeled by the RFs in automatic speech ***, which were originally developed as classifiers, are a combination of decision tree classifiers. Each tree is grown based on random training data sampled independently and with the same distribution for all trees in the forest, and a random selection of possible questions at each node of the decision tree. Our approach extends the original idea of RFs to deal with the data sparseness problem encountered in language *** have been studied in the context of n-gram language modeling and have been shown to generalize well to unseen data. We show in this paper that RFs using syntactic information can also achieve better performance in both perplexity (PPL) and word error rate (WER) in a large vocabulary speech recognition system, compared to a baseline that uses Kneser-Ney smoothing.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Effects of Spectro-Temporal Asynchrony in Auditory and Auditory-Visual speech processing

引用

Seminars in Hearing 2004年第3期25卷 241-255页

作者： Ken W. Grant Steven Greenberg David Poeppel Virginie van Wassenhove 1 Auditory-Visual Speech Recognition Laboratory Walter Reed Army Medical Center Army Audiology and Speech Center Washington District of Columbia 2 International Computer Speech Institute Berkeley California 3 Cognitive Neuroscience of Language Laboratory Neuroscience and Cognitive Science Program (NACS) Department of Biology and Department of Linguistics University of Maryland College Park Maryland

来源：评论

学校读者我要写书评

暂无评论

Automatic enrichment of a very large dictionary of word combinations on the basis of dependency formalism

引用

3rd Mexican International Conference on Artificial Intelligence, MICAI 2004

作者： Gelbukh, Alexander Sidorov, Grigori Han, San-Yong Hernández-Rubio, Erika Natural Language and Text Processing Laboratory Center for Computing Research National Polytechnic Institute Av. Juan Dios Batiz s/n Zacatenco Mexico City07738 Mexico Department of Computer Science and Engineering Chung-Ang University 221 Huksuk-Dong DongJak-Ku Seoul156-756 Korea Republic of

ISBN: (纸本)3540214593

The paper presents a method of automatic enrichment of a very large dictionary of word combinations. The method is based on results of automatic syntactic analysis (parsing) of sentences. The dependency formalism is used for representation of syntactic trees that allows for easier treatment of information about syntactic compatibility. Evaluation of the method is presented for the Spanish language based on comparison of the automatically generated results with manually marked word combinations. © Springer-Verlag Berlin Heidelberg 2004.

关键词： Syntactics

来源：评论

学校读者我要写书评

暂无评论

speechFIND: spoken document retrieval for a national gallery of the spoken word

SPEECHFIND: spoken document retrieval for a national gallery...

引用

Proceedings of the Nordic Signal processing Symposium (NORSIG)

作者： J.H.L. Hansen Rongqing Huang P. Mangalath Bowen Zhou M. Seadle J.R. Deller Robust Speech Processing Group Center for Spoken Language Research University of Colorado Boulder CO USA Michigan State University East Lansing MI USA Department Electrical & Computer Engineering Michigan State University East Lansing MI USA

来源：评论

学校读者我要写书评

暂无评论

Automatic recognition of spontaneous speech for access to multilingual oral history archives

Automatic recognition of spontaneous speech for access to mu...

引用

作者： Byrne, William Doermann, David Franz, Martin Gustman, Samuel Hajič, Jan Oard, Douglas Picheny, Michael Psutka, Josef Ramabhadran, Bhuvana Soergel, Dagobert Ward, Todd Zhu, Wei-Jing Ctr. for Lang. and Speech Processing Department of Electrical Engineering Johns Hopkins University Baltimore MD 21218 United States Inst. for Advanced Computer Studies University of Maryland College Park MD 20742 United States Natural Language Systems Department IBM T. J. Watson Research Center Yorktown Heights NY 10598 United States Survivors Shoah Vis. Hist. Found. Los Angeles CA 90078 United States Inst. of Formal/Applied Linguistics Center for Computational Linguistics Charles University CZ-11800 Prague 1 Czech Republic Inst. for Advanced Computer Studies College of Information Studies University of Maryland College Park MD 20742 United States Hum. Lang. Technologies Department IBM T. J. Watson Research Center Yorktown Heights NY 10598 United States Department of Cybernetics Center for Computational Linguistics University of West Bohemia CZ-30614 Pilsen Czech Republic College of Information Studies University of Maryland College Park MD 20742 United States

Much is known about the design of automated systems to search broadcast news, but it has only recently become possible to apply similar techniques to large collections of spontaneous speech. This paper presents initial results from experiments with speech recognition, topic segmentation, topic categorization, and named entity detection using a large collection of recorded oral histories. The work leverages a massive manual annotation effort on 10 000 h of spontaneous speech to evaluate the degree to which automatic speech recognition (ASR)-based segmentation and categorization techniques can be adapted to approximate decisions made by human annotators. ASR word error rates near 40% were achieved for both English and Czech for heavily accented, emotional and elderly spontaneous speech based on 65-84 h of transcribed speech. Topical segmentation based on shifts in the recognized English vocabulary resulted in 80% agreement with manually annotated boundary positions at a 0.35 false alarm rate. Categorization was considerably more challenging, with a nearest-neighbor technique yielding F = 0.3. This is less than half the value obtained by the same technique on a standard newswire categorization benchmark, but replication on human-transcribed interviews showed that ASR errors explain little of that difference. The paper concludes with a description of how these capabilities could be used together to search large collections of recorded oral histories.

关键词： speech recognition

来源：评论

学校读者我要写书评

暂无评论

Forward-decoding kernel-based phone sequence recognition 15

Forward-decoding kernel-based phone sequence recognition

引用

16th Annual Neural Information processing Systems Conference, NIPS 2002

作者： Chakrabartty, Shantanu Cauwenberghs, Gert Center for Language and Speech Processing Department of Electrical and Computer Engineering Johns Hopkins University Baltimore MD 21218 United States

ISBN: (纸本)0262025507

Forward decoding kernel machines (FDKM) combine large-margin classifiers with hidden Markov models (HMM) for maximum a posteriori (MAP) adaptive sequence estimation. State transitions in the sequence are conditioned on observed data using a kernel-based probability model trained with a recursive scheme that deals effectively with noisy and partially labeled data. Training over very large datasets is accomplished using a sparse probabilistic support vector machine (SVM) model based on quadratic entropy, and an on-line stochastic steepest descent algorithm. For speaker-independent continuous phone recognition, FDKM trained over 177,080 samples of the TIMET database achieves 80.6% recognition accuracy over the full test set, without use of a prior phonetic language model.

关键词： Decoding

来源：评论

学校读者我要写书评

暂无评论

Word-selective training for speech recognition

Word-selective training for speech recognition

引用

IEEE Workshop on Automatic speech Recognition and Understanding

作者： T.M. Kamm G.G.L. Meyer Center for Language and Speech Processing Department of Electrical and Computer Engineering Johns Hopkins University Baltimore MD USA

We previously proposed (Kamm and Meyer (2001, 2002)) a two-pronged approach to improve system performance by selective use of training data. We demonstrated a sentence-selective algorithm that, first, made effective use of the available humanly transcribed training data and, second, focused future human transcription effort on data that was more likely to improve system performance. We now extend that algorithm to focus on word selection, and demonstrate that we can reduce the error rate from 10.3 % to 9.3 % on a simple, 36-word corpus, by selecting 30 % (15 hours) of the 50 hours of training data available in this corpus, without knowledge of the true transcription. We also discuss application of our word selection algorithm to the Wall Street Journal 5 K word task. Preliminary results show that we can select up to 60 % (48 hours) of the training data, with minimal knowledge of the true transcription, and match or beat the error rate of a system built using the same amount of randomly selected training data.

关键词： speech recognition Training data Error analysis System performance Humans Costs Natural languages speech processing Automatic speech recognition Learning systems

来源：评论

学校读者我要写书评

暂无评论

Evaluating sense disambiguation across diverse parameter spaces

引用

Natural language Engineering 2002年第4期8卷 293-310页

作者： Yarowsky, David Florian, Radu Department of Computer Science and Center for Language and Speech Processing Johns Hopkins University MD 21218 United States

This paper presents a comprehensive empirical exploration and evaluation of a diverse range of data characteristics which influence word sense disambiguation performance. It focuses on a set of six core supervised algorithms, including three variants of Bayesian classifiers, a cosine model, non-hierarchical decision lists, and an extension of the transformation-based learning model. Performance is investigated in detail with respect to the following parameters: (a) target language (English, Spanish, Swedish and Basque);(b) part of speech;(c) sense granularity;(d) inclusion and exclusion of major feature classes;(e) variable context width (further broken down by part-of-speech of keyword);(f) number of training examples;(g) baseline probability of the most likely sense;(h) sense distributional entropy;(i) number of senses per keyword;(j) divergence between training and test data;(k) degree of (artificially introduced) noise in the training data;(1) the effectiveness of an algorithm’s confidence rankings;and (m) a full keyword breakdown of the performance of each algorithm. The paper concludes with a brief analysis of similarities, differences, strengths and weaknesses of the algorithms and a hierarchical clustering of these algorithms based on agreement of sense classification behavior. Collectively, the paper constitutes the most comprehensive survey of evaluation measures and tests yet applied to sense disambiguation algorithms. And it does so over a diverse range of supervised algorithms, languages and parameter spaces in single unified experimental framework. © 2002, Cambridge University Press. All rights reserved.

关键词：

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：