Search Results - Inner Mongolia University Library

8th Workshop on Statistical Machine Translation, WMT 2013

Authors: Post, Matt Ganitkevitch, Juri Orland, Luke Weese, Jonathan Cao, Yuan Callison-Burch, Chris Human Language Technology Center of Excellence Johns Hopkins University Center for Language and Speech Processing Johns Hopkins University Computer and Information Sciences Department University of Pennsylvania United States

ISBN: (print) 9781937284572

We describe improvements made over the past year to Joshua, an open-source translation system for parsing-based machine translation. The main contributions this past year are significant improvements in both speed and usability of the grammar extraction and decoding steps. We have also rewritten the decoder to use a sparse feature representation, enabling training of large numbers of features with discriminative training methods. © 2013 Association for Computational Linguistics

Keywords: Open systems


Sequential UBM adaptation for speaker verification


2013 IEEE China Summit and International Conference on Signal and Information Processing, ChinaSIP 2013

Authors: Wang, Jun Wang, Dong Wu, Xiaojun Zheng, Thomas Fang Center for Speech and Language Technologies Division of Technical Innovation and Development Tsinghua National Laboratory for Information Science and Technology Beijing 100084 China Center for Speech and Language Technologies Research Institute of Information Technology Tsinghua University Beijing 100084 China Department of Computer Science and Technology Tsinghua University Beijing 100084 China

ISBN: (print) 9781479910434

GMM-UBM-based speaker verification relies heavily on a well-trained UBM. In practice, it is often not easy to obtain a UBM that fully matches the acoustic channels in operation. To solve this problem, we propose a novel sequential MAP adaptation approach: by being sequentially updated with data from new enrollments, the UBM learns and converges to the working channel. Our experiments are conducted on a time-varying speech database, with two channel-mismatched UBMs as the initial model. The results confirm that the sequential UBM adaptation provides significant performance improvement, leading to a relative EER reduction of 6.3% and 14.8% for the two mismatched UBMs, respectively. © 2013 IEEE.
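As a rough illustration of the sequential update this abstract describes, the sketch below applies the standard relevance-MAP mean update to a toy one-dimensional, two-component GMM, one enrollment batch at a time. The relevance factor `r = 16.0`, the toy data, the fixed variances and weights, and all function names are assumptions made for illustration; the abstract does not specify the paper's actual update rule or parameters.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a one-dimensional Gaussian."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def map_adapt_means(means, variances, weights, data, r=16.0):
    """One relevance-MAP pass over `data`; only the means are adapted."""
    k = len(means)
    soft_counts = [0.0] * k    # sum of posteriors per component
    weighted_sums = [0.0] * k  # posterior-weighted sum of observations
    for x in data:
        likes = [weights[i] * gaussian_pdf(x, means[i], variances[i]) for i in range(k)]
        total = sum(likes)
        if total == 0.0:
            continue  # observation too far from every component to assign
        for i in range(k):
            gamma = likes[i] / total
            soft_counts[i] += gamma
            weighted_sums[i] += gamma * x
    new_means = []
    for i in range(k):
        if soft_counts[i] == 0.0:
            new_means.append(means[i])
            continue
        alpha = soft_counts[i] / (soft_counts[i] + r)  # data-dependent mixing weight
        data_mean = weighted_sums[i] / soft_counts[i]
        new_means.append(alpha * data_mean + (1.0 - alpha) * means[i])
    return new_means


def sequential_adapt(means, variances, weights, enrollments, r=16.0):
    """Feed enrollment batches one at a time, carrying the adapted means forward."""
    for batch in enrollments:
        means = map_adapt_means(means, variances, weights, batch, r)
    return means


# Toy "mismatched" model: means at -1 and 1, while the working channel sits at -0.5 and 1.5.
initial_means = [-1.0, 1.0]
enrollments = [[-0.5] * 20 + [1.5] * 20 for _ in range(5)]
adapted = sequential_adapt(initial_means, [0.1, 0.1], [0.5, 0.5], enrollments)
```

In this toy setting each batch pulls the means part of the way toward the new channel, so the adapted means drift from the initial values toward the data over successive enrollments.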

Keywords: speech recognition


TASK-DRIVEN ATTENTIONAL MECHANISMS FOR AUDITORY SCENE RECOGNITION


IEEE International Conference on Acoustics, Speech, and Signal Processing

Authors: Kailash Patil Mounya Elhilali Center for Language and Speech Processing Department of Electrical and Computer Engineering Johns Hopkins University Baltimore MD USA

ISBN: (print) 9781479903573

How do humans attend to and pick out relevant auditory objects amongst all other sounds in the environment? Based on neurophysiological findings, we propose two task-oriented attentional mechanisms that act as Bayesian priors at two separate levels of processing: a sensory mapping stage and an object representation stage. The former, sensory stage is modeled as a high-dimensional mapping which captures the spectrotemporal nuances and cues of auditory objects. The latter, object representation stage then captures the statistical distribution of the different classes of acoustic scenes. This scheme shows a relative improvement in performance of 81% compared to a baseline system.

Keywords: Auditory Attention Acoustic Scene Analysis Sensory processing Object based attention Sensory Process statistical distribution capture


Emotional speaker verification with linear adaptation


2013 IEEE China Summit and International Conference on Signal and Information Processing, ChinaSIP 2013

Authors: Bie, Fanhu Wang, Dong Zheng, Thomas Fang Chen, Ruxin Center for Speech and Language Technologies Tsinghua National Laboratory for Information Science and Technology Tsinghua University China R and D Sony Computer Entertainment America Foster City CA United States Department of Computer Science and Technology Tsinghua University Beijing 100084 China

ISBN: (print) 9781479910434

Speaker verification suffers from significant performance degradation on emotional speech. We present an adaptation approach based on maximum likelihood linear regression (MLLR) and its feature-space variant, CMLLR. Our preliminary experiments demonstrate that this approach leads to considerable performance improvement, particularly with CMLLR (about 10% relative EER reduction on average). We also find that the performance gain can be significantly increased with a large set of training data for the transform estimation. © 2013 IEEE.

Keywords: Maximum likelihood estimation


Korean-Thai Lexicon for Natural Language Processing


International Conference on Information Science and Applications (ICISA)

Authors: Sukrita Mahahing Pusadee Seresangtakul Natural Language and Speech Processing Laboratory (NLSP) Department of Computer Science Faculty of Science Khon Kaen University Khon Kaen Thailand

This paper presents a Korean-Thai lexicon. The research aims to study and collect the features necessary to construct a Korean-Thai lexicon for natural language processing (NLP) and speech processing research. The method consisted of (1) creating a Korean-Thai lexicon whose entries have 7 parts: Korean word, Korean Revised Romanization, part of speech, sub part of speech, special characteristic, Thai meaning, and description of meaning; and (2) Korean transcription. Because useful tools for Korean-Thai machine translation are lacking, we propose creating a Korean-Thai lexicon for machine translation. The lexicon consists of 36,000 Korean words. As it would take a lot of time and effort to gather enough Korean words to cover all domains, Korean Revised Romanization was applied for some words, such as terminology, names, and places.
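The seven-part entry layout listed in this abstract could be represented as a simple record type. The sketch below is only an illustration: the field names, the example word, and its field values are assumptions and are not taken from the authors' lexicon.

```python
from dataclasses import dataclass, fields


@dataclass(frozen=True)
class LexiconEntry:
    """One lexicon entry with the seven parts named in the abstract (field names are hypothetical)."""
    korean_word: str
    romanization: str            # Korean Revised Romanization
    part_of_speech: str
    sub_part_of_speech: str
    special_characteristic: str
    thai_meaning: str
    meaning_description: str


# Hypothetical example entry, not drawn from the actual lexicon.
entry = LexiconEntry(
    korean_word="학교",
    romanization="hakgyo",
    part_of_speech="noun",
    sub_part_of_speech="common noun",
    special_characteristic="",
    thai_meaning="โรงเรียน",
    meaning_description="a place of education",
)

# A lexicon keyed by the Korean word supports direct lookup during translation.
lexicon = {entry.korean_word: entry}
NUM_PARTS = len(fields(LexiconEntry))
```

Keying the dictionary on the Korean word is one possible design; a real lexicon with homographs would likely need a list of entries per key.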

Keywords: speech Educational institutions Databases Printing Encyclopedias Natural language processing speech processing


Predictive analysis of two tone stream segregation via extended Kalman filter


Annual Conference on Information Sciences and Systems (CISS)

Authors: Debmalya Chakrabarty Mounya Elhilali Department of Electrical and Computer Engineering Laboratory of Computational Audio Perception Center for Speech and Language Processing Johns Hopkins University MD USA

Hearing engages, in a seemingly effortless way, complex processes that allow our brains to parse the acoustic environment around us into perceptual sound objects, in a phenomenon called streaming or stream segregation. In this paper, we explore the hypothesis that the auditory system relies on the regularity inherent to each stream to segregate it from other competing streams in the scene. Tracking these regularities is achieved via a recursive prediction that tracks the evolution of each stream, using a Kalman filtering approach. The proposed approach combines spectral analysis operating at the level of the auditory periphery with a temporal analysis using Kalman tracking. To incorporate nonlinear relationships in the signal patterns, we employ an extended Kalman filter. This scheme is tested on sinusoidal patterns, or the two-tone paradigm. The combined spectral and temporal analysis developed here is able to predict perceptual results of stream segregation by human listeners in a two-tone paradigm.
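The idea of tracking a stream by recursive prediction can be sketched with a much simpler filter than the one in the paper. The example below uses a scalar linear Kalman filter with a random-walk state model over log-frequency; it is not the authors' extended Kalman filter and omits their peripheral spectral analysis. The class name, the noise variances, and the 500 Hz and 1000 Hz tones are all assumptions chosen for illustration.

```python
import math


class ScalarKalman:
    """Random-walk Kalman filter over a single value (here: log2 of tone frequency)."""

    def __init__(self, initial_state, initial_var=1.0, process_var=1e-3, measurement_var=1e-2):
        self.x = initial_state      # state estimate
        self.p = initial_var        # estimate variance
        self.q = process_var        # assumed process noise variance
        self.r = measurement_var    # assumed measurement noise variance

    def predict(self):
        # Random walk: the predicted state is unchanged, its uncertainty grows.
        self.p += self.q
        return self.x

    def normalized_innovation(self, z):
        """Squared prediction error scaled by its predicted variance."""
        return (z - self.x) ** 2 / (self.p + self.r)

    def update(self, z):
        gain = self.p / (self.p + self.r)
        self.x += gain * (z - self.x)
        self.p *= (1.0 - gain)


tone_a = math.log2(500.0)    # hypothetical tone of the tracked stream
tone_b = math.log2(1000.0)   # hypothetical competing tone, one octave higher

tracker = ScalarKalman(initial_state=tone_a)
for _ in range(10):
    tracker.predict()
    tracker.update(tone_a)

tracker.predict()
error_same = tracker.normalized_innovation(tone_a)
error_other = tracker.normalized_innovation(tone_b)
```

A tone that continues the tracked stream yields a small normalized prediction error, while the competing tone yields a large one; a segregation rule could threshold this error, though the threshold and the link to perceptual data are beyond this sketch.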

Keywords: Indexes Kalman filters Frequency response


Learning to Relate Literal and Sentimental Descriptions of Visual Properties 2


2nd Workshop on Computational Linguistics for Literature, CLfL 2013 at the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2013

Authors: Yatskar, Mark Volkova, Svitlana Celikyilmaz, Asli Dolan, Bill Zettlemoyer, Luke Computer Science and Engineering University of Washington Seattle WA United States Center for Language and Speech Processing Johns Hopkins University Baltimore MD United States Conversational Understanding Sciences Microsoft Mountain View CA United States NLP Group Microsoft Research Redmond WA United States

ISBN: (print) 9781937284473

Language can describe our visual world at many levels, including not only what is literally there but also the sentiment that it invokes. In this paper, we study visual language, both literal and sentimental, that describes the overall appearance and style of virtual characters. Sentimental properties, including labels such as "youthful" or "country western," must be inferred from descriptions of the more literal properties, such as facial features and clothing selection. We present a new dataset, collected to describe Xbox avatars, as well as models for learning the relationships between these avatars and their literal and sentimental descriptions. In a series of experiments, we demonstrate that such learned models can be used for a range of tasks, including predicting sentimental words and using them to rank and build avatars. Together, these results demonstrate that sentimental language provides a concise (though noisy) means of specifying low-level visual properties. © 2013 Association for Computational Linguistics.

Keywords: Visual languages


Learning to relate literal and sentimental descriptions of visual properties


2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2013

ISBN: (print) 9781937284473

Keywords: Visual languages


Hierarchy identification for automatically generating table-of-contents


9th International Conference on Recent Advances in Natural Language Processing, RANLP 2013

Authors: Erbs, Nicolai Gurevych, Iryna Zesch, Torsten Ubiquitous Knowledge Processing Lab. Department of Computer Science Technische Universität Darmstadt Germany Information Center for Education German Institute for Educational Research and Educational Information Germany Language Technology University of Duisburg-Essen Germany

A table-of-contents (TOC) provides a quick reference to a document's content and structure. We present the first study on identifying the hierarchical structure for automatically generating a TOC using only textual features instead of structural hints e.g. from HTML-tags. We create two new datasets to evaluate our approaches for hierarchy identification. We find that our algorithm performs on a level that is sufficient for a fully automated system. For documents without given segment titles, we extend our work by automatically generating segment titles. We make the datasets and our experimental framework publicly available in order to foster future research in TOC generation.

Keywords: Automation


Phonetically balanced and psychometrically equivalent monosyllabic word lists for word recognition testing in Thai


Proceedings of Meetings on Acoustics, 2015, Vol. 25, No. 1

Authors: Sajeerat Poonyaban Pasinee Aungsakulchai Charturong Tantibundhit Chutamanee Onsuwan Rattinan Tiravanitchakul Krit Kosawat Adirek Munthuli 1Center of Excellence in Intelligent Informatics Speech and Language Technology and Service Innovation (CILS) Thammasat University Thailand 2Department of Electrical and Computer Engineering Faculty of Engineering Thammasat University Rangsit Campus Khlong Luang Pathum Thani Thailand pond_poonyaban@*** meilyp.aung@*** tchartur@engr.tu.ac.th ***@student.tu.ac.th 3Department of English and Linguistics Faculty of Liberal Arts Thammasat University Rangsit Campus Khlong Luang Pathum Thani Thailand consuwan@tu.ac.th 4Department of Communication Science and Disorders Faculty of Medicine Mahidol University Thailand rartv@*** 5NECTEC National Science and Technology Development Agency (NSTDA) Pathum Thani Thailand krit.kosawat@nectec.or.th

Word recognition testing may be defined as a procedure to assess a listener’s ability to identify one-syllable words (such as phonetically-balanced/PB words) that are presented at a given suprathreshold level to arrive at a word recognition score. For Thai, Thammasat University and Ramathibodi Hospital Phonetically Balanced Word Lists 2015 (TU-RAMA PB’15) were created with five lists, each with 25 monosyllabic words. Besides its phoneme distributions being based on large-scale Thai spoken corpora, TU-RAMA PB’15 is in line with TU PB’14 with emphasis on phonetic balance, symmetrical phoneme occurrence, and word familiarity. To evaluate its homogeneity in terms of decibel intelligibility, the lists were recorded and presented to 10 normal hearing participants, ranging from 0 to 50 dB HL in 2 dB increments (ascending order) until they repeated correct verbal responses. Using logistic regression, regression slopes and intercepts were calculated to estimate percentage of correct performance at any given intensity and to construct psychometric functions for every list. Derived psychometric function slopes ranged from 0.2015 to 0.2262 while intensities required for 50% intelligibility ranged from 17.0876 to 20.8856. Two-way Chi-Square analysis performed on both parameters indicated that there was no significant difference among the five lists.
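To make the reported parameters concrete, the sketch below evaluates a logistic psychometric function from a slope and a 50% point. It assumes the common form P(x) = 1 / (1 + exp(-(a + b*x))), for which the 50% point is -a/b; the abstract does not state the exact parameterization the authors used. Pairing the smallest reported slope with the smallest reported 50% intensity is purely illustrative, and the function names are hypothetical.

```python
import math


def intercept_from_midpoint(slope, midpoint_db):
    """Intercept a such that a + slope * midpoint_db == 0 (the 50% point)."""
    return -slope * midpoint_db


def percent_correct(level_db, slope, intercept):
    """Predicted percentage of correct responses at a presentation level in dB HL."""
    return 100.0 / (1.0 + math.exp(-(intercept + slope * level_db)))


slope = 0.2015       # smallest slope reported in the abstract
midpoint = 17.0876   # smallest 50% intensity reported in the abstract (illustrative pairing)
intercept = intercept_from_midpoint(slope, midpoint)

at_midpoint = percent_correct(midpoint, slope, intercept)
at_40_db = percent_correct(40.0, slope, intercept)
```

Under this assumed form, the predicted score is 50% at the midpoint by construction and rises toward 100% well above it, which is how a fitted slope and intercept yield a score estimate at any intensity.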
