Search Results - Inner Mongolia University Library

Annual Conference on Information Sciences and Systems

Authors: Carlin, Michael A. Elhilali, Mounya Center for Language and Speech Processing Johns Hopkins University Baltimore MD 21218 United States Human Language Technology Center of Excellence Johns Hopkins University Baltimore MD 21218 United States Dept. of Electrical and Computer Engineering Johns Hopkins University Baltimore MD 21218 United States

ISBN: (print) 9781424498475

It is well known that speech sounds evolve at multiple timescales over the course of tens to hundreds of milliseconds. Such temporal modulations are crucial for speech perception and are believed to directly influence the underlying code for representing acoustic stimuli. The present work seeks to explicitly quantify this relationship using the principle of temporal coherence. Here we show that by constraining the outputs of model linear neurons to be highly correlated over timescales relevant to speech, we observe the emergence of neural response fields that are bandpass, localized, and reflective of the rich spectro-temporal structure present in speech. The emergent response fields also appear to share qualitative similarities with those observed in auditory neurophysiology. Importantly, learning is accomplished using unlabeled speech data, and the emergent neural properties well-characterize the spectro-temporal statistics of the input. We analyze the characteristics and coverage of ensembles of learned response fields for a variety of timescales, and suggest uses of such a coherence learning framework for common speech tasks. © 2011 IEEE.

Keywords: Speech

Ecological loudness: Binaural loudness constancy International Congress on Acoustics, ICA 2010

20th International Congress on Acoustics 2010, ICA 2010 - Incorporating the 2010 Annual Conference of the Australian Acoustical Society

Authors: Florentine, Mary Epstein, Michael Dept. of Speech-Language Pathology and Audiology Institute for Hearing Speech and Language Northeastern University 360 Huntington Ave. Boston MA 02115 United States Auditory Modeling and Processing Laboratory Dept. of Speech-Language Pathology and Audiology Northeastern University 360 Huntington Ave Boston MA 02115 United States CDSP Center Dept. of Electrical and Computer Engineering Northeastern University 360 Huntington Ave Boston MA 02115 United States

ISBN: (print) 9781617827457

Are conclusions about loudness drawn from tones presented via earphones in laboratories applicable to listening to a talker in a room? The present experiment tests the following hypothesis: speech from the same talkers presented under more ecologically valid conditions results in a smaller binaural-to-monaural loudness ratio than speech presented without visual cues and/or presented via headphones. Twelve normal listeners were presented two types of stimuli (recorded speech, with and without visual cues) monaurally and binaurally across a wide range of levels. The same stimuli were presented via earphones and loudspeakers. Loudness was measured using magnitude estimation. Results show that the binaural-to-monaural loudness ratio was significantly less for speech with visual cues presented via a loudspeaker than for stimuli with any other combination of test parameters (i.e., speech without visual cues presented via both headphones and loudspeakers, and speech presented with visual cues via headphones). The present results indicate that the loudness of a visually present talker in daily environments is little affected by switching between binaural and monaural listening. This phenomenon has been dubbed "Binaural Loudness Constancy," because of its similarity to loudness constancy that occurs with distance from the speaker. The present experiment supports the importance of ecological validity in loudness research, which could change how perception of loudness is understood.

Keywords: Speech

Metamodeling for community coordinated multimedia and experience on metamodel-driven content annotation service prototype

2008 IEEE Congress on Services, SERVICES 2008

Authors: Zhou, Jiehan Rautiainen, Mika Ylianttila, Mika Foulonneau, Muriel Blandin, Patrick Information processing laboratory Dept. of Electrical and Information Engineering University of Oulu Finland Reference modelling /Knowledge systems Centre Henri Tudor Luxembourg Luxembourg

ISBN: (print) 9780769532868

Community Coordinated Multimedia (CCM) provides an extended and enhanced human experience by collaboratively consuming electronic and networked content and multimedia-intensive services. The community coordinated multimedia vision raises the challenges of multimedia content creation, interpretation, exchange, and consumption over a large range of heterogeneous services, terminals and networks. The development of the CCM metamodel plays a key role in tackling these challenges by enabling transparent multimedia aggregation and exchange across communities. This paper aims to design and develop the CCM metamodel. A generic terminology for CCM metamodeling is developed. An extensive study on metamodeling activities and metamodeling methodologies is presented. A combined metamodeling approach is proposed for the CCM metamodel development. Experiences and results on the CCM metamodeling and CCM metamodel-driven content annotation prototype are elaborated. © 2008 IEEE.

Keywords: Computer programming

Unigram language models using diffusion smoothing over graphs

2nd Workshop on Graph-Based Algorithms for Natural Language Processing, TextGraphs 2007

Authors: Jedynak, Bruno Karakos, Damianos Dept. of Appl. Mathematics and Statistics Center for Imaging Sciences Johns Hopkins University Baltimore MD 21218-2686 United States Dept. of Electrical and Computer Engineering Center for Language and Speech Processing Johns Hopkins University Baltimore MD 21218-2686 United States

We propose to use graph-based diffusion techniques with data-dependent kernels to build unigram language models. Our approach entails building graphs, where each vertex corresponds uniquely to a word from a closed vocabulary, and the existence of an edge (with an appropriate weight) between two words indicates some form of similarity between them. In one of our constructions, we place an edge between two words if the number of times these words were seen in a training set differs by at most one count. This graph construction results in a similarity matrix with small intrinsic dimension, since words with the same counts have the same neighbors. Experimental results from a benchmark task from language modeling show that our method is competitive with the Good-Turing estimator.
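The count-based graph construction described in this abstract can be illustrated with a minimal sketch. It covers only the stated edge rule (two words are connected when their training counts differ by at most one) and omits edge weights and the diffusion step, which the abstract does not specify. The function name `build_count_graph`, the toy sentence, and the choice to keep self-loops are assumptions made for illustration, not details taken from the paper.

```python
from collections import Counter


def build_count_graph(tokens, vocabulary):
    """Connect two vocabulary words when their training counts differ by at most one."""
    counts = Counter(tokens)
    # Closed vocabulary: words never seen in training get a count of zero.
    word_counts = {word: counts.get(word, 0) for word in vocabulary}
    # Self-loops are kept (an assumption of this sketch), so words with equal
    # counts end up with identical neighbor sets.
    neighbors = {
        u: {v for v in vocabulary if abs(word_counts[u] - word_counts[v]) <= 1}
        for u in vocabulary
    }
    return word_counts, neighbors


if __name__ == "__main__":
    # Hypothetical toy training text and vocabulary.
    training_tokens = "the cat saw the dog and the cat".split()
    vocab = {"the", "cat", "dog", "saw", "and", "bird"}
    word_counts, neighbors = build_count_graph(training_tokens, vocab)
    for word in sorted(vocab):
        print(word, word_counts[word], sorted(neighbors[word]))
```

Because the neighbor set of a word depends only on its count, the adjacency matrix has at most as many distinct rows as there are distinct count values, which is consistent with the small intrinsic dimension mentioned in the abstract.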

Keywords: Graphic methods

Semantics, Web and Mining: Preface

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2006, vol. 4289 LNAI, pp. v-vi

Authors: Berendt, Bettina Hotho, Andreas Mladenič, Dunja Semeraro, Giovanni Spiliopoulou, Myra Stumme, Gerd Van Someren, Maarten Ackermann, Markus Grobelnik, Marko Svátek, Vojtěch Institute of Information Systems Humboldt University Berlin Germany Knowledge and Data Engineering Group University of Kassel Germany J. Stefan Institute Ljubljana Slovenia Department of Informatics University of Bari Italy Faculty of Computer Science Otto-von-Guericke-Univ. Magdeburg Germany Informatics Institute University of Amsterdam Netherlands Dept. of Natural Language Processing Institute for Computer Science University of Leipzig Germany University of Economics Prague Czech Republic

MoSS: A program for molecular substructure mining

1st International Workshop on Open Source Data Mining: Frequent Pattern Mining Implementations, OSDM 2005, held in conjunction with the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Authors: Borgelt, Christian Meinl, Thorsten Berthold, Michael Dept. of Knowledge Processing and Language Engineering University of Magdeburg Universitätsplatz 2 39106 Magdeburg Germany Computer Science Department 2 University of Erlangen-Nuremberg Martenstraße 3 91058 Erlangen Germany Dept. of Computer and Information Science University of Konstanz 78457 Konstanz Germany

ISBN: (print) 1595932100

Molecular substructure mining is currently an intensively studied research area. In this paper we present an implementation of an algorithm for finding frequent substructures in a set of molecules, which may also be used to find substructures that discriminate well between a focus and a complement group. In addition to the basic algorithm, we discuss advanced pruning techniques, demonstrating their effectiveness with experiments on two publicly available molecular data sets, and briefly mention some other extensions. Copyright 2005 ACM.

Information retrieval using word senses: Root sense tagging approach

Proceedings of Sheffield SIGIR - Twenty-Seventh Annual International ACM SIGIR Conference on Research and Development in Information Retrieval

Authors: Kim, Sang-Bum Seo, Hee-Cheol Rim, Hae-Chang Natural Language Processing Lab. Dept. of Comp. Sci. and Engineering Korea University Anam-dong 5 ka SungPuk-gu Seoul 136-701 Korea Republic of

ISBN: (print) 1581138814

Information retrieval using word senses is emerging as a good research challenge in semantic information retrieval. In this paper, we propose a new method using word senses in information retrieval: the root sense tagging method. This method assigns coarse-grained word senses defined in WordNet to query terms and document terms in an unsupervised way, using co-occurrence information constructed automatically. Our sense tagger is crude, but performs consistent disambiguation by considering only the single most informative word as evidence to disambiguate the target word. We also allow multiple-sense assignment to alleviate the problem caused by incorrect disambiguation. Experimental results on a large-scale TREC collection show that our approach to improving retrieval effectiveness is successful, while most previous work failed to improve performance even on small text collections. Our method also shows promising results when combined with pseudo relevance feedback and a state-of-the-art retrieval function such as BM25.

Keywords: Information retrieval

Genetic mining of DNA sequence structures for effective classification of the risk types of human papillomavirus (HPV)

11th International Conference on Neural Information Processing, ICONIP 2004

Authors: Eom, Jae-Hong Park, Seong-Bae Zhang, Byoung-Tak Biointelligence Lab School of Computer Science and Engineering Seoul National University Seoul 151-744 Korea Republic of Language and Information Processing Lab Dept. of Computer Engineering Kyungpook National University Daegu 702-701 Korea Republic of

ISBN: (print) 3540239316

Human papillomavirus (HPV) is considered to be the most common sexually transmitted disease, and HPV infection is known as the major factor for cervical cancer. There are more than 100 types of HPV, and each type falls into one of two risk classes, low or high. In particular, high risk type HPV is known to be the most important factor in medical judgment. Thus, classifying the risk type of HPV is very important to the treatment of cervical cancer. In this paper, we present a machine learning approach to mine the structure of HPV DNA sequences for effective classification of the HPV risk types. We learn the most informative subsequence segment sets and their weights with a genetic algorithm to classify the risk type of each HPV. To resolve the problem of the computational complexity of the genetic algorithm, we use a distributed intelligent data engineering platform based on the active grid concept, ***@Home. The proposed genetic mining method, with the described platform, shows about 85.6% classification accuracy with relatively fast mining speed. © Springer-Verlag Berlin Heidelberg 2004.

Keywords: DNA sequences

SpeechFind: Spoken document retrieval for a National Gallery of the Spoken Word

Proceedings of the 6th Nordic Signal Processing Symposium, NORSIG 2004

Authors: Hansen, John H. L. Huang, Rongqing Mangalath, Praful Zhou, Bowen Seadle, Michael Deller Jr., John R. Robust Speech Processing Group Center for Spoken Language Research University of Colorado Boulder CO 80309-0594 United States Michigan State University E308 Main Library East Lansing MI 48824 United States Michigan State University Dept. Electrical Engineering East Lansing MI 48824 United States

In this study, we discuss a number of issues in audio stream phrase recognition for information retrieval for a new National Gallery of the Spoken Word (NGSW). NGSW is the first large-scale repository of its kind, consisting of speeches, news broadcasts, and recordings of historical content from the 20th century. We propose a system diagram and discuss critical tasks associated with effective audio information retrieval, which include advanced audio segmentation, speech recognition model adaptation for acoustic background noise and speaker variability, and natural language processing for text query requests. A number of questions regarding copyright assessment, metadata construction, and digital watermarking must also be addressed for a sustainable audio collection of this magnitude. Our experimental online system, entitled "SpeechFind", is presented; it allows for audio retrieval from a portion of the NGSW corpus. We discuss a number of research challenges to address the overall task of robust phrase searching in unrestricted audio corpora.

Keywords: Speech recognition

Some charactistical aspects of MarkRead - A software package for automatic mark data entry

Asia-Pacific Conference on Circuits and Systems, APCCAS 2002

Authors: Tao, Ngo Quoc Toan, Do Nang Dept. of Image Processing and Knowledge Engineering Institute of Information Technology Hoang Quoc Viet Street 18 Caugiay Hanoi Viet Nam

ISBN: (print) 0780376900

Automatic data entry plays an important role in improving the speed and effectiveness of information technology. In order to recognize forms and enter the results into a database, it is necessary to correctly isolate geometrical objects and match forms with a pattern. Therefore, this paper presents some difficult problems of optical mark recognition, such as detecting image skew, margins and basic geometrical objects, which are solved in the process of developing MarkRead, a software package for automatic mark data entry. © 2002 IEEE.

Keywords: Optical data processing
