Search Results - Inner Mongolia University Library

10th International Workshop on Finite State Methods and Natural Language Processing, FSMNLP 2012

Authors: Calvo, Marcos; Gómez, Jon Ander; Hurtado, Lluís-F.; Sanchis, Emilio. Departament de Sistemes Informàtics i Computació, Universitat Politècnica de València, Camí de Vera s/n, 46022 València, Spain

In this work, we describe a methodology based on the Stochastic Finite State Transducers paradigm for Spoken Language Understanding (SLU) that obtains concept graphs from word graphs. The edges of these concept graphs represent both semantic and lexical information, which makes the graphs a very useful representation of the information for SLU. The best path in a concept graph provides the best sequence of concepts. © 2012 Association for Computational Linguistics.

Keywords: Graphic methods
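The abstract above says the best path in a concept graph yields the best concept sequence. A minimal sketch of that idea follows, assuming a toy acyclic graph whose integer node ids are already in topological order and whose edges carry a concept, a word span, and a probability. The function name `best_concept_path`, the edge layout, and the example data are all hypothetical and are not taken from the paper.

```python
import math
from collections import defaultdict


def best_concept_path(edges, start, end):
    """Return (log_prob, [(concept, words), ...]) for the highest-scoring path.

    edges: list of (src, dst, concept, words, prob) with src < dst, so the
    integer node ids give a topological order (an assumption of this sketch).
    Returns None if `end` cannot be reached from `start`.
    """
    outgoing = defaultdict(list)
    for src, dst, concept, words, prob in edges:
        outgoing[src].append((dst, concept, words, math.log(prob)))

    # best[node] = (best log-probability so far, edge labels along that path)
    best = {start: (0.0, [])}
    for node in range(start, end):
        if node not in best:
            continue  # node is unreachable from start
        score, labels = best[node]
        for dst, concept, words, logp in outgoing[node]:
            candidate = (score + logp, labels + [(concept, words)])
            if dst not in best or candidate[0] > best[dst][0]:
                best[dst] = candidate
    return best.get(end)


# Hypothetical concept graph: each edge holds semantic and lexical information.
edges = [
    (0, 1, "query", "I want", 0.9),
    (1, 3, "destination", "to Valencia", 0.6),
    (1, 2, "origin", "to", 0.3),
    (2, 3, "destination", "Valencia", 0.5),
]
score, path = best_concept_path(edges, 0, 3)
concepts = [concept for concept, _ in path]
```

Working in log-probabilities turns the product of edge probabilities into a sum, which avoids underflow on long paths.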

Kleene, a free and open-source language for finite-state programming

10th International Workshop on Finite State Methods and Natural Language Processing, FSMNLP 2012

Author: Beesley, Kenneth R. SAP Labs LLC, P.O. Box 540475, North Salt Lake, UT 84054, United States

Kleene is a high-level programming language, based on the OpenFst library, for constructing and manipulating finite-state acceptors and transducers. Users can program using regular expressions, alternation-rule syntax and right-linear phrase-structure grammars; and Kleene provides variables, lists, functions and familiar program-control syntax. Kleene has been approved by SAP AG for release as free, open-source code under the Apache License, Version 2.0, and will be available by August 2012 for downloading from http:// ***. The design, implementation, development status and future plans for the language are discussed. © 2012 Association for Computational Linguistics.

Keywords: Open source software

Enrichment of inflection dictionaries: Automatic extraction of semantic labels from encyclopedic definitions

9th International Workshop on Natural Language Processing and Cognitive Science, NLPCS 2012, in Conjunction with ICEIS 2012

Author: Chrzaszcz, Pawel. Computational Linguistics Department, Jagiellonian University, Golebia 24, Kraków, Poland; Computer Science Department, AGH University of Science and Technology, Mickiewicza 30, Kraków, Poland

ISBN: (print) 9789898565167

Inflection dictionaries are widely used in many natural language processing tasks, especially for inflecting languages. However, they lack semantic information, which could increase the accuracy of such processing. This paper describes a method to extract semantic labels from encyclopedic entries. Adding such labels to an inflection dictionary could eliminate the need to use ontologies and similar complex semantic structures for many typical tasks. A semantic label is either a single word or a sequence of words that describes the meaning of a headword, so it is similar to a semantic category. However, no taxonomy of such categories is known prior to the extraction. Encyclopedic articles consist of headwords and their definitions, so the definitions are used as sources for semantic labels. The described algorithm has been implemented for extracting data from the Polish Wikipedia. It is based on definition structure analysis, heuristic methods, and word form recognition and processing using the Polish Inflection Dictionary. This paper contains a description of the method and test results, as well as a discussion of possible further development.

Keywords: Heuristic methods

Deep Unsupervised Feature Learning for Natural Language Processing

2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2012

Author: Gouws, Stephan. MIH Media Lab, Stellenbosch University, Stellenbosch, South Africa

ISBN: (print) 1937284204

Statistical natural language processing (NLP) builds models of language based on statistical features extracted from the input text. We investigate deep learning methods for unsupervised feature learning for NLP tasks. Recent results indicate that features learned using deep learning methods are not a silver bullet and do not always lead to improved results. In this work we hypothesise that this is the result of a disjoint training protocol which results in mismatched word representations and classifiers. We also hypothesise that modelling long-range dependencies in the input and (separately) in the output layers would further improve performance. We suggest methods for overcoming these limitations, which will form part of our final thesis work. © 2012 Association for Computational Linguistics

Keywords: Deep learning

Usability-driven pruning of large ontologies: the case of SNOMED CT

Journal of the American Medical Informatics Association, 2012, Vol. 19, Issue E1, pp. E102-E109

Authors: Lopez-Garcia, Pablo; Boeker, Martin; Illarramendi, Arantza; Schulz, Stefan. Univ Basque Country, Dept Lenguajes & Sistemas Informat, 20008 Donostia-San Sebastian, Spain; Univ Freiburg, Inst Med Biometrie & Med Informat, D-79106 Freiburg, Germany; Med Univ Graz, Inst Med Informat Stat & Dokument, Graz, Austria

Objectives: To study ontology modularization techniques when applied to SNOMED CT in a scenario in which no previous corpus of information exists and to examine if frequency-based filtering using MEDLINE can reduce subset size without discarding relevant concepts. Materials and methods: Subsets were first extracted using four graph-traversal heuristics and one logic-based technique, and were subsequently filtered with frequency information from MEDLINE. Twenty manually coded discharge summaries from cardiology patients were used as signatures and test sets. The coverage, size, and precision of extracted subsets were measured. Results: Graph-traversal heuristics provided high coverage (71-96% of terms in the test sets of discharge summaries) at the expense of subset size (17-51% of the size of SNOMED CT). Pre-computed subsets and logic-based techniques extracted small subsets (1%), but coverage was limited (24-55%). Filtering reduced the size of large subsets to 10% while still providing 80% coverage. Discussion: Extracting subsets to annotate discharge summaries is challenging when no previous corpus exists. Ontology modularization provides valuable techniques, but the resulting modules grow as signatures spread across subhierarchies, yielding a very low precision. Conclusion: Graph-traversal strategies and frequency data from an authoritative source can prune large biomedical ontologies and produce useful subsets that still exhibit acceptable coverage. However, a clinical corpus closer to the specific use case is preferred when available.

Keywords: Controlled terminologies and vocabularies; data bases; developing and refining EHR data standards (including image standards); knowledge bases; knowledge representations; measuring/improving patient safety and reducing medical errors; medical records; natural-language processing; ontology; other methods of information extraction; semantic web; SNOMED CT; software; systematized nomenclature of medicine; vocabulary
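The abstract above reports coverage, size, and precision for each extracted subset without defining them. The sketch below uses one plausible reading, stated here as an assumption: coverage is the share of test-set concepts found in the subset, precision is the share of subset concepts that occur in the test set, and size is the subset's share of the full ontology. The function `subset_metrics` and the concept ids are hypothetical.

```python
def subset_metrics(subset, test_terms, ontology_size):
    """Compute coverage, precision and relative size of an ontology subset.

    The definitions are assumptions made for this sketch, not the paper's:
      coverage      = |subset ∩ test| / |test|
      precision     = |subset ∩ test| / |subset|
      relative_size = |subset| / ontology_size
    """
    subset, test_terms = set(subset), set(test_terms)
    hits = subset & test_terms
    return {
        "coverage": len(hits) / len(test_terms),
        "precision": len(hits) / len(subset),
        "relative_size": len(subset) / ontology_size,
    }


# Hypothetical concept identifiers; real input would be SNOMED CT concept ids.
metrics = subset_metrics(
    subset={"c1", "c2", "c3", "c4"},
    test_terms={"c1", "c2", "c5"},
    ontology_size=20,
)
```

Under these definitions a large subset can score high on coverage while precision stays low, which matches the trade-off the abstract describes.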

Intelligent Tutoring Systems for Foreign Language Learning: The Bridge to International Communication

2012

Authors: Merryanna L Swartz; Masoud Yazdani

ISBN: (print) 9783642772047

This volume presents the proceedings of an international workshop held for a multidisciplinary group of researchers involved in intelligent tutoring systems research for language learning. The papers include work on: - Computational bases, tools, and environments for delivering language instruction, - Theoretical frameworks for developing language-based architectures and computational grammars, - Pedagogical practice, learner characteristics, and learner performance data, - Methods for representing tutoring and student modelling knowledge in the tutoring system, - Existing systems for language learning. The approach to developing intelligent tutoring systems that integrates natural language processing in a multimedia environment is new. This book presents readers with the state of the art in the field in a single volume, with contributors from computer science, linguistics, and psychology.

A New Parametric Estimation Method for Graph-based Clustering

Workshop on Graph-based Methods for Natural Language Processing

Authors: Javid Ebrahimi; Mohammad Saniee Abadeh. Faculty of Electrical & Computer Engineering, Tarbiat Modares University, Tehran, Iran

ISBN: (print) 9781627483445

Relational clustering has received much attention from researchers in the last decade. In this paper we present a parametric method that employs a combination of both hard and soft clustering. Based on the corresponding Markov chain of an affinity matrix, we simulate a probability distribution on the states by defining a conditional probability for each subpopulation of states. This probabilistic model enables us to use expectation maximization for parameter estimation. The effectiveness of the proposed approach is demonstrated on several real datasets against spectral clustering methods.

Keywords: Clustering methods; Probability distribution; parametric method; Statistical analysis; Probabilistic Model; Markov chain; Cluster Analysis; Parameter estimation; Parametric; Conditional probability
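The abstract above builds on "the corresponding Markov chain of an affinity matrix". One standard way to obtain such a chain, assumed here because the record does not spell it out, is to row-normalise the affinities so that each row becomes a probability distribution over next states. The sketch covers only that step; the paper's conditional-probability model and its expectation-maximization procedure are not reproduced. The function name and the toy matrix are hypothetical.

```python
def transition_matrix(affinity):
    """Row-normalise a non-negative affinity matrix into Markov transition
    probabilities, so that P[i][j] = affinity[i][j] / sum(affinity[i])."""
    rows = []
    for row in affinity:
        total = sum(row)
        if total == 0:
            raise ValueError("isolated node: its affinities sum to zero")
        rows.append([a / total for a in row])
    return rows


# Hypothetical symmetric affinities between three data points.
affinity = [
    [0.0, 2.0, 2.0],
    [2.0, 0.0, 6.0],
    [2.0, 6.0, 0.0],
]
P = transition_matrix(affinity)
```

Each row of `P` sums to one, so `P[i][j]` can be read as the probability of moving from state `i` to state `j` in a single step of the chain.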

Bringing the Associative Ability to Social Tag Recommendation

Workshop on Graph-based Methods for Natural Language Processing

Authors: Miao Fan; Yingnan Xiao; Qiang Zhou. Department of Computer Science and Technology, Tsinghua University; School of Software Engineering, Beijing University of Posts and Telecommunications

ISBN: (print) 9781627483445

Social tagging systems, which allow users to freely annotate online resources with tags, became popular in the Web 2.0 era. To ease the annotation process, research on social tag recommendation has drawn much attention in recent years. Modeling the social tagging behavior could better reflect the nature of this issue and improve the result of recommendation. In this paper, we propose a novel approach that brings the associative ability to modeling the social tagging behavior and thereby enhances the performance of automatic tag recommendation. To simulate the human tagging process, our approach ranks the candidate tags on a weighted digraph built from the semantic relationships among meaningful words in the summary and the corresponding tags for a given resource. The semantic relationships are learnt via a word alignment model from statistical machine translation on large datasets. Experiments on real-world datasets demonstrate that our method is effective, robust and language-independent compared with the state-of-the-art methods.

Keywords: Tagging; machine translation; language-independent; Online resources; Associative sign systems

Gesture and Sign Language in Human-Computer Interaction and Embodied Communication - 9th International Gesture Workshop, GW 2011, Revised Selected Papers

9th International Gesture Workshop, GW 2011

ISBN: (print) 9783642341816

The proceedings contain 24 papers. The topics discussed include: gestures in assisted living environments; choosing and modeling the hand gesture database for a natural user interface; user experience of gesture based interfaces: a comparison with traditional interaction methods on pragmatic and hedonic qualities; low cost force-feedback interaction with haptic digital audio effects; the role of spontaneous gestures in spatial problem solving; effects of spectral features of sound on gesture type and timing; human-motion saliency in complex scenes; what, why, where and how do children think? towards a dynamic model of spatial cognition as action; a labanotation based ontology for representing dance movement; assessing agreement on segmentations by means of staccato, the segmentation agreement calculator according to Thomann; and how do iconic gestures convey visuo-spatial information? bringing together empirical, theoretical, and simulation studies.

A possibilistic approach for automatic word sense disambiguation

24th Conference on Computational Linguistics and Speech Processing, ROCLING 2012

Authors: Khiroun, Oussama Ben; Elayeb, Bilel; Bounhas, Ibrahim; Evrard, Fabrice; Saoud, Narjès Bellamine Ben. RIADI Research Laboratory, ENSI, Manouba University, 2010, Tunisia; IRIT-ENSEEIHT, 02 Rue Camichel, 31071 Toulouse Cedex 7, France; Department of Computer Science, Faculty of Sciences of Tunis, 1060 Tunis, Tunisia

ISBN: (print) 9789573079255

This paper presents and evaluates a new approach to automatic word sense disambiguation (WSD) applied to French texts. First, we draw on possibility theory by taking advantage of a double relevance measure (possibility and necessity) between words and their contexts. Second, we propose, analyze and compare two different training methods: judgment-based and dictionary-based training. Third, we summarize and discuss the overall performance of the various tests in a global analysis. To assess and compare our approach with similar WSD systems, we performed experiments on the standard ROMANSEVAL test collection.

Keywords: Natural language processing systems
