检索结果-内蒙古大学图书馆

9th workshop on graph-based methods for natural language processing, Textgraphs 2014, in conjunction with the Conference on Empirical methods in natural language processing, EMNLP 2014

作者： Asheghi, Noushin Rezapour Markert, Katja Sharoff, Serge School of Computing University of Leeds United Kingdom L3S Research Center Leibniz Universität Hannover School of Computing University of Leeds United Kingdom School of Modern Languages and Cultures University of Leeds United Kingdom

ISBN: (纸本)9781937284961

Until now, it is still unclear which set of features produces the best result in automatic genre classification on the web. Therefore, in the first set of experiments, we compared a wide range of content-based features which are extracted from the data appearing within the web pages. The results show that lexical features such as word unigrams and character n-grams have more discriminative power in genre classification compared to features such as part-of-speech n-grams and text statistics. In a second set of experiments, with the aim of learning from the neighbouring web pages, we investigated the performance of a semi-supervised graph-based model, which is a novel technique in genre classification. The results show that our semi-supervised min-cut algorithm improves the overall genre classification accuracy. However, it seems that some genre classes benefit more from this graph-based model than others. © 2014 Association for Computational Linguistics

关键词： Websites

来源：评论

学校读者我要写书评

暂无评论

From visualisation to hypothesis construction for second language acquisition 9

From visualisation to hypothesis construction for second lan...

引用

9th workshop on graph-based methods for natural language processing, Textgraphs 2014, in conjunction with the Conference on Empirical methods in natural language processing, EMNLP 2014

作者： Malmasi, Shervin Dras, Mark Centre for Language Technology Macquarie University Sydney NSW Australia

ISBN: (纸本)9781937284961

One research goal in Second language Acquisition (SLA) is to formulate and test hypotheses about errors and the environments in which they are made, a process which often involves substantial effort;large amounts of data and computational visualisation techniques promise help here. In this paper we have defined a new task for finding contexts for errors that vary with the native language of the speaker that are potentially useful for SLA research. We propose four models for approaching this task, and find that one based only on error-feature co-occurrence and another based on determining maximum weight cliques in a feature association graph discover strongly distinguishing contexts, with an apparent trade-off between false positives and very specific contexts. © 2014 Association for Computational Linguistics

关键词： Errors

来源：评论

学校读者我要写书评

暂无评论

proceedings of Textgraphs@EMNLP 2013: The 8th workshop on graph-based methods for natural language processing

Proceedings of TextGraphs@EMNLP 2013: The 8th Workshop on Gr...

引用

8th workshop on graph-based methods for natural language processing, Textgraphs 2013, at the Conference on Empirical methods in natural language processing, EMNLP 2013

ISBN: (纸本)9781937284978

The proceedings contain 11 papers. The topics discussed include: event-centered information retrieval using kernels on event graphs;reconstructing big semantic similarity networks;graph-based unsupervised learning of word similarities using heterogeneous feature types;understanding seed selection in bootstrapping;graph-structures matching for review relevance identi?cation;automatic extraction of reasoning chains from textual reports;graph-based Approaches for organization entity resolution in MapReduce;and a graph-based approach to skill extraction from text.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Exploiting timegraphs in temporal relation classification 9

Exploiting timegraphs in temporal relation classification

引用

9th workshop on graph-based methods for natural language processing, Textgraphs 2014, in conjunction with the Conference on Empirical methods in natural language processing, EMNLP 2014

作者： Laokulrat, Natsuda Miwa, Makoto Tsuruoka, Yoshimasa University of Tokyo 3-7-1 Hongo Bunkyo-ku Tokyo Japan Toyota Technological Institute 2-12-1 Hisakata Tempaku-ku Nagoya Japan

ISBN: (纸本)9781937284961

Most of the recent work on machine learning-based temporal relation classification has been done by considering only a given pair of temporal entities (events or temporal expressions) at a time. Entities that have temporal connections to the pair of temporal entities under inspection are not considered even though they provide valuable clues to the prediction. In this paper, we present a new approach for exploiting knowledge obtained from nearby entities by making use of timegraphs and applying the stacked learning method to the temporal relation classification task. By performing 10-fold cross validation on the Timebank corpus, we achieved an F1 score of 59.61% based on the graph-based evaluation, which is 0.16 percentage points higher than that of the local approach. Our system outperformed the state-of-the-art system that utilizes global information and achieved about 1.4 percentage points higher accuracy. © 2014 Association for Computational Linguistics

关键词： graphic methods

来源：评论

学校读者我要写书评

暂无评论

Arabic Native language Identification

Arabic Native Language Identification

引用

EMNLP 2014 workshop on Arabic natural language processing, ANLP 2014

作者： Malmasi, Shervin Dras, Mark Centre for Language Technology Macquarie University SydneyNSW Australia

ISBN: (纸本)9781937284961

In this paper we present the first application of Native language Identification (NLI) to Arabic learner data. NLI, the task of predicting a writer’s first language from their writing in other languages has been mostly investigated with English data, but is now expanding to other languages. We use L2 texts from the newly released Arabic Learner Corpus and with a combination of three syntactic features (CFG production rules, Arabic function words and Part-of-Speech n-grams), we demonstrate that they are useful for this task. Our system achieves an accuracy of 41% against a baseline of 23%, providing the first evidence for classifier-based detection of language transfer effects in L2 Arabic. Such methods can be useful for studying language transfer, developing teaching materials tailored to students’ native language and forensic linguistics. Future directions are discussed. ©2014 Association for Computational Linguistics

关键词： natural language processing systems

来源：评论

学校读者我要写书评

暂无评论

基于概率主题建模的新闻文本可视化综述

引用

计算机辅助设计与图形学学报 2015年第5期27卷 771-782页

作者：汤斯亮程璐邵健吴飞鲁伟明浙江大学计算机科学与技术学院杭州310027

伴随着信息技术的发展,传统纸质新闻逐渐向新媒体新闻转变.与此同时,近年来数据挖掘和自然语言处理等技术得到了极大的发展,使得对新闻所蕴含丰富语义和主题进行深度挖掘成为可能.然而,信息的超载使得主题可视化成为一个新的挑战,即如... 详细信息

伴随着信息技术的发展,传统纸质新闻逐渐向新媒体新闻转变.与此同时,近年来数据挖掘和自然语言处理等技术得到了极大的发展,使得对新闻所蕴含丰富语义和主题进行深度挖掘成为可能.然而,信息的超载使得主题可视化成为一个新的挑战,即如何以更好的方式来呈现海量互联网文本所蕴含的主题.隐形语义分析(LDA)是近年来兴起的主题建模方法,被当前学术界认为是主流的主题建模技术.文中首先介绍以LDA为主的文本概率主题建模技术及其发展,讨论了新闻主题建模特点;随后概括对比新闻主题可视化的若干方法,并对其进行分类,分析不同方法的适用性和局限性;最后对新闻主题可视化进行总结和展望.

关键词：概率图模型主题建模可视化

来源：评论

学校读者我要写书评

暂无评论

Incremental N-gram Approach for language Identification in Code-Switched Text 1

Incremental N-gram Approach for Language Identification in C...

引用

1st workshop on Computational Approaches to Code Switching, Switching 2014 at the 2014 Conference on Empirical methods in natural language processing, EMNLP 2014

作者： Shrestha, Prajwol Kathmandu University Department of Computer Science and Engineering Dhulikhel Nepal

ISBN: (纸本)9781937284961

A multilingual person writing a sentence or a piece of text tends to switch between languages s/he is proficient in. This alteration between languages, commonly known as code-switching, presents us with the problem of determining the correct language of each word in the text. My method uses a variety of techniques based upon the observed differences in the formation of words in these languages. My system was able to obtain third position in both tweet and token level for the main test dataset as well as first position in the token level evaluation for the surprise dataset both consisting of Nepali-English code-switched texts. © 2014 Association for Computational Linguistics

关键词： Statistical tests

来源：评论

学校读者我要写书评

暂无评论

1st workshop on Computational Approaches to Code Switching, Switching 2014 at the 2014 Conference on Empirical methods in natural language processing, EMNLP 2014 - proceedings

1st Workshop on Computational Approaches to Code Switching, ...

引用

1st workshop on Computational Approaches to Code Switching, Switching 2014 at the 2014 Conference on Empirical methods in natural language processing, EMNLP 2014

ISBN: (纸本)9781937284961

The proceedings contain 17 papers. The topics discussed include: foreign words and the automatic processing of Arabic social media text written in roman script;code mixing: a challenge for language identification in the language of social media;detecting code-switching in a multilingual alpine heritage corpus;exploration of the impact of maximum entropy in recurrent neural network language models for code-switching speech;predicting code-switching in multilingual communication for immigrant communities;overview for the first shared task on language identification in code-switched data;word-level language identification using CRF: code-switching shared task report of MSR India system;and the CMU submission for the shared task on language identification in code-switched data.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Parsing clinical text: how good are the state-of-the-art parsers?

引用

BMC MEDICAL INFORMATICS AND DECISION MAKING 2015年第Sup1期15卷 S2-S2页

作者： Jiang, Min Huang, Yang Fan, Jung-wei Tang, Buzhou Denny, Josh Xu, Hua Univ Texas Houston Sch Biomed Informat Houston Houston TX 77030 USA Kaiser Permanente San Diego CA USA Harbin Inst Technol Shenzhen Grad Sch Shenzhen Peoples R China Vanderbilt Univ Sch Med Dept Med Nashville TN 37212 USA Vanderbilt Univ Sch Med Dept Biomed Informat Nashville TN 37212 USA

Background: Parsing, which generates a syntactic structure of a sentence (a parse tree), is a critical component of natural language processing (NLP) research in any domain including medicine. Although parsers developed in the general English domain, such as the Stanford parser, have been applied to clinical text, there are no formal evaluations and comparisons of their performance in the medical domain. methods: In this study, we investigated the performance of three state-of-the-art parsers: the Stanford parser, the Bikel parser, and the Charniak parser, using following two datasets: (1) A Treebank containing 1,100 sentences that were randomly selected from progress notes used in the 2010 i2b2 NLP challenge and manually annotated according to a Penn Treebank based guideline;and (2) the MiPACQ Treebank, which is developed based on pathology notes and clinical notes, containing 13,091 sentences. We conducted three experiments on both datasets. first, we measured the performance of the three state-of-the-art parsers on the clinical Treebanks with their default settings. Then we re-trained the parsers using the clinical Treebanks and evaluated their performance using the 10-fold cross validation method. Finally we re-trained the parsers by combining the clinical Treebanks with the Penn Treebank. Results: Our results showed that the original parsers achieved lower performance in clinical text (Bracketing F-measure in the range of 66.6%-70.3%) compared to general English text. After retraining on the clinical Treebank, all parsers achieved better performance, with the best performance from the Stanford parser that reached the highest Bracketing F-measure of 73.68% on progress notes and 83.72% on the MiPACQ corpus using 10-fold cross validation. When the combined clinical Treebanks and Penn Treebank was used, of the three parsers, the Charniak parser achieved the highest Bracketing F-measure of 73.53% on progress notes and the Stanford parser reached the highest F-measur

关键词： Medical language processing natural language processing parsing clinical text NLP

来源：评论

学校读者我要写书评

暂无评论

The Dynamic Model Embed in Augmented graph Cuts for Robust Hand Tracking and Segmentation in Videos

引用

MATHEMATICAL PROBLEMS IN ENGINEERING 2014年第1期2014卷 1-12页

作者： Wan, Jun Ruan, Qiuqi An, Gaoyun Li, Wei Liang, Yanyan Zhao, Ruizhen Beijing Jiaotong Univ Inst Informat Sci Beijing 100044 Peoples R China Beijing Key Lab Adv Informat Sci & Network Techno Beijing 100044 Peoples R China Macau Univ Sci & Technol Space Sci Inst Macau 999078 Peoples R China

Segmenting human hand is important in computer vision applications, for example, sign language interpretation, human computer interaction, and gesture recognition. However, some serious bottlenecks still exist in hand localization systems such as fast hand motion capture, hand over face, and hand occlusions on which we focus in this paper. We present a novel method for hand tracking and segmentation based on augmented graph cuts and dynamic model. first, an effective dynamic model for state estimation is generated, which correctly predicts the location of hands probably having fast motion or shape deformations. Second, new energy terms are brought into the energy function to develop augmented graph cuts based on some cues, namely, spatial information, hand motion, and chamfer distance. The proposed method successfully achieves hand segmentation even though the hand passes over other skin-colored objects. Some challenging videos are provided in the case of hand over face, hand occlusions, dynamic background, and fast motion. Experimental results demonstrate that the proposed method is much more accurate than other graph cuts-based methods for hand tracking and segmentation.

关键词： SEGMENTATION (Image processing) graph theory COMPUTER vision SIGN language HUMAN-computer interaction DEFORMATIONS (Mechanics)

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：