The Arabic Online Commentary (AOC) (Zaidan and Callison-Burch, 2011) is a large-scale repository of Arabic dialects with manual labels for 4 varieties of the language. Existing dialect identification models exploiting...
详细信息
This study is aimed to validate a text processing architecture for knowledge acquisition and analyze the performance of several populations under controlled validation studies by focusing on empirical methods. We repo...
详细信息
ISBN:
(纸本)9781450365727
This study is aimed to validate a text processing architecture for knowledge acquisition and analyze the performance of several populations under controlled validation studies by focusing on empirical methods. We report our experience by analyzing three case studies whose performance issues have been addressed by using a proposed natural-language-mapping model. We discuss the advantages and the disadvantages of an automated prototype for implementing such architectural model, compared with a byhand one. We compare the cases running out on the prototype in order to identify the suitable features the software systems should have. The final goal of the validation process is describing an ongoing research work concerned with the definition of an approach to automate processes for knowledge extraction in requirements engineering. Some of achieved findings of the performance analysis are: (i) the approach can be applied to business-based technical documents regardless of the organizational process involved;(ii) the activities related to the domain understanding can be executed in a low-costs process;and (iii) empirical methods can be used in controlled validation studies for knowledge extraction approaches.
This work is on a previously formalized semantic evaluation task of spatial role labeling (SpRL) that aims at extraction of formal spatial meaning from text. Here, we report the results of initial efforts towards expl...
详细信息
The proceedings contain 8 papers. The topics discussed include: normalized entity graph for computing local coherence;exploiting timegraphs in temporal relation classification;multi-document summarization using bipart...
ISBN:
(纸本)9781937284961
The proceedings contain 8 papers. The topics discussed include: normalized entity graph for computing local coherence;exploiting timegraphs in temporal relation classification;multi-document summarization using bipartite graphs;a novel two-stage framework for extracting opinionated sentences from news articles;constructing coherent event hierarchies from news stories;semi-supervised graph-based genre classification for web pages;the modular community structure of linguistic predication networks;and from visualization to hypothesis construction for second language acquisition.
Determining the textual entailment between texts is important in many NLP tasks, such as summarization, question answering, and information extraction and retrieval. Various methods have been suggested based on extern...
详细信息
This paper presents a survey on hate speech detection. Given the steadily growing body of social media content, the amount of online hate speech is also increasing. Due to the massive scale of the web, methods that au...
详细信息
In this paper, we present an analysis of feature extraction methods via dimensionality reduction for the task of biomedical Word Sense Disambiguation (WSD). We modify the vector representations in the 2-MRD WSD algori...
详细信息
In this paper, we describe the 2017 CLAS system as entered into the C@merata shared task. This year, our aim was to use the challenge as a case study of how one manages naturallanguage queries to structured data, and...
详细信息
Distant supervision has been applied to automatically generate labeled data for biomedical relation extraction. Noise exists in both positively and negatively-labeled data and affects the performance of supervised mac...
详细信息
The centroid-based model for extractive document summarization is a simple and fast baseline that ranks sentences based on their similarity to a centroid vector. In this paper, we apply this ranking to possible summar...
详细信息
暂无评论