Paraphrases are important linguistic resources for a wide variety of NLP applications. Many techniques for automatic paraphrase mining from general corpora have been proposed. While these techniques are successful at ...
ISBN: 9781450381635 (print)
Both statistical and rule-based methods for named entity recognition are quite sensitive to the type of language used in the analysed texts. Previous studies have shown, for example, that it is harder to detect named entities in SMS or microblog messages, where words are abridged or changed to lowercase. In this article, we focus on old French texts to evaluate the impact of manual and automatic normalisation before applying five geographical named entity recognition tools, as well as an improved version of one of them, in order to help build maps displaying the locations mentioned in ancient texts. Our results show that manual normalisation leads to better results for all methods, and that automatic normalisation performs differently depending on the tool used to extract geographical named entities, but with a significant improvement for most methods.
The proceedings contain 38 papers. The topics discussed include: specialized interactive methods for using data on radar application models; development of the information system for finding the best route for electric cars; modeling of 3- and 5-isogenies of supersingular Edwards curves; estimation method of information system functioning quality based on the fuzzy logic; techniques comparison for natural language processing; selection of effective methods of big data analytical processing in information systems of smart cities; optimal strategy for the development of insurance business structures in a competitive environment; forecasting temperatures of a synchronous motor with permanent magnets using machine learning; development of intelligent information technology of computer processing of pedagogical tests open tasks based on machine learning approach; method for automatic analysis of compliance of expenses data and the enterprise income by neural network model of forecast; and analysis of the demand for bicycle use in a smart city based on machine learning.
Continuous integration in software development requires running the tests on a regular basis to ensure that the code does not regress. So that the execution time of the regression test suite remains reasonable, its size...
The proceedings contain 55 papers. The topics discussed include: Vicomtech at ALexS 2020: unsupervised complex word identification based on domain frequency; general lexicon-based complex word identification extended with stem N-grams and morphological engines; Hulat - ALexS CWI Task - CWI for language and learning disabilities applied to university educational text; overview of ALexS 2020: first workshop on Lexical Analysis at SEPLN; named entity recognition, concept normalization and clinical coding: overview of the cantemist track for cancer text mining in Spanish, corpus, guidelines, methods and results; extracting neoplasms morphology mentions in Spanish clinical cases through word embeddings; and NLNDE at CANTEMIST: neural sequence labeling and parsing approaches for clinical concept extraction.
The rising growth of fake news and misleading information through online media outlets demands an automatic method for detecting such news articles. Of the few limited works which differentiate between trusted vs othe...
This paper describes our submission to the shared task on "Multi-hop Inference Explanation Regeneration" at the Textgraphs workshop at EMNLP 2019 (Jansen and Ustalov, 2019). Our system identifies chains of fact...
Relation extraction is an important task in natural language processing (NLP). Existing methods generally pay more attention to extracting textual semantic information from text, but ignore the relation contextual...
ISBN: 9781945626937 (print)
The proceedings contain 8 papers. The topics discussed include: dependency parsing with dilated iterated graph CNNs; entity identification as multitasking; towards neural machine translation with latent tree attention; structured prediction via learning to search under bandit feedback; syntax aware LSTM model for semantic role labeling; spatial language understanding with multimodal graphs using declarative learning based programming; boosting information extraction systems with character-level neural networks and free noisy supervision; and piecewise latent variables for neural variational text processing.
In this paper, we explore strategies to evaluate models for the task of research paper novelty detection: given all papers released on a given date, which of the papers discuss new ideas and influence future research? We...