Cross-language translating has been well solved with the help of the processing of naturallanguageprocessing(NLP). However, there are a few studies done about the domain of translating the naturallanguage to progra...
详细信息
Automatically extracting key information from scientific documents has the potential to help scientists work more efficiently and accelerate the pace of scientific progress. Prior work has considered extracting docume...
详细信息
ISBN:
(纸本)9781954085527
Automatically extracting key information from scientific documents has the potential to help scientists work more efficiently and accelerate the pace of scientific progress. Prior work has considered extracting document-level entity clusters and relations end-to-end from raw scientific text, which can improve literature search and help identify methods and materials for a given problem. Despite the importance of this task, most existing works on scientific information extraction (SciIE) consider extraction solely based on the content of an individual paper, without considering the paper's place in the broader literature. In contrast to prior work, we augment our text representations by leveraging a complementary source of document context: the citation graph of referential links between citing and cited papers. On a test set of English-language scientific documents, we show that simple ways of utilizing the structure and content of the citation graph can each lead to significant gains in different scientific information extraction tasks. When these tasks are combined, we observe a sizable improvement in end-to-end information extraction over the state-of-the-art, suggesting the potential for future work along this direction. We release software tools to facilitate citation-aware SciIE development.(1)
Shapley Values, a solution to the credit assignment problem in cooperative game theory, are a popular type of explanation in machine learning, having been used to explain the importance of features, embeddings, and ev...
详细信息
ISBN:
(纸本)9781954085534
Shapley Values, a solution to the credit assignment problem in cooperative game theory, are a popular type of explanation in machine learning, having been used to explain the importance of features, embeddings, and even neurons. In NLP, however, leave-oneout and attention-based explanations still predominate. Can we draw a connection between these different methods? We formally prove that - save for the degenerate case - attention weights and leave-one-out values cannot be Shapley Values. Attention flow is a post-processed variant of attention weights obtained by running the max-flow algorithm on the attention graph. Perhaps surprisingly, we prove that attention flows are indeed Shapley Values, at least at the layerwise level. Given the many desirable theoretical qualities of Shapley Values - which has driven their adoption among the ML community - we argue that NLP practitioners should, when possible, adopt attention flow explanations alongside more traditional ones.
Modern naturallanguageprocessing (NLP) makes intensive use of deep learning methods because of the accuracy they offer for a variety of applications. Due to the significant environmental impact of deep learning, cos...
详细信息
HCI and NLP traditionally focus on different evaluation methods. While HCI involves a small number of people directly and deeply, NLP traditionally relies on standardized benchmark evaluations that involve a larger nu...
详细信息
This paper presents a new Massive Open Online Course on naturallanguageprocessing, targeted at non-English speaking students. The course lasts 12 weeks;every week consists of lectures, practical sessions, and quiz a...
详细信息
In this paper we present a graph-based approach to question answering. The method assumes a graph representation of question sentences and text sentences. Question answering rules are automatically learnt from a train...
详细信息
This paper presents the methods submitted by the CIMAT-NLP-GTO team for participation in the MentalRiskES 2023 shared tasks, which focus on the early detection of eating disorders, depression, and anxiety. Our approac...
详细信息
The various potential of children can be limited by language delay or language impairments. However, there are many instances where parents are unaware of the child's condition and do not obtain appropriate treatm...
详细信息
Extracting information from documents usually relies on naturallanguageprocessingmethods working on one-dimensional sequences of text. In some cases, for example, for the extraction of key information from semi-str...
详细信息
暂无评论