Association as a gift enables people do not have to mention something in completely straightforward words and allows others to understand what they intend to refer to. In this paper, we propose a chain association-bas...
详细信息
One of the frequently used approach for multi-hop question answering (MHQA) is that of graph neural network. In graph neural network based MHQA, learning is primarily based on the graph and connections between nodes o...
详细信息
ISBN:
(数字)9798350349115
ISBN:
(纸本)9798350349122
One of the frequently used approach for multi-hop question answering (MHQA) is that of graph neural network. In graph neural network based MHQA, learning is primarily based on the graph and connections between nodes of the graph. Creating a meaningful graph from text, however, is a challenging task, and current methods involve extensive data manipulation. In this paper, we propose a novel technique for constructing the mentioned graph. To do so, we first consider paragraphs, sentences, and entities as nodes of the graph. Furthermore, we assume nodes of our graph to be sets of words. Next, in order to find if any given two nodes are connected, we take their intersection, and filter it based on parts of speech types of its elements. If the intersection of the given two nodes is still none-empty after being filtered, an edge is added between them. Our observations show that graphs constructed this way are rich in quantity and variety of connections between nodes, and will result in performance at least equivalent to that of graphs constructed using more complex processes.
Recent work on aspect-level sentiment classification has employed graph Convolutional Networks (GCN) over dependency trees to learn interactions between aspect terms and opinion words. In some cases, the corresponding...
详细信息
The first international workshop on Programming languageprocessing presents interdisciplinary contributions that address programming language procession problems with machine learning and data mining techniques. Rece...
详细信息
ISBN:
(纸本)9781450383325
The first international workshop on Programming languageprocessing presents interdisciplinary contributions that address programming language procession problems with machine learning and data mining techniques. Recently, there are lots of successful naturallanguageprocessingmethods. But the mining of programming languages could not exactly follow the manner of naturallanguageprocessing. The difference between naturallanguage and programming language brings in new research challenges and opportunities. The workshop will bring together researchers from machine learning, data mining and software engineering to discuss and debate the path forward for mining the value of programming languages.
Semantic representation that supports the choice of an appropriate connective between pairs of clauses inherently addresses discourse coherence, which is important for tasks such as narrative understanding, argumentat...
详细信息
Most of the existing Knowledge-based Question Answering (KBQA) methodsfirst learn to map the given question to a query graph, and then convert the graph to an executable query to find the answer. The query graph is t...
详细信息
Classical tasks of a librarian, such as screening and categorizing new documents based on their content, are increasingly replaced by search engines or through the use of cataloging software. A first overview of a cor...
详细信息
Joint medical relation extraction refers to extracting triples, composed of entities and relations, from the medical text with a single model. One of the solutions is to convert this task into a sequential tagging tas...
详细信息
ISBN:
(纸本)9781665488679
Joint medical relation extraction refers to extracting triples, composed of entities and relations, from the medical text with a single model. One of the solutions is to convert this task into a sequential tagging task. However, in the existing works, the methods of representing and tagging the triples in a linear way failed to the overlapping triples, and the methods of organizing the triples as a graph faced the challenge of large computational effort. In this paper, inspired by the tree-like relation structures in the medical text, we propose a novel scheme called Bidirectional Tree Tagging (BiTT) to form the medical relation triples into two binary trees and convert the trees into a word-level tags sequence. based on BiTT scheme, we develop a joint relation extraction model to predict the BiTT tags and further extract medical triples efficiently. Our model outperforms the best baselines by 2.0% and 2.5% in F1 score on two medical datasets. What's more, the models with our BiTT scheme also obtain promising results in three public datasets of other domains.
Few-shot knowledge graph completion (FKGC) aims to infer unknown fact triples of a relation using its few-shot reference entity pairs. Recent FKGC studies focus on learning semantic representations of entity pairs by ...
详细信息
In the context of the COVID-19 pandemic, social media platforms such as Twitter have been of great importance for users to exchange news, ideas, and perceptions. Researchers from fields such as discourse analysis and ...
详细信息
In the context of the COVID-19 pandemic, social media platforms such as Twitter have been of great importance for users to exchange news, ideas, and perceptions. Researchers from fields such as discourse analysis and the social sciences have resorted to this content to explore public opinion and stance on this topic, and they have tried to gather information through the compilation of large-scale corpora. However, the size of such corpora is both an advantage and a drawback, as simple text retrieval techniques and tools may prove to be impractical or altogether incapable of handling such masses of data. This study provides methodological and practical cues on how to manage the contents of a large-scale social media corpus such as Chen et al. (JMIR Public Health Surveill 6(2):e19273, 2020) COVID-19 corpus. We compare and evaluate, in terms of efficiency and efficacy, available methods to handle such a large corpus. first, we compare different sample sizes to assess whether it is possible to achieve similar results despite the size difference and evaluate sampling methods following a specific data management approach to storing the original corpus. Second, we examine two keyword extraction methodologies commonly used to obtain a compact representation of the main subject and topics of a text: the traditional method used in corpus linguistics, which compares word frequencies using a reference corpus, and graph-based techniques as developed in naturallanguageprocessing tasks. The methods and strategies discussed in this study enable valuable quantitative and qualitative analyses of an otherwise intractable mass of social media data.
暂无评论