We live in a world of information, and with the ever-increasing rate of content growth, we have no choice but to use machine-based solutions to manage, classify, and use it. However, the produced content is often unst...
详细信息
With the emergence of massive news every day leading to information overload, it is difficult for people to select the content they are really interested in from numerous news articles. The languageknowledge and stro...
详细信息
In recent years, the swift progress in information retrieval technologies has positioned text classification as a key area of research. Classifying short texts represents a major challenge within naturallanguage proc...
详细信息
Social media anxiety detection refers to the approach of recognizing various signs of anxiety in a person based on their social media engagements. A person who experiences anxiety may express negative emotional respon...
详细信息
Paraphrasing, the art of rephrasing text while retaining its original meaning, lies at the core of naturallanguage understanding and generation. With the rise of demand for more domain-specialized models, high-qualit...
详细信息
Paraphrasing, the art of rephrasing text while retaining its original meaning, lies at the core of naturallanguage understanding and generation. With the rise of demand for more domain-specialized models, high-quality data is more valued than ever;this includes paraphrasing. ParaFusion-Extend (PFE) is a large-scale dataset driven by Large language Models incorporating lexical and phrasal knowledge. The dataset is curated to contain high-quality diverse paraphrase pairs and also separate knowledge bases that could be used for research work and data augmentation models. We show that PFE offers around at least a 30% increase in syntactic and lexical diversity compared to the original data sources that are commonly used. We demonstrate the effectiveness of PFE on several downstream tasks such as few-shot learning and training on sentence embeddings. We utilize a gold-standard evaluation scheme, which is further strengthened by human evaluation that shows the potential of PFE in advancing paraphrase generation.
Although large language models (LLMs) have achieved significant success in various tasks, they often struggle with hallucination issues in scenarios requiring deep reasoning. Incorporating external knowledge into LLM ...
详细信息
In this paper, we propose a fake news detection model that generates naturallanguage explanations and leverages these explanations for prediction. Deep learning models, particularly Large language Models like BERT, a...
详细信息
Answering complex questions over textual resources remains a challenge, particularly when dealing with nuanced relationships between multiple entities expressed within natural-language sentences. To this end, curated ...
详细信息
ISBN:
(纸本)9783031522642;9783031522659
Answering complex questions over textual resources remains a challenge, particularly when dealing with nuanced relationships between multiple entities expressed within natural-language sentences. To this end, curated knowledge bases (KBs) like YAGO, DBpedia, Freebase, and Wikidata have been widely used and gained great acceptance for question-answering (QA) applications in the past decade. While these KBs offer a structured knowledge representation, they lack the contextual diversity found in natural-language sources. To address this limitation, BigText-QA introduces an integrated QA approach, which is able to answer questions based on a more redundant form of a knowledge graph (KG) that organizes both structured and unstructured (i.e., "hybrid") knowledge in a unified graphical representation. Thereby, BigText-QA is able to combine the best of both worlds-a canonical set of named entities, mapped to a structured background KB (such as YAGO or Wikidata), as well as an open set of textual clauses providing highly diversified relational paraphrases with rich context information. Our experimental results demonstrate that BigText-QA outperforms DrQA, a neural-network-based QA system, and achieves competitive results to QUEST, a graph-based unsupervised QA system.
Mobile app repositories serve as large-scale crowdsourced information systems used for various document-based software engineering tasks, leveraging product descriptions, user reviews, and other naturallanguage docum...
详细信息
ISBN:
(纸本)9783031609992;9783031610004
Mobile app repositories serve as large-scale crowdsourced information systems used for various document-based software engineering tasks, leveraging product descriptions, user reviews, and other naturallanguage documents. Particularly, feature extraction (i.e., identifying functionalities or capabilities of a mobile app mentioned in these documents) is key for product recommendation, topic modelling, and feedback analysis. However, researchers often face domain-specific challenges in mining these repositories, including the integration of heterogeneous data sources, large-scale data collection, normalization and ground-truth generation for feature-oriented tasks. In this paper, we introduce MAppKG, a combination of software resources and data artefacts in the field of mobile app repositories aimed at supporting feature-oriented knowledge generation tasks. Our contribution provides a framework for automatically constructing a knowledge graph that models a domain-specific catalog of naturallanguage documents related to mobile applications. We distribute MApp-KG through a public triplestore, enabling its immediate use for future research and replication of our findings.
Having difficulties like being visually impaired, hard of listening to, dumb are a more amount of difficulty. Technology and innovation have motivated humans to grow to be depending on solace but there exists an under...
详细信息
暂无评论