Currently, there is a significant gap in the conversion of mathematical theorems from naturallanguage to logical expressions, specifically in the form of first-order predicate logic. To address this issue, this paper...
详细信息
In the age of artificial intelligence, the role of large language models (LLMs) is becoming increasingly central. Despite their growing prevalence, their capacity to consolidate knowledge from different training docum...
详细信息
ISBN:
(纸本)9798891760608
In the age of artificial intelligence, the role of large language models (LLMs) is becoming increasingly central. Despite their growing prevalence, their capacity to consolidate knowledge from different training documents-a crucial ability in numerous applications-remains unexplored. This paper presents the first study examining the capability of LLMs to effectively combine such information within their parameter space. We introduce EpiK-Eval, a novel question-answering benchmark tailored to evaluate LLMs' proficiency in formulating a coherent and consistent knowledge representation from segmented narratives. Evaluations across various LLMs reveal significant weaknesses in this domain. We contend that these shortcomings stem from the intrinsic nature of prevailing training objectives. Consequently, we advocate for refining the approach towards knowledge consolidation, as it harbors the potential to dramatically improve their overall effectiveness and performance. The findings from this study offer insights for developing more robust and reliable LLMs. Our code and benchmark are available at https://***/chandar-lab/EpiK-Eval
Medical entity disambiguation (MED) plays a crucial role in naturallanguageprocessing and biomedical domains, which is the task of mapping ambiguous medical mentions to structured candidate medical entities from kno...
详细信息
The proceedings contain 135 papers. The topics discussed include: Bipol: multi-axes evaluation of bias with explainability in benchmark datasets;automatically generating Hindi Wikipedia pages using Wikidata as a knowl...
ISBN:
(纸本)9789544520922
The proceedings contain 135 papers. The topics discussed include: Bipol: multi-axes evaluation of bias with explainability in benchmark datasets;automatically generating Hindi Wikipedia pages using Wikidata as a knowledge graph: a domain-specific template sentences approach;cross-lingual classification of crisis-related tweets using machine translation;lexicon-driven automatic sentence generation for the skills section in a job posting;multilingual racial hate speech detection using transfer learning;exploring Amharic hate speech data collection and classification approaches;are you not moved? incorporating sensorimotor knowledge to improve metaphor detection;HAQA and QUQA: constructing two Arabic question-answering Corpora for the Quran and Hadith;a review in knowledge extraction from knowledge bases;evaluating of large language models in relationship extraction from unstructured data: empirical study from holocaust testimonies;and impact of emojis on automatic analysis of individual emotion categories.
The automotive sector has a notable upswing in the sales of pre-owned vehicles, hence necessitating the development of precise pricing models to enable knowledgeable decision-making for both purchasers and vendors. In...
详细信息
Modern Standard Arabic (MSA) serves as the official language across all Arab countries, employed in administrative, educational, official broadcast, and press settings. However, in everyday informal communication, eac...
详细信息
knowledge Graphs (KG) are emerging and becoming increasingly popular. Building a domain knowledge graph from a large amount of text is a challenging task which requires a tremendous amount of work, including entity re...
详细信息
Addressing the challenge of few-shot learning in intent classification tasks within naturallanguageprocessing (NLP), this study introduces a novel approach that harnesses the robust adaptation capabilities of Model-...
详细信息
ISBN:
(纸本)9798350376357;9798350376340
Addressing the challenge of few-shot learning in intent classification tasks within naturallanguageprocessing (NLP), this study introduces a novel approach that harnesses the robust adaptation capabilities of Model-Agnostic Meta-Learning (MAML) combined with sophisticated language embeddings, namely BERT, LaBSE, and text-embedding-ada-002. The need for models to understand and classify intents with minimal training data is imperative to progress in creating versatile, responsive AI systems. We propose a methodology that leverages the generalizability of MAML and the deeply contextualized representations offered by state-of-the-art embeddings, allowing for significant improvements in Accuracy and data efficiency. We evaluate our approach using the CLINC150 dataset across a series of N-way & K-shot configurations, demonstrating the efficacy of the proposed model with varying numbers of intent classes and examples. Our findings reveal that the text-embedding-ada-002 embeddings consistently provide superior performance in both 1-shot and 5-shot settings across all class configurations tested, indicating their potent synergy with metalearning strategies. Specifically, text-embedding-ada-002 achieved an accuracy of 97.07% in the 5-Way & 1-Shot setting and 99.1% in the 5-Way & 5-Shot setting. The outcomes of our experimental evaluation suggest that our approach also illuminates the potential of harmonious integration of cutting-edge language embeddings with meta-learning frameworks. This work provides a solid foundation for further exploration in optimizing fewshot intent classification, paving the way for creating AI systems proficient in understanding user intents with minimal exemplars. This research lays the groundwork for future advancements in few-shot intent classification, enabling the development of AI systems that require minimal training data to interpret user intent accurately.
This paper aims to apply knowledge graph construction techniques to textbooks, explicitly focusing on the challenge of the absence of domain-specific schema for each textbook. Various entity and relation extraction mo...
详细信息
As a crucial first step in the process of constructing knowledge graph, the accuracy of named entity recognition determines the construction effect of the final graph. However, at present, Chinese named entity recogni...
详细信息
ISBN:
(纸本)9798400709227
As a crucial first step in the process of constructing knowledge graph, the accuracy of named entity recognition determines the construction effect of the final graph. However, at present, Chinese named entity recognition methods still have many problems, such as long training time and lower accuracy. Hence, we come up with a BERT-BILSTM-ACRF entity recognition method that combines the "self-attention" mechanism. To begin with, Bert model is selected as the embedding layer, the text is vectorized, and the character position in-formation is obtained through the bidirectional Long Short-Term Memory network. Secondly, the internal relationship of the character sequence is further searched through the self-attention mechanism, and finally the final optimal sequence is decoded by the conditional random field model. To check the effectiveness of the BERT-BILSTM-ACRF model, the model is applied to the data set of the university course textbook" Da-ta Structure", and the result reaches 98.97%F1 value and 98.14%accuracy, which has good experimental results and certain practical value.
暂无评论