Story generation stands as a crucial, yet formidable task, necessitating a profound grasp of subtle, often unspoken knowledge, along with context-specific cues to craft compelling narratives. The core challenges invol...
详细信息
Information Extraction (IE) aims to extract structural knowledge (e.g., entities, relations, events) from naturallanguage texts. Recently, Large language Models (LLMs) with code-style prompts have demonstrated powerf...
详细信息
ISBN:
(纸本)9789819794331;9789819794348
Information Extraction (IE) aims to extract structural knowledge (e.g., entities, relations, events) from naturallanguage texts. Recently, Large language Models (LLMs) with code-style prompts have demonstrated powerful capabilities in IE tasks. However, adopting code LLMs to conduct IE tasks still has two challenges: (1) It still lacks a unified code-style prompt for different IE tasks since existing methods use task-specific prompts for separate IE tasks. (2) It still lacks an effective in-context learning (ICL) method to encourage LLMs to conduct IE tasks precisely, considering some powerful LLMs are close-sourced and not trainable. Therefore, this paper proposes a code generation framework for Universal IE (UIE) tasks called Code4UIE. Specifically, for the first challenge, Code4UIE designs a unified code-style schema for various IE tasks via Python classes. By so doing, different IE tasks can be associated, and LLMs can learn from various IE tasks effectively. For the second challenge, Code4UIE adopts a retrieval-augmented mechanism to comprehensively utilize the ICL ability of LLMs. Extensive experiments on five representative IE tasks across nine datasets demonstrate the effectiveness of the Code4UIE framework.
Automatically generating scientific literature surveys is a valuable task that can significantly enhance research efficiency. However, the diverse and complex nature of information within a literature survey poses sub...
详细信息
ISBN:
(纸本)9789819794423;9789819794430
Automatically generating scientific literature surveys is a valuable task that can significantly enhance research efficiency. However, the diverse and complex nature of information within a literature survey poses substantial challenges for generative models. In this paper, we design a series of prompts to systematically leverage large language models (LLMs), enabling the creation of comprehensive literature surveys through a step-by-step approach. Specifically, we design prompts to guide LLMs to sequentially generate the title, abstract, hierarchical headings, and the main content of the literature survey. We argue that this design enables the generation of the headings from a high-level perspective. During the content generation process, this design effectively harnesses relevant information while minimizing costs by restricting the length of both input and output content in LLM queries. Our implementation with Qwen-long achieved third place in the NLPCC 2024 Scientific Literature Survey Generation evaluation task, with an overall score only 0.03% lower than the second-place team. Additionally, our soft heading recall is 95.84%, the second best among the submissions. Thanks to the efficient prompt design and the low cost of the Qwen-long API, our method reduces the expense for generating each literature survey to 0.1 RMB, enhancing the practical value of our method.
The healthcare industry is undergoing a fundamental revolution with the integration of Quantum naturallanguageprocessing techniques for matching patient-clinical trials. This research envisions a paradigm where pati...
详细信息
Open-ended referring expression comprehension focuses on locating the text query within an image via scene knowledge, requiring complex reasoning across the triplet of the image, scene knowledge, and the text query. H...
详细信息
Large language Models (LLMs) are pivotal in advancing naturallanguageprocessing but often struggle with complex reasoning tasks due to inefficient attention distributions. In this paper, we explore the effect of inc...
详细信息
Recent advancements in artificial intelligence (AI), particularly in naturallanguageprocessing (NLP), have led to the development of sophisticated chatbots like ChatGPT. These chatbots excel in answering questions a...
详细信息
In the ever-digitalized environment, entrepreneurs encounter challenges to engage their customers while managing business operations. The study investigates the development of a chatbot specifically for entrepreneurs ...
详细信息
Santhali is one of the popular local languages and is mainly spoken by tribal people of States like Jharkhand, Odisha, and West Bengal, as well as a few other parts of the country. Tokenization is one of the major cha...
详细信息
Zero-shot anomaly detection (ZSAD) identifies anomalies without needing training samples from the target dataset, essential for scenarios with privacy concerns or limited data. Vision-language models like CLIP show po...
详细信息
暂无评论