Over the Internet, an efficient approach and promising solution to retrieve significant information envisages the beginning of Question Answering Systems (QAS). Because of data sources availability, the deep learning ...
详细信息
This study proposes an efficient model training framework aimed at improving the performance of hierarchical classification of Chinese text in low-computation environments. By leveraging the generative capabilities of...
详细信息
ISBN:
(纸本)9798331528911;9798331528928
This study proposes an efficient model training framework aimed at improving the performance of hierarchical classification of Chinese text in low-computation environments. By leveraging the generative capabilities of large language models for data annotation and augmentation, a high-quality dataset is constructed. knowledge distillation techniques are applied to transfer knowledge from a complex teacher model to a lightweight student model, effectively reducing computational resources. Additionally, a reinforcement learning strategy is introduced to further optimize the student model, enabling it to surpass the teacher model in both accuracy and inference efficiency, providing a novel solution for naturallanguageprocessing and other resource-constrained tasks.
Traditional manufacturing faces challenges adapting to dynamic environments and quickly responding to manufacturing changes. The use of multi-agent systems has improved adaptability and coordination but requires furth...
详细信息
ISBN:
(纸本)9798350358513;9798350358520
Traditional manufacturing faces challenges adapting to dynamic environments and quickly responding to manufacturing changes. The use of multi-agent systems has improved adaptability and coordination but requires further advancements in rapid human instruction comprehension, operational adaptability, and coordination through naturallanguage integration. Large language models like GPT-3.5 and GPT-4 enhance multi-agent manufacturing systems by enabling agents to communicate in naturallanguage and interpret human instructions for decision-making. This research introduces a novel framework where large language models enhance the capabilities of agents in manufacturing, making them more adaptable, and capable of processing context-specific instructions. A case study demonstrates the practical application of this framework, showing how agents can effectively communicate, understand tasks, and execute manufacturing processes, including precise G-code allocation among agents. The findings highlight the importance of continuous large language model integration into multi-agent manufacturing systems and the development of sophisticated agent communication protocols for a more flexible manufacturing system.
knowledge graphs (KGs) are routinely curated to provide factual data for various domain-specific analyses. Nevertheless, it remains nontrivial to explore domain knowledge with standard query languages. We demonstrate ...
详细信息
By integrating recent advances in large language models (LLMs) and generative models into the emerging semantic communication (SC) paradigm, in this article we put forward to a novel framework of language-oriented sem...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
By integrating recent advances in large language models (LLMs) and generative models into the emerging semantic communication (SC) paradigm, in this article we put forward to a novel framework of language-oriented semantic communication (LSC). In LSC, machines communicate using human language messages that can be interpreted and manipulated via naturallanguageprocessing (NLP) techniques for SC efficiency. To demonstrate LSC's potential, we introduce three innovative algorithms: 1) semantic source coding (SSC) which compresses a text prompt into its key head words capturing the prompt's syntactic essence while maintaining their appearance order to keep the prompt's context;2) semantic channel coding (SCC) that improves robustness against errors by substituting head words with their lenghthier synonyms;and 3) semantic knowledge distillation (SKD) that produces listener-customized prompts via in-context learning the listener's language style. In a communication task for progressive text-to-image generation, the proposed methods achieve higher perceptual similarities with fewer transmissions while enhancing robustness in noisy communication channels.
Unsupervised question answering is a promising yet challenging task, which alleviates the burden of building large-scale annotated data in a new domain. It motivates us to study the unsupervised multiple-choice questi...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
Unsupervised question answering is a promising yet challenging task, which alleviates the burden of building large-scale annotated data in a new domain. It motivates us to study the unsupervised multiple-choice question answering (MCQA) problem. In this paper, we propose a novel framework designed to generate synthetic MCQA data barely based on contexts from the universal domain without relying on any form of manual annotation. Possible answers are extracted and used to produce related questions, then we leverage both named entities (NE) and knowledge graphs to discover plausible distractors to form complete synthetic samples. Experiments on multiple MCQA datasets demonstrate the effectiveness of our method.
In modern software engineering, requirements analysis and modeling are key steps in requirements engineering, influencing the subsequent system design and implementation. Traditional requirements analysis and modeling...
详细信息
Automated analysis and specification of software requirements expressed in naturallanguage is a challenge addressed by the research community and is becoming a reality thanks to the advances in Artificial Intelligenc...
详细信息
ISBN:
(纸本)9798331528690;9798331528706
Automated analysis and specification of software requirements expressed in naturallanguage is a challenge addressed by the research community and is becoming a reality thanks to the advances in Artificial Intelligence (AI) and naturallanguageprocessing (NLP) techniques. While the research community focuses mainly on generic software requirements or specialized solutions for security requirements, we find a gap in the automation of analysis and specification for requirements in the cloud computing domain and the automatic mapping of requirements on actual products offered in the cloud service market. In this research work, we propose AI-CRAS an AI-driven cloud service requirement analysis and specification methodology. The proposed method, which leverages state-of-the-art transformer-based large language model, has been implemented and validated in a real case. Experimental results demonstrate that the model performed well in binary and multilabel classification of requirements (achieving recall/F1-score of 0.96/0.92 and 0.86/0.76, respectively) and mapping requirements into actual cloud services.
Large language models (LLMs) have been observed to occasionally exhibit hallucination, a phenomenon where they generate statements unsupported by factual evidence, thereby compromising the trustworthiness of their out...
详细信息
The language prior problem in VQA makes the model directly predictis based on questions, causing the model's performance to drop sharply outside the distribution. Current debiased methods often achieve good out-of...
详细信息
ISBN:
(纸本)9798350390155;9798350390162
The language prior problem in VQA makes the model directly predictis based on questions, causing the model's performance to drop sharply outside the distribution. Current debiased methods often achieve good out-of-distribution generalization capabilities at the expense of in-distribution performance degradation. We propose a novel method combining Robust knowledge distillation and self-contrast Reasoning (RR-VQA) to solve the language prior problem in VQA. We propose the QAS module to select reasonable questions for images, perform knowledge distillation through the ID and OOD teacher models, and obtain pseudo answers after passing through the QAS module. The CRSG module we propose synthesizes four visual language positive and negative samples for contrastive reasoning, which effectively increases the semantic dependency of the images while avoiding performance degradation due to spurious correlations between questions and answers. Our method is model-agnostic and achieves state-of-the-art performance on VQA-CP v2 dataset while maintaining performance on VQA v2 dataset.
暂无评论