Text summarization has been rapidly developed as an important task in the field of naturallanguage text generation. Among them, due to the practical needs of Tibetan text summarization, some people have also begun to...
详细信息
Stack Overflow (SO) is a widely used question-andanswer (Q&A) forum dedicated to software development. It plays a supplementary role to official documentation (DOC for short) by offering practical examples and res...
详细信息
ISBN:
(纸本)9798350327830
Stack Overflow (SO) is a widely used question-andanswer (Q&A) forum dedicated to software development. It plays a supplementary role to official documentation (DOC for short) by offering practical examples and resolving uncertainties. However, the process of simultaneously consulting both the documentation and SO posts can be challenging and time-consuming due to their disconnected nature. In this study, we propose DOSA, a novel approach to automatically align SO and DOC, which inject domain-specific knowledge about the DOC structure into large language models (LLMs) through weak supervision and constrained decoding, thereby enhancing knowledge retrieval and streamlining task completion during the software development procedure. Our preliminary experiments find that DOSA outperforms various widely-used baselines, showing the promise of using generative retrieval models to perform low-resource software engineering tasks.
It might be intimidating to navigate the wide array of learning resources in this era of information overload. This article presents knowledge Navigator, an artificial intelligence (AI) system that adapts the learning...
详细信息
The invention of artificial intelligence and naturallanguageprocessing has revolutionised human-machine interaction, and OpenAI's ChatGPT models are at the forefront of this. GPT-3 and GPT-4 models generate huma...
详细信息
The implementation of naturallanguageprocessing (NLP) in chatbots has significantly enhanced the efficiency and user experience of academic information systems. This paper presents a case study of the international ...
详细信息
This research paper explores how advancements in Artificial Intelligence, particularly naturallanguageprocessing (NLP), are impacting healthcare. NLP, which can analyse and understand human language, is being integr...
详细信息
The overwhelming volume of scientific documents necessitates automatic summarization to assist researchers in efficiently finding relevant data, making informed decisions, and retrieving appropriate answers to their q...
详细信息
Recently, there has been a growing interest in the field of controllable Text-to-Speech (TTS). While previous studies have relied on users providing specific style factor values based on acoustic knowledge or selectin...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
Recently, there has been a growing interest in the field of controllable Text-to-Speech (TTS). While previous studies have relied on users providing specific style factor values based on acoustic knowledge or selecting reference speeches that meet certain requirements, generating speech solely from natural text prompts has emerged as a new challenge for researchers. This challenge arises due to the scarcity of high-quality speech datasets with natural text style prompt and the absence of advanced text-controllable TTS models. In light of this, 1) we propose TextrolSpeech, which is the first large-scale speech emotion dataset annotated with rich text attributes. The dataset comprises 236,203 pairs of style prompt in natural text descriptions with five style factors and corresponding speech samples. Through iterative experimentation, we introduce a multi-stage prompt programming approach that effectively utilizes the GPT model for generating natural style descriptions in large volumes. 2) Furthermore, to address the need for generating audio with greater style diversity, we propose an efficient architecture called Salle. This architecture treats text controllable TTS as a language model task, utilizing audio codec codes as an intermediate representation to replace the conventional mel-spectrogram. Finally, we successfully demonstrate the ability of the proposed model by showing a comparable performance in the controllable TTS task. Audio samples are available on the demo page https://***/.
The use of naturallanguageprocessing (NLP) has become an indispensable instrument for conducting sentiment analysis in the field of marketing. This technology enables organizations to decipher and analyze the though...
详细信息
With the advent of large language models (LLMs), requirements engineers have gained a powerful naturallanguageprocessing tool to analyze, query, and validate a wide variety of textual artifacts, thus potentially sup...
详细信息
ISBN:
(纸本)9798350395129;9798350395112
With the advent of large language models (LLMs), requirements engineers have gained a powerful naturallanguageprocessing tool to analyze, query, and validate a wide variety of textual artifacts, thus potentially supporting the whole requirements engineering process from requirements elicitation to management. However, the input for the requirements engineering process often encompasses a variety of potential information sources in various formats, especially graphical models such as process models. Hence, this work aims to contribute to the state of the art by assessing the feasibility of utilizing graphical process models and their textual representations in the requirements engineering process. In particular, we focus on the extraction of textual process descriptions from process models as i) input for the requirements engineering process and ii) documentation as the result of process-oriented requirements engineering. To this end, we explore, quantify, and compare traditional deterministic and LLM-based extraction methods where the latter includes GPT3, GPT3.5, GPT4, and LLAMA. The evaluation assesses output quality and information loss based on one data set. The results indicate that LLMs produce human-like process descriptions based on the predefined patterns, but apparently lack true comprehension of the process models.
暂无评论