In naturallanguageprocessing (NLP) tasks, detecting out-of-distribution (OOD) samples is essential to safely deploy a language model in real-world problems. Recently, several studies report that pre-trained language...
In naturallanguageprocessing (NLP) tasks, detecting out-of-distribution (OOD) samples is essential to safely deploy a language model in real-world problems. Recently, several studies report that pre-trained language models (PLMs) accurately detect OOD data compared to LSTM, but we empirically find that PLMs show sub-par OOD detection performance when (1) OOD samples have similar semantic representation to in-distribution (IND) samples and (2) PLMs are finetuned under data scarcity settings. To alleviate above issues, state-of-the-art uncertainty quantification (UQ) methods can be used, but the comprehensive analysis of UQ methods with PLMs has received little consideration. In this work, we investigate seven UQ methods with PLMs and show their effectiveness in the text classification task.
In the intricate problem of understanding long-form multi-modal inputs, few key-aspects in scene-understanding and dialogue-and-discourse are often overlooked. In this paper, we investigate two such key-aspects for be...
详细信息
ISBN:
(纸本)9781450392037
In the intricate problem of understanding long-form multi-modal inputs, few key-aspects in scene-understanding and dialogue-and-discourse are often overlooked. In this paper, we investigate two such key-aspects for better semantic and relational understanding (i). head-object-tracking in addition to usual face-tracking, and (ii). fusing scene-to-text representation with external common-sense knowledge-base for effective mapping to sub-tasks of interest. The usage of head-tracking especially helps with enriching sparse entity mapping to inter-entity conversation interactions. These methods are guided by naturallanguage supervision on visual models, and perform well for interaction and sentiment understanding tasks.
With the rapid development of globalization, traditional learning methods have significant limitations in speaking practice and struggle to meet the personalized needs of learners. Innovative educational tools are nee...
详细信息
ISBN:
(数字)9798350374919
ISBN:
(纸本)9798350374926
With the rapid development of globalization, traditional learning methods have significant limitations in speaking practice and struggle to meet the personalized needs of learners. Innovative educational tools are needed to enhance students' speaking skills. This paper proposes a design and optimization plan for an intelligent English interactive system based on speech recognition and naturallanguageprocessing technologies. It explores the adaptability of speech recognition technology under different accents, speech rates, and noise environments. By optimizing deep learning models, the accuracy of recognition is further improved. Meanwhile, by integrating naturallanguageprocessing technology, the system understands user intentions and generates natural responses, enhancing the fluency of human-computer dialogue. In the system design, a modular architecture is adopted to facilitate later optimization, implementing core functional modules for speech recognition, naturallanguage understanding, dialogue management, naturallanguage generation, and speech synthesis. Experimental validation shows that the optimized system significantly improves recognition accuracy in multiple scenarios, performing particularly well in processing movie dialogues and everyday conversations. The research results indicate that the system designed in this paper not only enhances the English learning experience but also provides an effective technical path for the development of intelligent interactive systems.
Social media, especially microblogs, have potentials to develop unavoidable factors in investment decision-making, because of its use for capturing human sentiment. In this paper, by applying signaling theory and Natu...
详细信息
In this paper, we aim to explore text data analysis with the help of different methods to vectorize the data and carry out regression methods. At present, we have a lot of text categorization techniques, but very few ...
详细信息
ISBN:
(数字)9783031280733
ISBN:
(纸本)9783031280726;9783031280733
In this paper, we aim to explore text data analysis with the help of different methods to vectorize the data and carry out regression methods. At present, we have a lot of text categorization techniques, but very few algorithms are dedicated to text regression. But many regression models predict only a single estimated value. However, it's more efficient to predict a numerical range than a single estimate, since we can be more sure that the true value will be within the range rather than considering the estimated value as the true value. We aim to combine both these techniques in this paper. The goal of this paper is to collect text data, clean the data, and create a text regression model to find the best-suited algorithm that could be used in any situation, through the use of quantile regression.
The explosion of IoT usage provides efficiency and convenience in various fields including daily life, business and information technology. However, there are potential risks in large-scale IoT systems and vulnerabili...
详细信息
With the increasing availability of high-dimensional data in diverse fields as well as the need to process this data, the accurate measurement of similarity plays a crucial role in many applications. Specifically, the...
详细信息
naturallanguageprocessing has opened new paths for business process management and requirements engineering, particularly in automating the extraction and formalization of temporal requirements from diverse document...
详细信息
As a prominent parameter-efficient fine-tuning technique in NLP, prompt tuning is being explored its potential in computer vision. Typical methods for visual prompt tuning follow the sequential modeling paradigm stemm...
详细信息
ISBN:
(纸本)1577358872
As a prominent parameter-efficient fine-tuning technique in NLP, prompt tuning is being explored its potential in computer vision. Typical methods for visual prompt tuning follow the sequential modeling paradigm stemming from NLP, which represents an input image as a flattened sequence of token embeddings and then learns a set of unordered parameterized tokens prefixed to the sequence representation as the visual prompts for task adaptation of large vision models. While such sequential modeling paradigm of visual prompt has shown great promise, there are two potential limitations. First, the learned visual prompts cannot model the underlying spatial relations in the input image, which is crucial for image encoding. Second, since all prompt tokens play the same role of prompting for all image tokens without distinction, it lacks the fine-grained prompting capability, i. e., individual prompting for different image tokens. In this work, we propose the Spatially Aligned-and-Adapted Visual Prompt model (SA2VP), which learns a two-dimensional prompt token map with equal (or scaled) size to the image token map, thereby being able to spatially align with the image map. Each prompt token is designated to prompt knowledge only for the spatially corresponding image tokens. As a result, our model can conduct individual prompting for different image tokens in a fine-grained manner. Moreover, benefiting from the capability of preserving the spatial structure by the learned prompt token map, our SA2VP is able to model the spatial relations in the input image, leading to more effective prompting. Extensive experiments on three challenging benchmarks for image classification demonstrate the superiority of our model over other state-of-the-art methods for visual prompt tuning. Code is available at https://***/tommy-xq/SA2VP.
In today's day and age, where huge quantities of textual data are generated every second, it has become difficult to keep ourselves abreast with new information. Documents in the financial sector tell a quantitati...
详细信息
暂无评论