As the digital transformation of education continues to advance, the inefficiency and subjectivity of traditional manual scoring methods have become increasingly prominent. To address this issue, this study developed ...
详细信息
Even though large language models (LLMs) have demonstrated remarkable performance across various naturallanguageprocessing tasks, their application in speech-related tasks has largely remained underexplored. This wo...
详细信息
Currently, there is a significant gap in the conversion of mathematical theorems from naturallanguage to logical expressions, specifically in the form of first-order predicate logic. To address this issue, this paper...
详细信息
Although large language models and temporal knowledge graphs each have significant advantages in the field of artificial intelligence, they also face certain challenges. However, through collaboration, large language ...
详细信息
ISBN:
(纸本)9789819770069;9789819770076
Although large language models and temporal knowledge graphs each have significant advantages in the field of artificial intelligence, they also face certain challenges. However, through collaboration, large language models and temporal knowledge graphs can complement each other, addressing their respective shortcomings. This collaborative approach aims to harness the potential feasibility and practical effectiveness of large language models as external knowledge bases for temporal knowledge graph reasoning tasks. In our research, we have meticulously designed a synergized model that leverages the knowledge from the graph as prompts. The answers generated by the large language model undergo careful processing before being seamlessly incorporated into the training dataset. The ultimate goal is to significantly enhance the reasoning capabilities of temporal knowledge graphs. Experimental results underscore the positive impact of this synergized model on the completion tasks of temporal knowledge graphs, showcasing its potential to address gaps in knowledge and improve overall performance. While its influence on prediction tasks is relatively weak, the collaborative synergy demonstrates promising avenues for further exploration and development in the realm of AI research.
Intention detection and slot filling are two main tasks in the field of naturallanguage understanding in naturallanguageprocessing. Since the two tasks are highly correlated, the two tasks are often modeled jointly...
详细信息
People with Visual Impairments are vulnerable to various challenges in their everyday lives ranging from Access to Information, Travelling to places, Recognizing their surroundings, Isolated Lifestyle and much more. T...
详细信息
With the rapid growth of online content, there is an increasing demand for efficient tools that facilitate content understanding and knowledge retention. This research focuses on the development of an AI-powered syste...
详细信息
The automatic development of meaningful, detailed textual descriptions for supplied images is a difficult task in the fields of computer vision and naturallanguageprocessing. As a result, an AI-powered image caption...
详细信息
The automatic development of meaningful, detailed textual descriptions for supplied images is a difficult task in the fields of computer vision and naturallanguageprocessing. As a result, an AI-powered image caption generator can be incredibly useful for producing captions. In this study, we present a unique method for creating picture captions utilizing an attention mechanism that concentrates on pertinent areas of the image while it creates captions. On benchmark datasets, our model, which uses deep neural networks to extract picture attributes and produce captions, obtains state-of-the-art results, confirming the effectiveness of the attention mechanism in raising the caliber of the generated captions. We also offer a thorough evaluation of the performance of our approach and talk about potential future directions for enhancing image caption generation.
Visual Chinese Character Checking (C3) aims to detect and correct errors in handwritten Chinese text images, including faked characters and misspelled characters. This task is beneficial for subsequent tasks by improv...
详细信息
ISBN:
(纸本)9789819794423;9789819794430
Visual Chinese Character Checking (C3) aims to detect and correct errors in handwritten Chinese text images, including faked characters and misspelled characters. This task is beneficial for subsequent tasks by improving the efficiency of identifying errors in handwritten text. Recent methods are mainly based on Optical Character Recognition (OCR) and Pre-trained language Models (PLMs). Visual Chinese Character Checking is an emerging task, and relevant research has made progress. However, we believe that existing work has not fully leveraged the inherent knowledge of pre-trained models and has not addressed the semantic bias issue between pre-trained models and the character checking task. These challenges result in deficiencies in recognizing misspelled Chinese characters and correcting misused characters. Therefore, we propose various multimodal contrastive learning methods based on image-to-image and image-to-text comparisons. These methods are used throughout the processes of character recognition, error detection, and correction. By aligning the semantic feature representations among different models, our approach makes these models more suitable for the Visual Chinese Character Checking task, thereby enhancing their capabilities.
The following discusses a feasibility review being carried out for the use of naturallanguageprocessing in the context of capturing Model-to-Model transformations in visual models, in particular, the extraction of i...
详细信息
暂无评论