The surging popularity of online movie databases has created a challenge for viewers: choosing a film from a massive library can be overwhelming. In this paper, it proposes to design a new hybrid movie recommendation ...
详细信息
The basis of this project is to investigate whether the YOLO, an object detection algorithm where 'You Only Look Once' constitutes the name, could be applied to develop FMCG management;together with the manage...
详细信息
The goal of this project is to draw a deeper understanding of the subjective nature behind online product reviews, largely by examining a large dataset received from Amazon that contains numerous star ratings and comm...
详细信息
Innovative technology solutions have been developed in response to the growing need for effective and customized client contact on e-commerce platforms. This work introduces an intelligent chatbot system that uses mac...
详细信息
Poverty is considered a serious global issue that must be immediately eradicated by Sustainable Development Goals (SDGs) 1, namely ending poverty anywhere and in any form. As a developing country, poverty is a complex...
详细信息
作者:
Zhong, WenjieSun, TaoZhou, Jian-TaoWang, ZhuoweiSong, XiaoyuInner Mongolia University
College of Computer Science the Engineering Research Center of Ecological Big Data Ministry of Education the Inner Mongolia Engineering Laboratory for Cloud Computing and Service Software the Inner Mongolia Engineering Laboratory for Big Data Analysis Technology Hohhot010000 China Guangdong University of Technology
School of Computer Science and Technology Guangzhou510006 China Portland State University
Department of Electrical and Computer Engineering PortlandOR97207 United States
Colored Petri nets (CPNs) provide descriptions of the concurrent behaviors for software and hardware. Model checking based on CPNs is an effective method to simulate and verify the concurrent behavior in system design...
详细信息
Neural architecture search (NAS) has received increasing attention because of its exceptional merits in automating the design of deep neural network (DNN) architectures. However, the performance evaluation process, as...
详细信息
The increasing popularity of Graph-based neural network architectures plays a pivotal role in providing promising results in applications, viz., Friendship networks, Co-authorship networks, Product recommendations, et...
详细信息
Image captioning,the task of generating descriptive sentences for images,has advanced significantly with the integration of semantic ***,traditional models still rely on static visual features that do not evolve with ...
详细信息
Image captioning,the task of generating descriptive sentences for images,has advanced significantly with the integration of semantic ***,traditional models still rely on static visual features that do not evolve with the changing linguistic context,which can hinder the ability to form meaningful connections between the image and the generated *** limitation often leads to captions that are less accurate or *** this paper,we propose a novel approach to enhance image captioning by introducing dynamic interactions where visual features continuously adapt to the evolving linguistic *** model strengthens the alignment between visual and linguistic elements,resulting in more coherent and contextually appropriate ***,we introduce two innovative modules:the Visual Weighting Module(VWM)and the Enhanced Features Attention Module(EFAM).The VWM adjusts visual features using partial attention,enabling dynamic reweighting of the visual inputs,while the EFAM further refines these features to improve their relevance to the generated *** continuously adjusting visual features in response to the linguistic context,our model bridges the gap between static visual features and dynamic language *** demonstrate the effectiveness of our approach through experiments on the MS-COCO dataset,where our method outperforms state-of-the-art techniques in terms of caption quality and contextual *** results show that dynamic visual-linguistic alignment significantly enhances image captioning performance.
Story video-text alignment, a core task in computational story understanding, aims to align video clips with corresponding sentences in their descriptions. However, progress on the task has been held back by the scarc...
详细信息
暂无评论