检索结果-内蒙古大学图书馆

Affective knowledge assisted bi-directional learning for multi-modal aspect-based sentiment analysis

COMPUTER SPEECH AND LANGUAGE 2025年 91卷

作者： Shi, Xuefeng Yang, Ming Hu, Min Ren, Fuji Kang, Xin Ding, Weiping Nantong Univ Sch Artificial Intelligence & Comp Sci Nantong 226019 Jiangsu Peoples R China Hefei Univ Technol Sch Comp & Informat Danxia Rd Hefei 230601 Anhui Peoples R China Univ Elect Sci & Technol China Sch Comp Sci & Engn Xiyuan Rd Chengdu 611731 Sichuan Peoples R China Tokushima Univ Dept Comp Sci Shinkuracho 2-24 Tokushima 7708506 Japan

As a fine-grained task in the community of multi-modal sentiment analysis (MSA), multi- modal aspect-based sentiment analysis (MABSA) is challenging and has attracted numerous researchers' attention, and prominent progress has been achieved in recent years. However, there is still a lack of effective strategies for feature alignment between different modalities, and further exploration is urgently needed. Thus, this paper proposed a novel MABSA method to enhance the sentiment feature alignment, namely Affective Knowledge-Assisted Bi-directional Learning (AKABL) networks, which learn sentiment information from different modalities through multiple perspectives. Specifically, AKABL gains the textual semantic and syntactic features through encoding text modality via pre-trained language model BERT and Syntax Parser SpaCy, respectively. And then, to strengthen the expression of sentiment information in the syntactic graph, affective knowledge SenticNet is introduced to assist AKABL in comprehending textual sentiment information. On the other side, to leverage image modality efficiently, the pre-trained model Visual Transformer (ViT) is employed to extract the necessary image features. Additionally, to integrate the obtained features, this paper utilizes the module Single modality GCN (SMGCN) to achieve the joint textual semantic and syntactic representation. And to bridge the textual and image features, the module Double modalities GCN (DMGCN) is devised and applied to extract the sentiment information from different modalities simultaneously. Besides, to bridge the alignment gap between text and image features, this paper devises a novel alignment strategy to build the relationship between these two representations, which measures that difference with the Jensen-Shannon divergence from bi-directional perspectives. It is worth noting that cross-attention and cosine distance-based similarity are also applied in the proposed AKABL. To validate the effectiveness of the pro

关键词： Affective knowledge Bi-directional learning Cross-attention Cosine similarity multi-modal aspect-based sentiment analysis

来源：评论

学校读者我要写书评

暂无评论

Dual-layer contrastive learning for aspect-aligned multimodal sentiment analysis

引用

APPLIED INTELLIGENCE 2025年第7期55卷 1-18页

作者： Guo, Junjun Yan, Zida Zhang, Guanghua Kunming Univ Sci & Technol Fac Informat Engn & Automation Kunming 650500 Yunnan Peoples R China Yunnan Key Lab Artificial Intelligence Kunming 650500 Yunnan Peoples R China Xi An Jiao Tong Univ Fac Elect & Informat Engn Sch Automat Sci & Engn Minist Educ Key Lab Intelligent Networks Xian 710049 Shanxi Peoples R China

multi-modal aspect-based sentiment analysis (MABSA) aims to identify the sentiment polarity of aspects by incorporating visual information into text. Image and text are two types of modality information with significant modality gaps in both data form and semantic expression. Narrowing the modality gaps and feature fusion are two crucial challenges in MABSA. To address these issues, this paper introduces an aspect-enhanced alignment and fusion strategy with dual-layer contrastive learning to tackle the cross-modal fusion problem. Unlike traditional contrastive learning methods, our approach increases the number of negative samples, enabling the model to learn more discriminative features and better capture fine-grained cross-modal relationships. The proposed approach leverages overlapping aspect information as multi-modal pivots to first bridge the modality gaps and then integrate visual and text information in the multi-modal feature space, thereby improving multi-modal sentiment analysis performance. We first introduce an aspect-guided modality alignment strategy that narrows the fundamental modality gaps between image and text using modality contrastive learning. Then, we design an aspect-oriented multi-modal fusion approach to promote cross-modal feature fusion through symmetric cross-modal interaction. Extensive experiments demonstrate that the proposed approach outperforms other state-of-the-art (SOTA) MABSA methods on three MABSA benchmark datasets. In-depth analysis further validates the effectiveness of the proposed multi-modal fusion approach for MABSA.

关键词： multi-modal aspect-based sentiment analysis Align-then-Fusion Transformer Feature fusion

来源：评论

学校读者我要写书评

暂无评论

Local Feature Alignment Prompt-Tuning for Few-shot multimodal aspect sentiment analysis

Local Feature Alignment Prompt-Tuning for Few-shot Multimoda...

引用

2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025

作者： Ding, Meirong Zou, Chuang Lin, Hongyi School of Artificial Intelligence South China Normal University Guangzhou China

ISBN: (纸本)9798350368741

multi-modal aspect-oriented sentiment classification (MASC) is a fine-grain task, which aims to detect the sentiment polarity of specific aspect. However, conventional studies suffer from two issues. It is difficult to collect the annotated multi-modal data in fine-grained domains. Meanwhile, the local information corresponding to aspect words in the image has not been mined, and redundant information can affect the accuracy of fine-grained sentiment analysis. To alleviate the above two issues, we propose a Prompt-tuning method based on Alignment between aspect and Local Images (PAALI). Our approach introduces a novel multi-modal prompt template to bridge the gap between text and visual data modalities. Furthermore, we employ a strategy of randomly masking image patches to align them with aspect word features, capturing deeper semantic information. Extensive experiments on multiple benchmark datasets in few-shot settings consistently demonstrate the superiority and robustness of our PAALI method over state-of-the-art competitors. © 2025 IEEE.

关键词： Few-shot learning multi-modal aspect-based sentiment analysis Prompt tuning

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：