版权所有:内蒙古大学图书馆 技术提供:维普资讯• 智图
内蒙古自治区呼和浩特市赛罕区大学西街235号 邮编: 010021
作者机构:Guangxi Normal Univ Key Lab Educ Blockchain & Intelligent Technol Minist Educ Guilin 541004 Peoples R China Guangxi Normal Univ Guangxi Key Lab Multisource Informat Min & Secur Guilin 541004 Peoples R China
出 版 物:《MULTIMEDIA SYSTEMS》 (Multimedia Syst)
年 卷 期:2025年第31卷第1期
页 面:1-15页
核心收录:
学科分类:08[工学] 0812[工学-计算机科学与技术(可授工学、理学学位)]
基 金:National Natural Science Foundation of China [62276073, 61966004] Innovation Project of Guangxi Graduate Education [YCBZ2024115] Guangxi "Bagui Scholar" Teams for Innovation and Research Project Guangxi Collaborative Innovation Center of Multi-source Information Integration and Intelligent Processing
主 题:Multimodal sentiment analysis Multi-task learning Representation learning Contrastive learning Feature decomposition
摘 要:Multimodal sentiment analysis (MSA) aims to predict human emotions via language, visual and acoustic modalities. Based on the idea of extracting modalities into modality-invariant and modality-specific components, feature decomposition methods decompose features into similarity and dissimilarity features. However, dissimilarity features contribute less than similarity features due to emotional divergence. And they have not been effectively utilized due to insufficient methods for feature extraction and processing. To address these issues, we propose a dissimilarity feature-driven decomposition network (DFDDN) for MSA. We deploy a feature extraction module to extract and enhance features. This not only increases the differences between the features, but also enables us to focus more on the emotional information contained in the features. We utilize different encoders to decompose features, and design loss functions to increase the differences of modality features. Compared to state-of-the-art (SOTA) method on the CMU-MOSI dataset, there are improvements of 0.35%/1.57% and 0.28%/2.09% in Acc2 and macro F1, a 3.31% improvement in Acc7, and the MAE decreases by 0.1. Compared to SOTA method on the CMU-MOSEI dataset, there is an improvement of 1.34% in Acc7, the Corr increases by 0.016, and the MAE decreases by 0.005.