检索结果-内蒙古大学图书馆

From Extraction to Generation: Multimodal Emotion-Cause Pair Generation in Conversations

IEEE Transactions on Affective Computing 2024年第2期16卷 1-12页

作者： Ma, Heqing Yu, Jianfei Wang, Fanfan Cao, Hanyu Xia, Rui School of Computer Science and Engineering Nanjing University of Science and Technology Nanjing Jiangsu China

As an important task in emotion analysis, Multimodal Emotion-Cause Pair Extraction in conversations (MECPE) aims to extract all the emotion-cause utterance pairs from a conversation. However, there are two shortcomings in the MECPE task: 1) it ignores emotion utterances whose causes cannot be located in the conversation but require contextualized inference;2) it fails to locate the exact causes that occur in vision or audio modalities beyond text. To address these issues, in this paper, we introduce a new task named Multimodal Emotion-Cause Pair Generation in Conversations (MECPG), which aims to identify the emotion utterances with their emotion categories and generate their corresponding causes in a conversation. To tackle the MECPG task, we construct a dataset based on a benchmark corpus for MECPE. We further propose a generative framework named MONICA, which jointly performs emotion recognition and emotion cause generation with a sequence-to-sequence model. Experiments on our annotated dataset show the superiority of MONICA over several competitive systems. Our dataset and source codes will be publicly released. IEEE

关键词： Emotion Recognition

来源：评论

学校读者我要写书评

暂无评论

Task offloading delay minimization in vehicular edge computing based on vehicle trajectory prediction

引用

Digital Communications and Networks 2025年第2期11卷 537-546页

作者： Feng Zeng Zheng Zhang Jinsong Wu School of Computer Science and Engineering Central South UniversityChangsha410083China School of Artificial Intelligence Guilin University of Electronic TechnologyGuilin541000China The Department of Computer Science Universidad de ChileSantiagoChile

In task offloading,the movement of vehicles causes the switching of connected RSUs and servers,which may lead to task offloading failure or high service *** this paper,we analyze the impact of vehicle movements on task offloading and reveal that data preparation time for task execution can be minimized via forward-looking ***,a Bi-LSTM-based model is proposed to predict the trajectories of *** service area is divided into several equal-sized *** the actual position of the vehicle and the predicted position by the model belong to the same grid,the prediction is considered correct,thereby reducing the difficulty of vehicle trajectory ***,we propose a scheduling strategy for delay optimization based on the vehicle trajectory *** the inevitable prediction error,we take some edge servers around the predicted area as candidate execution servers and the data required for task execution are backed up to these candidate servers,thereby reducing the impact of prediction deviations on task offloading and converting the modest increase of resource overheads into delay reduction in task *** results show that,compared with other classical schemes,the proposed strategy has lower average task offloading delays.

关键词： Vehicular edge computing Task offloading Vehicle trajectory prediction Delay minimization Bi-LSTM model

来源：评论

学校读者我要写书评

暂无评论

Towards a National CAV Certification Center

引用

IEEE Transactions on Intelligent Transportation Systems 2024年第4期25卷 29-53页

作者： Qiao, Chunming Sadek, Adel Computer Science and Engineering Civil Structural and Environmental Engineering University at Buffalo United States

Connected and Autonomous Vehicles (CAVs) hold great promise to transform our current transportation system to a safer, more resilient and efficient Cyber Transportation System (CTS) that integrates advanced sensing, communications and control based on IoT, V2X and AI/ML technologies. However, many open challenges related to modeling human-Automation interaction, improving resiliency to adversarial conditions, and finding 'killer' applications for CAVs remain. Above all, a national CAV safety certification based on AR/VR and digital twin technologies is a key to gaining public (and market) trust and acceptance. In this talk, I will briefly describe our past and current work to address the above research and development challenges, aiming to rally all stakeholders around the establishment of a national CAV certification center. © 2000-2011 IEEE.

关键词： Vehicle to Everything

来源：评论

学校读者我要写书评

暂无评论

Byzantine Robust Federated Learning Scheme Based on Backdoor Triggers

引用

computers, Materials & Continua 2024年第5期79卷 2813-2831页

作者： Zheng Yang Ke Gu Yiming Zuo School of Computer and Communication Engineering Changsha University of Science and TechnologyChangsha410114China

Federated learning is widely used to solve the problem of data decentralization and can provide privacy protectionfor data owners. However, since multiple participants are required in federated learning, this allows attackers tocompromise. Byzantine attacks pose great threats to federated learning. Byzantine attackers upload maliciouslycreated local models to the server to affect the prediction performance and training speed of the global model. Todefend against Byzantine attacks, we propose a Byzantine robust federated learning scheme based on backdoortriggers. In our scheme, backdoor triggers are embedded into benign data samples, and then malicious localmodels can be identified by the server according to its validation dataset. Furthermore, we calculate the adjustmentfactors of local models according to the parameters of their final layers, which are used to defend against datapoisoning-based Byzantine attacks. To further enhance the robustness of our scheme, each localmodel is weightedand aggregated according to the number of times it is identified as malicious. Relevant experimental data showthat our scheme is effective against Byzantine attacks in both independent identically distributed (IID) and nonindependentidentically distributed (non-IID) scenarios.

关键词： Federated learning Byzantine attacks backdoor triggers

来源：评论

学校读者我要写书评

暂无评论

Enhance the Performance of Directional Feature-based Palmprint Recognition by Directional Response Stability Measurement

引用

Machine Intelligence Research 2024年第3期21卷 597-614页

作者： Haitao Wang Wei Jia School of Computer Science and Information Engineering Hefei University of TechnologyHefei230009China

Palmprint recognition is an emerging biometrics technology that has attracted increasing attention in recent years. Many palmprint recognition methods have been proposed, including traditional methods and deep learning-based methods. Among the traditional methods, the methods based on directional features are mainstream because they have high recognition rates and are robust to illumination changes and small noises. However, to date, in these methods, the stability of the palmprint directional response has not been deeply studied. In this paper, we analyse the problem of directional response instability in palmprint recognition methods based on directional feature. We then propose a novel palmprint directional response stability measurement (DRSM) to judge the stability of the directional feature of each pixel. After filtering the palmprint image with the filter bank, we design DRSM according to the relationship between the maximum response value and other response values for each pixel. Using DRSM, we can judge those pixels with unstable directional response and use a specially designed encoding mode related to a specific method. We insert the DRSM mechanism into seven classical methods based on directional feature, and conduct many experiments on six public palmprint databases. The experimental results show that the DRSM mechanism can effectively improve the performance of these methods. In the field of palmprint recognition, this work is the first in-depth study on the stability of the palmprint directional response, so this paper has strong reference value for research on palmprint recognition methods based on directional features.

关键词： Biometrics palmprint recognition directional response stability directional coding-based methods directional feature

来源：评论

学校读者我要写书评

暂无评论

Data-Driven Collaborative Scheduling Method for Multi-Satellite Data-Transmission

引用

Tsinghua science and Technology 2024年第5期29卷 1463-1480页

作者： Xiaoyu Chen Weichao Gu Guangming Dai Lining Xing Tian Tian Weilai Luo Shi Cheng Mengyun Zhou School of Computer Science China University of GeosciencesWuhan 430074China School of Electronic Engineering Xidian UniversityXi’an 710071China School of Computer Science Shaanxi Normal UniversityTaiyuan 710119China

With continuous expansion of satellite applications,the requirements for satellite communication services,such as communication delay,transmission bandwidth,transmission power consumption,and communication coverage,are becoming *** paper first presents an overview of the current development status of Low Earth Orbit(LEO)satellite constellations,and then conducts a demand analysis for multi-satellite data transmission based on LEO satellite *** problem is described,and the challenges and difficulties of the problem are analyzed *** this basis,a multi-satellite data-transmission mathematical model is then *** classical heuristic allocating strategies on the features of the proposed model,with the reinforcement learning algorithm Deep Q-Network(DQN),a two-stage optimization framework based on heuristic and DQN is ***,by taking into account the spatial and temporal distribution characteristics of satellite and facility resources,a multi-satellite scheduling instance dataset is *** results validate the rationality and correctness of the DQN algorithm in solving the collaborative scheduling problem of multi-satellite data transmission.

关键词： relay satellite scheduling data transmission Deep Q-Network(DQN) Genetic Algorithm(GA)

来源：评论

学校读者我要写书评

暂无评论

CMDCF: an effective cross-modal dense cooperative fusion network for RGB-D SOD

引用

Neural Computing and Applications 2024年第23期36卷 14361-14378页

作者： Jia, XingZhao Zhao, WenXiu Wang, YuMei DongYe, ChangLei Peng, YanJun College of Computer Science and Engineering Shandong University of Science and Technology Qingdao266590 China

The success of vision transformer demonstrates that the transformer structure is also suitable for various vision tasks, including high-level classification tasks and low-level dense prediction tasks. Salient object detection (SOD) is a pixel-level dense prediction task that simulates the most salient objects in human visual recognition scenarios. In recent years, depth images have been widely used for salient object detection. Compared with RGB SOD, the key point of RGB-D SOD is the effective fusion of depth information. As RGB-D SOD requires extracting depth features and fusing cross-modal information, additional computation is involved. However, except for lightweight models, most RGB-D SOD methods tend to obtain better prediction maps by consuming more computational resources. We propose a cross-modal dense cooperative fusion net, which provides state-of-the-art performance with less computation and parameters. We take advantage of the ability of the transformer structure to model long sequence dependencies to extract saliency features from RGB images. Since there is less information in the depth image than in the RGB image, it is not necessary to use the same structure in the depth stream. For the sake of reducing parameters and computation, we consider the asymmetric architecture. It is enough to meet our needs that deep features extracted by lightweight MobileV2Net. Our decoder can perform dense cooperative fusion of cross-modal information while decoding features. It can both effectively fuse cross-modal information and save computation. Comprehensive experiments on multiple benchmark datasets for RGB-D SOD show that compared with SOTA methods, our method performs much better with less computation and parameters. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.

关键词： Object detection

来源：评论

学校读者我要写书评

暂无评论

Efficient breast cancer detection using neural networks and explainable artificial intelligence

引用

Neural Computing and Applications 2025年第5期37卷 3759-3776页

作者： Murugan, Tamilarasi Kathirvel Karthikeyan, Pritikaa Sekar, Pavithra School of Computer Science Engineering Vellore Institute of Technology Tamilnadu Chennai India

The growing dependence on deep learning models for medical diagnosis underscores the critical need for robust interpretability and transparency to instill trust and ensure responsible usage. This study investigates the efficacy of various explainable artificial intelligence (XAI) techniques in comprehending deep learning models utilized for breast cancer classification from down sampled histopathology images. A comparative assessment of multiple convolutional neural network (CNN) architectures, encompassing standard CNNs, ResNet, VGG-16, and VGG-19, on down sampled images was conducted. The primary goal is to pinpoint the model exhibiting the highest accuracy and subsequently employ three prominent XAI methods—LIME, SHAP, and Saliency Maps—to get insights into the top-performing model. This study identifies VGG-19 as the best-performing model with an accuracy of 92.59% and demonstrates that among various XAI techniques, LIME provides the most accurate and clinically relevant explanations for breast cancer classification from down sampled histopathology images. These findings, validated by medical professionals, enhance the interpretability and reliability of deep learning models in clinical settings, promoting their responsible integration into healthcare practices. This validation was further corroborated through consultation with medical professionals, including doctors specializing in breast cancer diagnosis. This research endeavors to deepen the understanding of the model’s rationale and instill confidence in its outputs. The outcomes of this study hold significant promise in elevating the interpretability and reliability of deep learning models tailored for breast cancer diagnosis, thus facilitating their responsible integration into clinical settings. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.

关键词： Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

Active self-training for weakly supervised 3D scene semantic segmentation

引用

Computational Visual Media 2024年第3期10卷 425-438页

作者： Gengxin Liu Oliver van Kaick Hui Huang Ruizhen Hu College of Computer Science&Software Engineering Shenzhen UniversityShenzhen 518060China School of Computer Science Carleton UniversityOttawa K1S 5B6Canada

Since the preparation of labeled datafor training semantic segmentation networks of pointclouds is a time-consuming process, weakly supervisedapproaches have been introduced to learn fromonly a small fraction of data. These methods aretypically based on learning with contrastive losses whileautomatically deriving per-point pseudo-labels from asparse set of user-annotated labels. In this paper, ourkey observation is that the selection of which samplesto annotate is as important as how these samplesare used for training. Thus, we introduce a methodfor weakly supervised segmentation of 3D scenes thatcombines self-training with active learning. Activelearning selects points for annotation that are likelyto result in improvements to the trained model, whileself-training makes efficient use of the user-providedlabels for learning the model. We demonstrate thatour approach leads to an effective method that providesimprovements in scene segmentation over previouswork and baselines, while requiring only a few userannotations.

关键词： semantic segmentation weakly supervised self-training active learning

来源：评论

学校读者我要写书评

暂无评论

An Improved Graph Partitioning Algorithm Based Approach for Workflow Offloading in a Fog Environment

引用

Journal of The Institution of Engineers (India): Series B 2025年第2期106卷 623-634页

作者： Mahajan, Neetu Narang Kaur, Parmeet Department of Computer Science and Engineering Jaypee Institute of Information Technology Noida India

The paper addresses the critical problem of application workflow offloading in a fog environment. Resource constrained mobile and Internet of Things devices may not possess specialized hardware to run complex workflows locally and hence, need to offload these tasks to fog nodes. As compared to cloud-based servers, fog nodes can provide responses in a more-timely manner and are preferred for latency-sensitive applications. Workflow applications are characterized by inter-task dependencies and hence, can be readily represented as directed acyclic graphs. Therefore, the proposed offloading solution approach utilizes an improved graph partitioning algorithm based on the Louvain community detection algorithm. The aim of the algorithm is to partition the workflow graph in such a manner that the workflow tasks having high communication costs between them are transferred or offloaded to the same fog node. The benefits of the proposed algorithm have been verified by simulation experiments where it was observed that it results in a lower makespan as compared to the related approaches. © The Institution of Engineers (India) 2024.

关键词： Fog

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：