检索结果-内蒙古大学图书馆

32nd ACM International Conference on Multimedia, MM 2024

作者： Zhang, Zefan Zhang, Weiqi Li, Yanhui Bai, Tian College of Computer Science and Technology Jilin University Changchun China Key Laboratory of Symbolic Computation and Knowledge Engineering Ministry of Education Jilin University Changchun China

ISBN: (纸本)9798400706868

Multimodal Relation Extraction (MRE) has achieved great improvements. However, modern MRE models are easily affected by irrelevant objects during multimodal alignment which are called error sensitivity issues. The main reason is that visual features are not fully aligned with textual features and the reasoning process may suppress redundant and noisy information at the risk of losing critical information. In light of this, we propose a Caption-Aware Multimodal Relation Extraction Network with Mutual Information Maximization (CAMIM). Specifically, we first generate detailed image captions through the Large Language Model (LLM). Then, the Caption-Aware Module (CAM) hierarchically aligns the fine-grained visual entities and textual entities for reasoning. In addition, for preserving crucial information within different modalities, we leverage a Mutual Information Maximization method to regulate the multimodal reasoning module. Experiments show that our model outperforms the state-of-the-art MRE models on the benchmark dataset MNRE. Further ablation studies prove the pluggable and effective performance of our Caption-Aware Module and Mutual Information Maximization method. Our code is available at https://***/zefanZhang-cn/CAMIM. © 2024 ACM.

关键词： Modeling languages

来源：评论

学校读者我要写书评

暂无评论

Harmfully Manipulated Images Matter in Multimodal Misinformation Detection 24

Harmfully Manipulated Images Matter in Multimodal Misinforma...

引用

32nd ACM International Conference on Multimedia, MM 2024

作者： Wang, Bing Wang, Shengsheng Li, Changchun Guan, Renchu Li, Ximing College of Computer Science and Technology Jilin University Jilin Changchun China Key Laboratory of Symbolic Computation and Knowledge Engineering of the Ministry of Education Jilin University China

ISBN: (纸本)9798400706868

Nowadays, misinformation is widely spreading over various social media platforms and causes extremely negative impacts on society. To combat this issue, automatically identifying misinformation, especially those containing multimodal content, has attracted growing attention from the academic and industrial communities, and induced an active research topic named Multimodal Misinformation Detection (MMD). Typically, existing MMD methods capture the semantic correlation and inconsistency between multiple modalities, but neglect some potential clues in multimodal content. Recent studies suggest that manipulated traces of the images in articles are non-trivial clues for detecting misinformation. Meanwhile, we find that the underlying intentions behind the manipulation, e.g., harmful and harmless, also matter in MMD. Accordingly, in this work, we propose to detect misinformation by learning manipulation features that indicate whether the image has been manipulated, as well as intention features regarding the harmful and harmless intentions of the manipulation. Unfortunately, the manipulation and intention labels that make these features discriminative are unknown. To overcome the problem, we propose two weakly supervised signals as alternatives by introducing additional datasets on image manipulation detection and formulating two classification tasks as positive and unlabeled learning problems. Based on these ideas, we propose a novel MMD method, namely Harmfully Manipulated Images Matter in MMD (Hami-m3d). Extensive experiments across three benchmark datasets can demonstrate that Hami-m3d can consistently improve the performance of any MMD baselines. © 2024 ACM.

关键词： image manipulation

来源：评论

学校读者我要写书评

暂无评论

Few-Shot Fine-Grained Classification of Histological Images 1

Few-Shot Fine-Grained Classification of Histological Images

引用

1st IEEE International Conference on Medical Artificial Intelligence, MedAI 2023

作者： Jiang, Yingdong Huang, Jing Jin, Zihe Shen, Leqi Zhang, Ziyi College of Software Jilin University Key Laboratory of Symbolic Computation and Knowledge Engineering Ministry of Education ChangChun China College of Computer Science and Technology Jilin University Key Laboratory of Symbolic Computation and Knowledge Engineering Ministry of Education ChangChun China College of Computer Science and Technology Jilin University ChangChun China

ISBN: (纸本)9798350358780

Histological image classification plays a crucial role in cancer diagnosis. However, the acquisition of well-labeled histological images is prohibitively expensive, and obtaining rare abnormal samples is challenging. Therefore, applying few-shot learning methods to histological image classification tasks holds significant clinical value. Nevertheless, existing research predom-inantly relies on coarse-grained image classification approaches based on natural image datasets, which struggle to address the fine-grained challenges encountered in histological image classification, such as intra-class diversity and inter-class similarity. To tackle this issue, this study proposes a novel few-shot fine-grained classification method for histological images, named 'Category-Aware Feature Map Reconstruction Network.' This method employs channel weights to localize the differences between inter-class and intra-class regions, composed of intra-class channel weights and inter-class channel weights, collectively referred to as category-aware weights. Specifically, intra-class channel weights indicate the matching degree of salient regions within the support set of a particular class, while inter-class channel weights represent the degree of containing distinct information between classes. The category-aware weights are utilized to transform the support feature maps and query feature maps, generating feature maps that capture differentiating details between categories. Finally, the distance between the transformed query feature map and support feature map is calculated to achieve probabilistic predictions for the categories. On a histological few-shot dataset, this method achieves an accuracy of 90.23% using ResNet-12 as the feature extractor, surpassing the baseline model by 5.24% and outperforming other few-shot methods by at least 10% in the 5-way 10-shot experimental setting. The proposed method exhibits exceptional performance on histological image few-shot datasets, playing a

关键词： Computer aided diagnosis

来源：评论

学校读者我要写书评

暂无评论

Vertical Traffic Scheduling Control Method Based On Dual Fuzzy Neural Network

Vertical Traffic Scheduling Control Method Based On Dual Fuz...

引用

2023 International Annual Conference on Complex Systems and Intelligent Science, CSIS-IAC 2023

作者： Sun, Xinhao An, Siqi Gao, Xiaoting Cui, Enchang College of Light Industry Liaoning University Shenyang China Jilin University Key Laboratory of Symbolic Symbolic Computation and Knowledge Engineering of Ministry of Education Changchun130012 China

ISBN: (纸本)9798350309003

With the complexity of the functions of modern buildings, the problem of vertical traffic in buildings is becoming more and more prominent. As the only vertical transportation, the elevator is a necessary prerequisite for the development of modern buildings to solve its group control distribution problem. In this paper, a vertical traffic scheduling control method is proposed based on dual fuzzy neural network, a recognition strategy is designed to identify the passenger flow models. The dual fuzzy neural network is used to first identify the traffic network model in which the elevator is located, and then combine the corresponding weights and the confidence of the group control allocation to complete the group control scheduling. Finally, the proposed method is verified through a semi-physical simulation platform, proving the correctness and effectivness for traffic pattern recognition and scheduling strategy optimization of group controllers. © 2023 IEEE.

关键词： Fuzzy neural networks

来源：评论

学校读者我要写书评

暂无评论

Local feature aggregation algorithm based on graph convolutional network

引用

Frontiers of Computer Science 2022年第3期16卷 203-205页

作者： Hao WANG Liyan DONG Minghui SUN College of Computer Science and Technology Jilin UniversityChangchun 130012China Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education Jilin UniversityChangchun 130012China

1Introduction and main contributions In the field of social networks and knowledge graphs,semi-supervised learning models based on graph convolutional networks have achieved great success in node classification[1],inductive node embedding[2],link prediction[3],and *** semi-supervised models based on graph convolutional network(GCN)[4]expect to obtain more feature information of a graph or accelerate the training.

关键词： convolution aggregation semi

来源：评论

学校读者我要写书评

暂无评论

More Flexible Proximity Wildcards Path Planning with Compressed Path Databases 34

More Flexible Proximity Wildcards Path Planning with Compres...

引用

34th International Conference on Automated Planning and Scheduling, ICAPS 2024

作者： Chen, Xi Zhang, Yue Zhang, Yonggang College of Software Jilin University China College of Computer Science and Technology Jilin University China Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education Jilin University China

ISBN: (纸本)9781577358893

Grid-based path planning is one of the classic problems in AI, and a popular topic in application areas such as computer games and robotics. Compressed Path Databases (CPDs) are recognized as a state-of-the-art method for grid-based path planning. It is able to find an optimal path extremely fast without state-space search. In recent years, researchers have tended to focus on improving CPDs by reducing CPD size or improving search performance. Among various methods, proximity wildcards are one of the most proven improvements in reducing the size of CPD. However, its proximity area is significantly restricted by complex terrain, which significantly affects the pathfinding efficiency and causes more additional costs. In this paper, we enhance CPDs from the perspective of improving search efficiency and reducing search costs. Our work focuses on using more flexible methods to obtain larger proximity areas, so that more heuristic information can be used to improve search performance. Experiments conducted on the Grid-Based Path Planning Competition (GPPC) benchmarks demonstrate that the two proposed methods can effectively improve search efficiency and reduce search costs by up to 3 orders of magnitude. Remarkably, our methods can further reduce the storage cost, and improve the compression capability of CPDs simultaneously. Copyright © 2024, Association for the Advancement of Artificial Intelligence (***). All rights reserved.

关键词： Motion planning

来源：评论

学校读者我要写书评

暂无评论

Attention Guided Enhancement Network for Weakly Supervised Semantic Segmentation

引用

Chinese Journal of Electronics 2023年第4期32卷 896-907页

作者： ZHANG Zhe WANG Bilin YU Zhezhou ZHAO Fengzhi College of Computer Science and Technology Jilin University Key Laboratory for Symbol Computation and Knowledge Engineering of National Education Ministry

Weakly supervised semantic segmentation using only image-level labels is critical since it alleviates the need for expensive pixel-level labels. Most cuttingedge methods adopt two-step solutions that learn to produce pseudo-ground-truth using only image-level labels and then train off-the-shelf fully supervised semantic segmentation network with these pseudo labels. Although these methods have made significant progress, they also increase the complexity of the model and training. In this paper, we propose a one-step approach for weakly supervised image semantic segmentation—attention guided enhancement network(AGEN), which produces pseudopixel-level labels under the supervision of image-level labels and trains the network to generate segmentation masks in an end-to-end manner. Particularly, we employ class activation maps(CAM) produced by different layers of the classification branch to guide the segmentation branch to learn spatial and semantic ***, the CAM produced by the lower layer can capture the complete object region but with many ***, the self-attention module is proposed to enhance object regions adaptively and suppress irrelevant object regions, further boosting the segmentation *** on the Pascal VOC 2012 dataset demonstrate that AGEN outperforms alternative state-of-the-art weakly supervised semantic segmentation methods exclusively relying on image-level labels.

关键词： Training Annotations Semantic segmentation Scalability Semantics Benchmark testing Boosting

来源：评论

学校读者我要写书评

暂无评论

In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought 41

In-Context Decision Transformer: Reinforcement Learning via ...

引用

41st International Conference on Machine Learning, ICML 2024

作者： Huang, Sili Hu, Jifeng Chen, Hechang Sun, Lichao Yang, Bo School of Artificial Intelligence Jilin University China Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education Jilin University China Lehigh University BethlehemPA United States

In-context learning is a promising approach for offline reinforcement learning (RL) to handle online tasks, which can be achieved by providing task prompts. Recent works demonstrated that in-context RL could emerge with self-improvement in a trial-and-error manner when treating RL tasks as an across-episodic sequential prediction problem. Despite the self-improvement not requiring gradient updates, current works still suffer from high computational costs when the across-episodic sequence increases with task horizons. To this end, we propose an In-context Decision Transformer (IDT) to achieve self-improvement in a high-level trial-and-error manner. Specifically, IDT is inspired by the efficient hierarchical structure of human decision-making and thus reconstructs the sequence to consist of high-level decisions instead of low-level actions that interact with environments. As one high-level decision can guide multi-step low-level actions, IDT naturally avoids excessively long sequences and solves online tasks more efficiently. Experimental results show that IDT achieves state-of-the-art in long-horizon tasks over current in-context RL methods. In particular, the online evaluation time of our IDT is 36× times faster than baselines in the D4RL benchmark and 27× times faster in the Grid World benchmark. Copyright 2024 by the author(s)

关键词： Reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

A Simple but Effective Approach for Unsupervised Few-Shot Graph Classification 24

A Simple but Effective Approach for Unsupervised Few-Shot Gr...

引用

33rd ACM Web Conference, WWW 2024

作者： Liu, Yonghao Huang, Lan Cao, Bowen Li, Ximing Giunchiglia, Fausto Feng, Xiaoyue Guan, Renchu College of Computer Science and Technology Jilin University Changchun China Department of Information Engineering and Computer Science University of Trento Trento Italy Key Laboratory of Symbolic Computation and Knowledge Engineering of the Ministry of Education China

ISBN: (纸本)9798400701719

Graphs, as a fundamental data structure, have proven efficacy in modeling complex relationships between objects and are therefore found in wide web applications. Graph classification is an essential task in graph data analysis, which can effectively assist in extracting information and mining content from the web. Recently, few-shot graph classification, a more realistic and challenging task, has garnered great research interest. Existing few-shot graph classification models are all supervised, assuming abundant labeled data in base classes for meta-training. However, sufficient annotation is often challenging to obtain in practice due to high costs or demand for expertise. Moreover, they commonly adopt complicated meta-learning algorithms via episodic training to transfer prior knowledge from base classes. To break free from these constraints, in this paper, we propose a simple yet effective approach named SMART for unsupervised few-shot graph classification without using any labeled data. SMART employs transfer learning philosophy instead of the previously prevailing meta-learning paradigm, avoiding the need for sophisticated meta-learning algorithms. Additionally, we adopt a novel mixup strategy to augment the original graph data and leverage unsupervised pretraining on these data to obtain the expressive graph encoder. We also utilize the prompt tuning technique to alleviate the overfitting and low fine-tuning efficiency caused by the limited support samples of novel classes. Extensive experimental results demonstrate the superiority of our proposed approach, significantly surpassing even leading supervised few-shot graph classification models. Our code is available here. © 2024 ACM.

关键词： Graph neural networks

来源：评论

学校读者我要写书评

暂无评论

A novel hybrid butterfly optimization algorithm for feature selection with sine cosine velocity in the high-dimensional classification data

引用

Journal of Intelligent and Fuzzy Systems 2024年第5-6期47卷 369-391页

作者： Zhang, Li Chen, Xiaobo Key Laboratory of Data Science and Intelligence Education Hainan Normal University Ministry of Education Haikou Hainan China School of Computer Engineering Jiangsu University of Technology Changzhou Jiangsu China Changzhou City Center Branch People's Bank of China Changzhou Jiangsu China Key Laboratory of Symbolic Computation and Knowledge Engineering Ministry of Education Jilin University Changchun China

Aiming at the shortcomings of the traditional butterfly optimization algorithm in solving the high-dimensional classification feature selection problem, which has low convergence and is prone to fall into local optimal solutions, a new hybrid butterfly optimization algorithm is proposed, i.e., HBOA-SCV (A novel hybrid butterfly optimization algorithm with sine cosine velocity). The algorithm is applied to solve a high-dimensional classification feature selection problem. Firstly, the algorithm's global exploration and local exploitation ability can be dynamically balanced by introducing inertia weight coefficients w based on multiple learning strategies. Secondly, using the updated speed position formula of the sine-cosine acceleration strategy, individual butterflies' autonomous search ability and convergence speed can be further improved. Finally, according to the fitness value of each butterfly individual, the moving step length and direction of the butterfly individual are automatically adjusted better to fit the actual search process of the butterfly individual, increase the search ability in the global range, and avoid the algorithm from falling into the local optimum. To verify the algorithm's effectiveness, 18 high-dimensional classification numbers are selected to carry out simulation and comparison experiments between HBOA-SCV and traditional BOA algorithm, five improved BOA algorithms and other comparative algorithms for high-dimensional classification data successively. The experimental results show that the average fitness value and classification accuracy of the HBOA-SCV algorithm are better than the comparison algorithm, thus verifying the superiority of the HBOA-SCV algorithm. © 2024 - IOS Press. All rights reserved.

关键词： Optimization algorithms

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：