检索结果-内蒙古大学图书馆

33rd International Conference on Artificial Neural Networks and Machine Learning (ICANN)

作者： Wang, Chen Zhang, Di Li, Xiaolong Ma, Huifang Li, Zhixin Northwest Normal Univ Coll Comp Sci & Engn Lanzhou Peoples R China Key Lab Cloud Comp Gansu Prov Lanzhou Peoples R China Guangxi Normal Univ Key Lab Educ Blockchain & Intelligent Technol Minist Educ Guilin Peoples R China

ISBN: (纸本)9783031723377;9783031723384

Mining accurate Class Activation Maps (CAMs) is essential for weakly-supervised semantic segmentation (WSSS). However, the CAMs only activate the most discriminative semantic regions, which can severely affect the segmentation results. Motivated by the observation that local image can capture more details, we propose a Dual-view Label Re-assignment (DLR) framework aiming at the problems of incomplete objects and unclear boundaries in pseudo-labels. Specifically, we first extract comprehensive features and intricate features from global views and local views. In order to take advantage of these features, we further incorporate two additional components, Local View Constraint (LVC) and Foreground-Background Contrast (FBC). LVC facilitates the complementary learning of global and local features through feature transfer loss. FBC enhances boundaries by intensifying the distinction between foreground and background. Experiments show the DLR achieve 1.5% and 3.9% mIoU improvements compared with other method on the validation set of PASCAL VOC 2012 and MS COCO 2014, respectively.

关键词： weakly-supervised semantic segmentation Class Activation Maps Label Re-assignment

来源：评论

学校读者我要写书评

暂无评论

weakly-supervised semantic segmentation with Visual Words Learning and Hybrid Pooling

引用

INTERNATIONAL JOURNAL OF COMPUTER VISION 2022年第4期130卷 1127-1144页

作者： Ru, Lixiang Du, Bo Zhan, Yibing Wu, Chen Wuhan Univ Natl Engn Res Ctr Multimedia Software Inst Artificial Intelligence Sch Comp Sci Wuhan Peoples R China Wuhan Univ Hubei Key Lab Multimedia & Network Commun Engn Wuhan Peoples R China JD Explore Acad JDcom Beijing Peoples R China Wuhan Univ LIESMARS Wuhan Peoples R China

weakly-supervised semantic segmentation (WSSS) methods with image-level labels generally train a classification network to generate the Class Activation Maps (CAMs) as the initial coarse segmentation labels. However, current WSSS methods still perform far from satisfactorily because their adopted CAMs (1) typically focus on partial discriminative object regions and (2) usually contain useless background regions. These two problems are attributed to the sole image-level supervision and aggregation of global information when training the classification networks. In this work, we propose the visual words learning module and hybrid pooling approach, and incorporate them in classification network to mitigate the above problems. In visual words learning module, we counter the first problem by enforcing the classification network to learn fine-grained visual word labels so that more object extents could be discovered. Specifically, the visual words are learned with a codebook, which could be updated via two proposed strategies, i.e. learning-based strategy and memory-bank strategy. The second drawback of CAMs is alleviated with the proposed hybrid pooling, which incorporates the global average and local discriminative information to simultaneously ensure object completeness and reduce background regions. We evaluated our methods on PASCAL VOC 2012 and MS COCO 2014 datasets. Without any extra saliency prior, our method achieved 70.6% and 70.7% mIoU on the val and test set of PASCAL VOC dataset, respectively, and 36.2% mIoU on the val set of MS COCO dataset, which significantly surpassed the performance of state-of-the-art WSSS methods.

关键词： weakly-supervised semantic segmentation Visual words learning Hybrid pooling semantic segmentation

来源：评论

学校读者我要写书评

暂无评论

Adversarial Learning of Object-Aware Activation Map for weakly-supervised semantic segmentation

引用

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 2023年第8期33卷 3935-3946页

作者： Chen, Junliang Lu, Weizeng Li, Yuexiang Shen, Linlin Duan, Jinming Shenzhen Univ Comp Vis Inst Sch Comp Sci & Software Engn Shenzhen 518060 Peoples R China Shenzhen Univ Shenzhen Inst Artificial Intelligence & Robot Soc Guangdong Key Lab Intelligent Informat Proc Shenzhen 518060 Peoples R China Tencent Jarvis Lab Shenzhen 518057 Peoples R China Univ Birmingham Sch Comp Sci Birmingham B15 2TT England

Recent years have witnessed impressive advances in the area of weakly-supervised semantic segmentation (WSSS). However, most of existing approaches are based on class activation maps (CAMs), which suffer from the under-segmentation problem (i.e., objects of interest are segmented partially). Although a number of literature works have been proposed to tackle this under-segmentation problem, we argue that these solutions built on CAMs may not be optimal for the WSSS task. Instead, in this paper we propose a network based on the object-aware activation map (OAM). The proposed network, termed OAM-Net, consists of four loss functions (foreground loss, background loss, average pixel and consistency loss) which ensure exactness, completeness, compactness and consistency of segmented objects via adversarial training. Compared to conventional CAM-based methods, our OAM-Net overcomes the under-segmentation drawback and significantly improves segmentation accuracy with negligible computational cost. A thorough comparison between OAM-Net and CAM-based approaches is carried out on the PASCAL VOC2012 dataset, and experimental results show that our network outperforms state-of-the-art approaches by a large margin. The code will be available soon.

关键词： weakly-supervised semantic segmentation class activation map object-aware activation map

来源：评论

学校读者我要写书评

暂无评论

IMAGE AUGMENTATION WITH CONTROLLED DIFFUSION FOR weakly-supervised semantic segmentation 49

IMAGE AUGMENTATION WITH CONTROLLED DIFFUSION FOR WEAKLY-SUPE...

引用

49th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

作者： Wu, Wangyu Dai, Tianhong Huang, Xiaowei Ma, Fei Xiao, Jimin Univ Liverpool Xian Jiaotong Liverpool Merseyside England Univ Liverpool Liverpool Merseyside England Univ Aberdeen Aberdeen Scotland

ISBN: (纸本)9798350344868;9798350344851

weakly-supervised semantic segmentation (WSSS), which aims to train segmentation models solely using image-level labels, has achieved significant attention. Existing methods primarily focus on generating high-quality pseudo labels using available images and their image-level labels. However, the quality of pseudo labels degrades significantly when the size of available dataset is limited. Thus, in this paper, we tackle this problem from a different view by introducing a novel approach called Image Augmentation with Controlled Diffusion (IACD). This framework effectively augments existing labeled datasets by generating diverse images through controlled diffusion, where the available images and image-level labels are served as the controlling information. Moreover, we also propose a high-quality image selection strategy to mitigate the potential noise introduced by the randomness of diffusion models. In the experiments, our proposed IACD approach clearly surpasses existing state-of-the-art methods. This effect is more obvious when the amount of available data is small, demonstrating the effectiveness of our method.

关键词： weakly-supervised semantic segmentation diffusion model high-quality image selection

来源：评论

学校读者我要写书评

暂无评论

Global Consistency Enhancement Network for weakly-supervised semantic segmentation 6th

Global Consistency Enhancement Network for Weakly-Supervised...

引用

6th Chinese Conference on Pattern Recognition and Computer Vision (PRCV)

作者： Jiang, Le Yang, Xinhao Ma, Liyan Li, Zhenglin Shanghai Univ Sch Comp Engn & Sci Shanghai Peoples R China Shanghai Univ Sch Artificial Intelligence Shanghai Peoples R China

ISBN: (纸本)9789819985456;9789819985463

Generation methods for reliable class activation maps (CAMs) are essential for weakly-supervised semantic segmentation. These methods usually face the challenge of incomplete and inaccurate CAMs due to intra-class inconsistency of final features and inappropriate use of deep-level ones. To alleviate these issues, we propose the Global Consistency Enhancement Network (GCENet) that consists of Middle-level feature Auxiliary Module (MAM), Intra-class Consistency Enhancement Module (ICEM), and Critical Region Suppression Module (CRSM). Specifically, MAM uses middle-level features which carry clearer edges information and details to enhance output features. Then, for the problem of incomplete class activation maps caused by the high variance of local context of the image, ICEM is proposed to enhance the representation of features. It takes into account the intra-class global consistency and the local particularity. Furthermore, CRSM is proposed to solve the problem of excessive CAMs caused by the over-activation of features. It activates the low-discriminative regions appropriately, thus improving the quality of class activation maps. Through our comprehensive experiments, our method outperforms all other competitors and well demonstrates its effectiveness on the PASCAL VOC2012 dataset.

关键词： weakly-supervised semantic segmentation semantic segmentation Intra-class consistency

来源：评论

学校读者我要写书评

暂无评论

Spatial Frequency-Aware Self-Distillation for weakly-supervised semantic segmentation

Spatial Frequency-Aware Self-Distillation for Weakly-Supervi...

引用

2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025

作者： Fang, Jingyuan Ning, Yang Nie, Xiushan School of Computer Science and Technology Shandong Jianzhu University Jinan China

ISBN: (纸本)9798350368741

weakly-supervised semantic segmentation (WSSS) aims to achieve pixel-level classification under image-level supervision. Recent class activation map (CAM)-based methods seek to expand foreground activation while suppressing background. However, they often overlook the uncertainty of CAM, where non-salient activation in some regions complicates semantic classification. These regions are typically dismissed as noise, resulting in inappropriate activations due to inadequate regularization. To resolve this, we introduce a Spatial Frequency-Aware Self-Distillation strategy (SFS). Firstly, to enhance the perception of high-frequency spatial information in uncertain regions, we propose a boundary self-distillation and uncertain region reconstruction strategy, which captures high-frequency boundary information and fine-grained spatial context in these regions. Secondly, to enhance the discrimination of low-frequency semantic features, we propose a contrastive attention mechanism that guides the Vision Transformer (ViT) to focus more on the foreground, thereby improving the distinction between foreground and background. Finally, our SFS demonstrates outstanding performance on both the VOC 2012 and COCO 2014 datasets, attributed to its superior spatial frequency perception capabilities. The code is available at https://***/fjoybest/SFS. © 2025 IEEE.

关键词： self-distillation spatial frequency awareness weakly-supervised semantic segmentation

来源：评论

学校读者我要写书评

暂无评论

Co-attention dictionary network for weakly-supervised semantic segmentation

引用

NEUROCOMPUTING 2022年第0期486卷 272-285页

作者： Wan, Weitao Chen, Jiansheng Yang, Ming-Hsuan Ma, Huimin Tsinghua Univ Dept Elect Engn Beijing 100084 Peoples R China Univ Calif Merced Merced CA USA Tsinghua Univ Beijing Natl Res Ctr Informat Sci & Technol Beijing 100084 Peoples R China Univ Sci & Technol Beijing Sch Comp & Commun Engn Beijing 100083 Peoples R China

In this paper, we propose the co-attention dictionary network (CODNet) for weakly-supervised semantic segmentation using only image-level class labels. The CODNet model exploits extra semantic information by jointly leveraging a pair of samples with common semantics through co-attention rather than processing them independently. The inter-sample similarities of spatially distributed deep features are computed to merge reference features through non-local connections. To discover similar patterns regardless of appearance variations, we propose to extract image representations by equipping the neural networks with dictionary learning which provides the universal basis elements for different images. Based on the CODNet model, we propose a multi-reference class activation map (MR-CAM) algorithm which generates semantic segmentation masks for a target image by jointly merging semantic cues from multiple reference images. Experimental results on the PASCAL VOC 2012 and MSCOCO benchmark data sets for weakly-supervised semantic segmentation show that the proposed algorithm performs favorably against the state-of-the-art methods.(c) 2021 Elsevier B.V. All rights reserved.

关键词： weakly-supervised semantic segmentation Dictionary learning Co-attention

来源：评论

学校读者我要写书评

暂无评论

Self-Attention Prediction Correction with Channel Suppression for weakly-supervised semantic segmentation

Self-Attention Prediction Correction with Channel Suppressio...

引用

IEEE International Conference on Multimedia and Expo (ICME)

作者： Sun, Guoying Yang, Meng Sun Yat Sen Univ Sch Comp Sci & Engn Guangzhou Peoples R China Xidian Univ State Key Lab Integrated Serv Networks Xian Peoples R China Sun Yat Sen Univ Minist Educ Key Lab Machine Intelligence & Adv Comp Guangzhou Peoples R China

ISBN: (纸本)9781665468916

Single-stage weakly-supervised semantic segmentation (WSSS) with image-level labels has become a new research hotspot in the community for its lower cost and higher training efficiency. However, the pseudo label of WSSS generally suffers from somewhat noise, which limits the segmentation performance. In this paper, to explore the integral foreground activation, we propose the Channel Suppression (CS) module for preventing only activating the most discriminative regions, thereby improving the initial pseudo labels. To rectify the incorrect prediction, we explore the Self-Attention Prediction Correction (SAPC) module, which adaptively generates the category-wise prediction rectification weights. After extensive experiments, the proposed efficient single-stage framework achieves excellent performance with 67.6% mIoU and 39.9% mIoU on PASCAL VOC 2012 and MS COCO 2014 datasets, significantly exceeding several recent single-stage methods.

关键词： weakly-supervised semantic segmentation image-level label single-stage

来源：评论

学校读者我要写书评

暂无评论

Deconfounded multi-organ weakly-supervised semantic segmentation via causal intervention

引用

INFORMATION FUSION 2024年 108卷

作者： Chen, Kaitao Sun, Shiliang Du, Youtian East China Normal Univ Sch Comp Sci & Technol Shanghai 200062 Peoples R China Shanghai Jiao Tong Univ Dept Automat Shanghai 200240 Peoples R China Xi An Jiao Tong Univ Sch Automat Sci & Engn Xian 710049 Peoples R China

In weakly-supervised semantic segmentation, obtaining the class activation maps for pseudo masks is crucial. Since multiple organs appear in the same medical image, it is reasonable to obtain the activation maps of each organ by the organ-level features instead of the image-level features. The image-level features are decomposed into the organ-level features, yet the prior anatomical knowledge makes a spurious association between the image-level and organ-level features. To this end, we apply the causal intervention to cut off the spurious association and propose a novel deconfounded multi-organ weakly-supervised semantic segmentation (DeMos) method. Based on the original class activation mapping (CAM) method, the model is retrained to learn the deconfounded features of each organ via cross-attention, and we approximate the expectation of the intervention instead of the traditional likelihood. When the model converges, we extract the activation maps by CAM. Our method not only generates high-quality pseudo masks on the CHAOS, ACDC and ProMRI datasets, but is also applicable to other CAM variants. Furthermore, with the refinement, DeMos achieves the dice similarity coefficient of 93.26% on the task of the left ventricle segmentation, which outperforms the state-of-the-art methods.

关键词： Class activation mapping weakly-supervised semantic segmentation Causal inference Deep learning

来源：评论

学校读者我要写书评

暂无评论

A weakly-supervised semantic segmentation Approach Based on the Centroid Loss: Application to Quality Control and Inspection

引用

IEEE ACCESS 2021年 9卷 69010-69026页

作者： Yao, Kai Ortiz, Alberto Bonnin-Pascual, Francisco Univ Balearic Isl Dept Math & Comp Sci Palma De Mallorca 07122 Spain Inst Invest Sanitaria Illes Balears IDISBA Palma De Mallorca 07120 Spain

It is generally accepted that one of the critical parts of current vision algorithms based on deep learning and convolutional neural networks is the annotation of a sufficient number of images to achieve competitive performance. This is particularly difficult for semantic segmentation tasks since the annotation must be ideally generated at the pixel level. weakly-supervised semantic segmentation aims at reducing this cost by employing simpler annotations that, hence, are easier, cheaper and quicker to produce. In this paper, we propose and assess a new weakly-supervised semantic segmentation approach making use of a novel loss function whose goal is to counteract the effects of weak annotations. To this end, this loss function comprises several terms based on partial cross-entropy losses, being one of them the Centroid Loss. This term induces a clustering of the image pixels in the object classes under consideration, whose aim is to improve the training of the segmentation network by guiding the optimization. The performance of the approach is evaluated against datasets from two different industry-related case studies: while one involves the detection of instances of a number of different object classes in the context of a quality control application, the other stems from the visual inspection domain and deals with the localization of images areas whose pixels correspond to scene surface points affected by a specific sort of defect. The detection results that are reported for both cases show that, despite the differences among them and the particular challenges, the use of weak annotations do not prevent from achieving a competitive performance level for both.

关键词： Image segmentation Annotations semantics Tools Inspection Training Surgery Object recognition quality control and inspection weakly-supervised semantic segmentation

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：