检索结果-内蒙古大学图书馆

IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

作者： Yijiang Li Wentian Cai Ying Gao Chengming Li Xiping Hu School of Computer Science and Engineering South China University of Technology Guangzhou China Guangdong Provincial Key Laboratory of Artificial Intelligence in Medical Image Analysis and Application Guangdong Provincial People’s Hospital Guangdong Academy of Medical Sciences Guangzhou China School of Intelligent Systems Engineering Sun Yat-sen University Shenzhen China School of Medical Technology Beijing Institute of Technology Beijing China

ISBN: (数字)9781665468190

ISBN: (纸本)9781665468206

Medical image segmentation methods downsample images for feature extraction and then upsample them to restore resolution for pixel-level predictions. In such schema, upsample technique is vital in restoring information for better performance. However, existing upsample techniques leverage little information from downsampling paths. The local and detailed feature from the shallower layer such as boundary and tissue texture is crucial in segmentation, especially medical image segmentation. To this end, we propose a novel upsample approach for medical image segmentation, Window Attention Upsample (WAU), which upsamples features conditioned on local and detailed features from downsampling path in local windows by introducing attention decoders of Transformer. WAU could serve as a general upsample method and be incorporated into any segmentation model that possesses lateral connections. We first propose the Attention Upsample which consists of Attention Decoder (AD) and bilinear upsample. AD leverages pixel-level attention to model longrange dependency and global information for a better upsample. Bilinear upsample is introduced as the residual connection to complement the upsampled features. Moreover, considering the extensive memory and computation cost of pixel-level attention, we further design a window attention scheme to restrict attention computation in local windows instead of the global range. We evaluate our method (WAU) on classic UNet structure with lateral connections and achieve state-of-the-art performance on Medical Segmentation Decathlon (MSD) Brain and Automatic Cardiac Diagnosis Challenge (ACDC) datasets. We also validate the effectiveness of our method on multiple classic architectures and achieve consistent improvement.

关键词： image segmentation Visualization image resolution Shape Semantics Computer architecture Transformers

来源：评论

学校读者我要写书评

暂无评论

Uncertainty-guided cross teaching semi-supervised framework for histopathology image segmentation with curriculum self-training

引用

Applied Soft Computing 2025年 180卷

作者： Rui Xu Nan Zhou Siyang Feng Zhenbing Liu Huahu Deng Yajun An Jian Li Chu Han Zaiyi Liu Rushi Lan Cheng Lu Xipeng Pan Guangxi Key Laboratory of Image and Graphic Intelligent Processing Guilin University of Electronic Technology Guilin Guangxi 541004 China Department of Radiology Guangdong Provincial People’s Hospital Guangdong Academy of Medical Sciences Guangzhou Guangdong 510080 China Guangdong Provincial Key Laboratory of Artificial Intelligence in Medical Image Analysis and Application Guangzhou Guangdong 510080 China International Joint Research Laboratory of Spatio-temporal Information and Intelligent Location Services Guilin University of Electronic Technology Guilin Guangxi 541004 China

Histopathology image segmentation is crucial in disease diagnosis, therapeutic response evaluation, and prognosis. However, manually annotating pixel-level labels for histopathology images is both time-consuming and labor-demanding task. In this study, we propose a novel semi-supervised semantic segmentation framework called UTCS ( U ncertainty-guided cross T eaching and C urriculum S elf-training) to address the challenges of limited labeled data. UTCS effectively harnesses the benefits of consistency regularization and self-training in semi-supervised learning. Our approach introduces a mutual consistency network, where one network’s prediction is used as a pseudo mask to supervise the other network and vice versa. Addressing the issue of unreliable pseudo labels, we propose a dynamically re-weighted loss function that leverages uncertainty to perform pixel-level selection during the mutual teaching process, referred to as uncertainty-guided cross teaching. Furthermore, inspiring from curriculum learning, we incorporate an self-training strategy, focusing on image-level selection, that prioritizes reliable images during the re-training stage and aims to generate high-quality pseudo-labels for less reliable images. Extensive experiments on two publicly available histopathology datasets, BCSS and LUAD-HistoSeg, demonstrate the superior performance of our method compared to state-of-the-art semi-supervised semantic segmentation methods.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Shadow Detection Of Moving Objects In Traffic Monitoring video

Shadow Detection Of Moving Objects In Traffic Monitoring Vid...

引用

IEEE Joint International Information technology and Artificial Intelligence Conference (ITAIC)

作者： Mingrui Zhang Wenbing Zhao Xiying Li Dan Wang College of Computer Science and Technology Beijing University of Technology Beijing China Intelligent Transportation Research Center School of Intelligent System Engineering Sun Yat-sen University Guangzhou China Key Laboratory of Intelligent Transportation System of Guangdong Province Guangzhou China Key Laboratory of Video Image Intelligent Analysis and Application Technology Ministry of Public Security Guangzhou China

ISBN: (数字)9781728152448

ISBN: (纸本)9781728152455

Moving object detection is an important application of computer vision. Commonly used foreground separation algorithms such as Gaussian mixture modeling, ViBe, frame difference method, etc., do not consider the color of shadow and recognize the shadow of a moving object as a part of the moving object. In many cases, the shadow detection effect is not good. Focus on the detection of moving object shadows in traffic surveillance videos, this paper improves the existing ViBe algorithm, considers the color characteristics of the shadows, recognizes the shadows as part of the background, gains a smaller amount of calculation and better effect of shadow detection with the advantages of ViBe.

关键词： Surveillance Conferences Color Object detection Traffic control Information technology videos

来源：评论

学校读者我要写书评

暂无评论

More than Encoder: Introducing Transformer Decoder to Upsample

arXiv

引用

arXiv 2021年

作者： Li, Yijiang Cai, Wentian Gao, Ying Li, Chengming Hu, Xiping School of Computer Science and Engineering South China University of Technology Guangzhou China Guangdong Provincial Key Laboratory of Artificial Intelligence in Medical Image Analysis and Application Guangdong Provincial People's Hospital Guangdong Academy of Medical Sciences Guangzhou China School of Intelligent Systems Engineering Sun Yat-sen University Shenzhen China School of Medical Technology Beijing Institute of Technology Beijing China

Medical image segmentation methods downsample images for feature extraction and then upsample them to restore resolution for pixel-level predictions. In such schema, upsample technique is vital in restoring information for better performance. However, existing upsample techniques leverage little information from downsampling paths. The local and detailed feature from the shallower layer such as boundary and tissue texture is particularly more important in medical segmentation compared with natural image segmentation. To this end, we propose a novel upsample approach for medical image segmentation, Window Attention Upsample (WAU), which upsamples features conditioned on local and detailed features from downsampling path in local windows by introducing attention decoders of Transformer. WAU could serve as a general upsample method and be incorporated into any segmentation model that possesses lateral connections. We first propose the Attention Upsample which consists of Attention Decoder (AD) and bilinear upsample. AD leverages pixel-level attention to model long-range dependency and global information for a better upsample. Bilinear upsample is introduced as the residual connection to complement the upsampled features. Moreover, considering the extensive memory and computation cost of pixel-level attention, we further design a window attention scheme to restrict attention computation in local windows instead of the global range. We evaluate our method (WAU) on classic U-Net structure with lateral connections and achieve state-of-the-art performance on Synapse multi-organ segmentation, Medical Segmentation Decathlon (MSD) Brain, and Automatic Cardiac Diagnosis Challenge (ACDC) datasets. We also validate the effectiveness of our method on multiple classic architectures and achieve consistent improvement. Copyright © 2021, The Authors. All rights reserved.

关键词： Semantic Segmentation

来源：评论

学校读者我要写书评

暂无评论

Weakly supervised histopathology tissue semantic segmentation with multi-scale voting and online noise suppression

引用

Engineering applications of Artificial Intelligence 2025年 156卷

作者： Pan, Xipeng Zhang, Hualong Deng, Huahu Wang, Huadeng Li, Lingqiao Liu, Zhenbing Wang, Lin An, Yajun Lu, Cheng Liu, Zaiyi Han, Chu Lan, Rushi School of Computer Science and Information Security Guilin University of Electronic Technology Guangxi Guilin541004 China Southern Medical University Guangdong Guangzhou510080 China Guangdong Provincial Key Laboratory of Artificial Intelligence in Medical Image Analysis and Application Guangdong Provincial People's Hospital Guangdong Guangzhou510080 China Department of Nasopharyngeal Carcinoma Sun Yat-sen University Cancer Center Guangzhou510060 China International Joint Research Laboratory of Spatio-temporal Information and Intelligent Location Services Guilin University of Electronic Technology Guangxi Guilin541004 China

The development of an Artificial Intelligence (AI) assisted tissue segmentation method of digital pathology images is critical for cancer diagnosis and prognosis. Excellent performance has been achieved with the current fully supervised segmentation approach, which relies on a huge number of annotated data. However, drawing dense pixel-level annotations on the giga-pixel whole slide image (WSI) is extremely time-consuming and labor-intensive. To this end, we propose a tissue segmentation method using only patch-level classification labels to reduce such annotation burden and significantly improve the quality of the pseudo-masks. We introduce a framework with two phases of classification and segmentation. In the classification phase, we propose a multi-scale voting method on the Class Activation Map (CAM) based model to obtain more stable pseudo masks. In the segmentation phase, an Online Noise Suppression Strategy (ONSS) is proposed to encourage the model to focus on more reliable signals in the pseudo mask rather than noisy signals. Extensive experiments on two weakly supervised pathology image tissue segmentation datasets Lung Adenocarcinoma (LUAD-HistoSeg) and Breast Cancer Semantic Segmentation (BCSS-WSSS) demonstrate our model outperforms state-of-the-art weakly-supervised semantic segmentation (WSSS) methods using patch-level labels. Furthermore, our method exhibits superior generalization ability compared to other models, and demonstrates promising adaptation performance on unseen domains with only small amounts of data. © 2025 Elsevier Ltd

关键词： Semantic Segmentation

来源：评论

学校读者我要写书评

暂无评论

A Car Face Parts Detection Algorithm Based on Faster R-CNN 18

A Car Face Parts Detection Algorithm Based on Faster R-CNN

引用

18th COTA International Conference of Transportation Professionals: Intelligence, Connectivity, and Mobility, CICTP 2018

作者： Zhou, Zhihao Li, Xiying Qiu, Mingkai Research Center of Intelligent Transportation System School of Engineering Sun Yat-sen Univ. Guangzhou Guangdong510006 China Guangdong Provincial Key Laboratory of Intelligent Transportation System Guangzhou Guangdong510006 China Key Laboratory of Video and Image Intelligent Analysis and Application Technology Ministry of Public Security Guangzhou Guangdong510006 China

ISBN: (数字)9780784481523

ISBN: (纸本)9780784481523

The main appearance difference between different types of vehicles is located in the front face area, so the car face parts detection is a key role in fine-grained vehicle recognition. This paper presents a faster R-CNN-based method to detect the position and identify each part of vehicle front face in a complex environment. First, the object information is carried out by K-means clustering and the feature is extracted by VGG-16 network. Second, the candidate regions are obtained by region proposal network (RPN), then uses the Fast R-CNN to obtain the categories and location information of vehicle front face parts. In this paper, 4199 vehicle images of CompCars dataset were used for network training and testing. The experimental results show that the average IoU of the vehicle front face parts is 74.97% and the average recognition precision is 89.59%. Compared with other object detection algorithms, the proposed algorithm shows excellent performance in detecting ability and recognition effect. © 2018 American Society of Civil Engineers.

关键词： Vehicles

来源：评论

学校读者我要写书评

暂无评论

Robust unsupervised feature selection by nonnegative sparse subspace learning

引用

Neurocomputing 2019年 334卷 156-171页

作者： Wei Zheng Hui Yan Jian Yang School of Computer Science and Engineering Nanjing University of Science and Technology Nanjing PR China School of Computer Engineering Jinling Institute of Technology Nanjing PR China PCA Lab Key Lab of Intelligent Perception and Systems for High-Dimensional Information of Ministry of Education and Jiangsu Key Lab of Image and Video Understanding for Social Security School of Computer Science and Engineering Nanjing University of Science and Technology Key Laboratory of Trusted Cloud Computing and Big Data Analysis Nanjing Xiaozhuang University Nanjing PR China

Sparse subspace learning has been demonstrated to be effective in data mining and machine learning. In this paper, we cast the unsupervised feature selection scenario as a matrix factorization problem from the viewpoint of sparse subspace learning. By minimizing the reconstruction residual, the learned feature weight matrix with the l 2,1 -norm and the non-negative constraints not only removes the irrelevant features, but also captures the underlying low dimensional structure of the data points. Meanwhile in order to enhance the model's robustness, l 1 -norm error function is used to resistant to outliers and sparse noise. An efficient iterative algorithm is introduced to optimize this non-convex and non-smooth objective function and the proof of its convergence is given. Although, there is a subtraction item in our multiplicative update rule, we validate its non-negativity. The superiority of our model is demonstrated by comparative experiments on various original datasets with and without malicious pollution.

关键词： Subspace learning Non-negative matrix factorization Unsupervised feature selection

来源：评论

学校读者我要写书评

暂无评论

Consensus algorithms for biased labeling in crowdsourcing

引用

Information Sciences 2017年 382-383卷 254-273页

作者： Jing Zhang Victor S. Sheng Qianmu Li Jian Wu Xindong Wu School of Computer Science and Engineering Nanjing University of Science and Technology Nanjing 210094 P. R. China Key Laboratory of Image and Video Understanding for Social Safety Nanjing University of Science and Technology Nanjing 210094 P. R. China Department of Computer Science University of Central Arkansas Conway AR 72035 USA Jiangsu Engineering Center of Network Monitoring Nanjing University of Information Science and Technology Nanjing 210044 P. R. China Institute of Intelligent Information Processing and Application Soochow University Suzhou 215006 P. R. China School of Computing and Informatics University of Louisiana at Lafayette LA 70504 USA

Although it has become an accepted lay view that when labeling objects through crowdsourcing systems, non-expert annotators often exhibit biases, this argument lacks sufficient evidential observation and systematic empirical study. This paper initially analyzes eight real-world datasets from different domains whose class labels were collected from crowdsourcing systems. Our analyses show that biased labeling is a systematic tendency for binary categorization; in other words, for a large number of annotators, their labeling qualities on the negative class (supposed to be the majority) are significantly greater than are those on the positive class (minority). Therefore, the paper empirically studies the performance of four existing EM-based consensus algorithms , DS, GLAD, RY, and ZenCrowd, on these datasets. Our investigation shows that all of these state-of-the-art algorithms ignore the potential bias characteristics of datasets and perform badly although they model the complexity of the systems. To address the issue of handling biased labeling, the paper further proposes a novel consensus algorithm, namely adaptive weighted majority voting (AWMV), based on the statistical difference between the labeling qualities of the two classes. AWMV utilizes the frequency of positive labels in the multiple noisy label set of each example to obtain a bias rate and then assigns weights derived from the bias rate to negative and positive labels. Comparison results among the five consensus algorithms (AWMV and the four existing) show that the proposed AWMV algorithm has the best overall performance. Finally, this paper notes some potential related topics for future study.

关键词：

来源：评论

学校读者我要写书评

暂无评论

A model based method of pedestrian abnormal behavior detection in traffic scene 1

A model based method of pedestrian abnormal behavior detecti...

引用

1st IEEE International Smart Cities Conference, ISC2 2015

作者： Qianyin, Jiang Guoming, Li Jinwei, Yu Xiying, Li Research Center of Intelligent Transportation System School of Engineering Sun Yat-sen University Guangzhou China Guangdong Provincial Key Laboratory of Intelligent Transportation System Guangzhou China Key Laboratory of Video and Image Intelligent Analysis and Application Technology of MPS Guangzhou China

ISBN: (纸本)9781467365529

In order to reduce traffic accidents caused by the pedestrian, five kinds of dangerous pedestrian abnormal behaviors are studied in the paper. A behavior model between the pedestrian trajectory and the road is built to describe the five kinds of dangerous pedestrian abnormal behaviors: crossing road border, illegal stay, crossing the road, moving along the curb, entering road area. The method contains pedestrian detection, shadow elimination, pedestrian recognition, pedestrian tracking and abnormal behavior detection. Background subtraction method is used to detect moving targets. After shadow elimination, pedestrians are distinguished from vehicles according to the ratio. Then, pedestrian trajectories are gotten by pedestrian tracking. Finally, based on the relation between trajectory and road, the model of five kinds of pedestrian abnormal behaviors is established, and abnormal behaviors are detected according this model. Experiments show that the method can distinguish and detect the pedestrian abnormal behaviors effectively in short time, and it is suitable to use in real time traffic monitoring. © 2015 IEEE.

关键词： Trajectories

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：