检索结果-内蒙古大学图书馆

1st IEEE International Conference on Medical Artificial Intelligence, MedAI 2023

作者： Xu, Xin Lu, Wenjing Lei, Jiahao Qiu, Peng Shen, Hong-Bin Yang, Yang Shanghai Jiao Tong University Key Lab. of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering Department of Computer Science and Engineering Shanghai200240 China Shanghai Ninth People's Hospital Shanghai Jiao Tong University School of Medicine Department of Vascular Surgery China Shanghai Jiao Tong University Institute of Image Processing and Pattern Recognition Key Laboratory of System Control and Information Processing Ministry of Education of China Shanghai200240 China

ISBN: (纸本)9798350358780

Interactive medical image segmentation methods have become increasingly popular in recent years. These methods combine manual labeling and automatic segmentation, reducing the workload of annotation while maintaining high accuracy. However, most current interactive segmentation frameworks are limited to 2D image data, and are not suitable for 3D image data due to the large size and high complexity of 3D data, as well as the challenges posed by information asymmetry and sparse annotation. In this paper, we propose SliceProp, an interactive segmentation framework that implements slice-wise Label Bidirectional Propagation (LBP) for 3D medical image segmentation. SliceProp extends the interactive 2D image segmentation algorithm to 3D image segmentation, and can handle 3D data with large size and high complexity. Moreover, equipped with a Backtracking Feedback Check (BFC) module, SliceProp effectively addresses the issues of information asymmetry and spatial sparse annotation in 3D medical image segmentation. Additionally, we adopt an uncertainty-based criterion to pri-oritize the slices to be refined interactively, which enhances the efficiency of the interaction process by enabling the model to focus on the regions with the most unreliable predictions. SliceProp is evaluated on two datasets and achieves promising results compared to state-of-the-art methods. © 2023 IEEE.

关键词： Medical imaging

来源：评论

学校读者我要写书评

暂无评论

Consistency-Guided Adaptive Alternating Training for Semi-Supervised Salient Object Detection

引用

IEEE Transactions on Circuits and Systems for Video Technology 2025年

作者： Chen, Liyuan Liu, Wei Wang, Hua Jeon, Sang-Woon Jiang, Yunliang Zheng, Zhonglong Zhejiang Normal University School of Computer Science and Technology Jinhua321004 China Shanghai Jiao Tong University Institute of Image Processing and Pattern Recognition Department of Automation Shanghai200240 China Victoria University Institute for Sustainable Industries and Liveable Cities College of Engineering and Science MelbourneVIC8001 Australia Hanyang University Department of Electrical and Electronic Engineering Ansan Korea Republic of

This paper presents a novel approach that leverages two models to integrate features from numerous unlabeled images, addressing the challenge of semi-supervised salient object detection (SSOD). Unlike conventional methods that rely on selecting high-quality pseudo labels, our method identifies the model that produces consistent predictions for original images and their color transformation versions from two models to infer reliable pseudo labels for all unlabeled images, improving the diversity of the training set. Specifically, we propose adaptive selection indicators to quantify prediction differences and guide the updates of the two models using the unlabeled set alternatively. Initially, two models used in our framework are trained on the labeled set. Once the adaptive selection indicator conditions are satisfied, one model is designated as the proxy, generating pseudo labels, while the other serves as the saliency model, which is further trained using these pseudo labels. Subsequently, the updated saliency model optimizes the proxy model's parameters according to another adaptive selection indicator. Experimental results and ablation studies on six benchmark salient object detection datasets confirm the effectiveness and robustness of our method. Our approach achieves performance comparable to recent fully supervised methods while using only one eighth of the labeled data, demonstrating its potential for efficient and scalable SSOD. © 2025 IEEE.

关键词： Self-supervised learning

来源：评论

学校读者我要写书评

暂无评论

2M3DF: Advancing 3D Industrial Defect Detection with Multi Perspective Multimodal Fusion Network

引用

IEEE Transactions on Circuits and Systems for Video Technology 2025年

作者： Asad, Mujtaba Azeem, Waqar Jiang, He Mustafa, Hafiz Tayyab Yang, Jie Liu, Wei Shanghai Jiao Tong University Institute of Image Processing and Pattern Recognition Department of Automation Shanghai200240 China Lahore Garrison University Department of Software Engineering Lahore54000 Pakistan China University of Mining and Technology School of Information and Control Engineering Jiangsu Xuzhou221116 China Zhejiang Normal University School of Computer Science and Technology Jinhua321004 China

In the context of Industrial Anomaly Detection (IAD), ensuring the quality of manufactured products is critical. Traditional 2D based methods often fail to capture anomalies present in complex 3D shapes. For effective anomaly detection in 3D shapes, it is essential to incorporate global semantic context, local geometric structure, and color information of the object. To fully leverage these features, we propose a network named 2M3DF, that leverages knowledge from multi-view RGB images and corresponding point cloud information for enhanced anomaly detection performance. Our model initially employs pre-trained feature extractors that generate local features from multi-view RGB images and corresponding point clouds. The novel inter-modality feature representation and fusion module first adapts these inter-modality features and then effectively aligns and aggregates these multimodality features on a pixel-to-point basis. To learn the normality from point-wise fused multimodal features, we fit a multivariate Gaussian distribution to model the normal feature distribution. Comprehensive experimental evaluations using the MVTec3D-AD and Eyecandies dataset validate the effectiveness of our propose model and demonstrate significant improvements in comparison to existing state-of-the-art methods. Our model achieves a 96.6% mean I-AUROC while delivering real-time results. © 1991-2012 IEEE.

关键词： Normal distribution

来源：评论

学校读者我要写书评

暂无评论

Noise Tolerance of Linear vs Non-Linear LiDAR Based Ego-Motion Drift Correction Methods

Noise Tolerance of Linear vs Non-Linear LiDAR Based Ego-Moti...

引用

IEEE International Conference on Intelligent computer Communication and processing (ICCP)

作者： Corvin-Petruț Cobârzan Cătălin-Cosmin Golban Sergiu Nedevschi Computer Science Department Technical University of Cluj-Napoca Romania Image Processing and Pattern Recognition Group Technical University of Cluj-Napoca Romania

ISBN: (纸本)9781665464383

We have previously proposed a linear approach for reducing the global drift of a video-based frame-to-frame trajectory estimation method by correcting it at selected points in time based on the alignment of past and current 3D LiDAR measurements (see [7]). In this paper we assess the tolerance to noise of a series of methods derived from the one previously proposed, this time using both linear and non-linear optimization methods to calculate the correction transform. We generate synthetic datasets with various noise pollution levels and assess the performance of each method under investigation in recovering artificially induced odometry estimation errors.

关键词： Visualization Laser radar Three-dimensional displays Pollution Optimization methods Transforms Time measurement

来源：评论

学校读者我要写书评

暂无评论

Generating Cartoon images from Face Photos with Cycle-Consistent Adversarial Networks

引用

computers, Materials & Continua 2021年第11期69卷 2733-2747页

作者： Tao Zhang Zhanjie Zhang Wenjing Jia Xiangjian He Jie Yang School of Artificial Intelligence and Computer Science Jiangnan UniversityWuxi214000China Key Laboratory of Artificial Intelligence Jiangsu214000China The Global Big Data Technologies Centre University of Technology SydneyUltimoNSW2007Australia The Institute of Image Processing and Pattern Recognition Shanghai Jiao Tong UniversityShanghai201100China

The generative adversarial network(GAN)is first proposed in 2014,and this kind of network model is machine learning systems that can learn to measure a given distribution of data,one of the most important applications is style *** transfer is a class of vision and graphics problems where the goal is to learn the mapping between an input image and an output ***-GAN is a classic GAN model,which has a wide range of scenarios in style *** its unsupervised learning characteristics,the mapping is easy to be learned between an input image and an output ***,it is difficult for CYCLE-GAN to converge and generate high-quality *** order to solve this problem,spectral normalization is introduced into each convolutional kernel of the *** convolutional kernel reaches Lipschitz stability constraint with adding spectral normalization and the value of the convolutional kernel is limited to[0,1],which promotes the training process of the proposed ***,we use pretrained model(VGG16)to control the loss of image content in the position of l1 *** avoid overfitting,l1 regularization term and l2 regularization term are both used in the object loss *** terms of Frechet Inception Distance(FID)score evaluation,our proposed model achieves outstanding performance and preserves more discriminative *** results show that the proposed model converges faster and achieves better FID scores than the state of the art.

关键词： Generative adversarial network spectral normalization Lipschitz stability constraint VGG16 l1 regularization term l2 regularization term Frechet inception distance

来源：评论

学校读者我要写书评

暂无评论

Hybrid Data-Free Knowledge Distillation 39

Hybrid Data-Free Knowledge Distillation

引用

39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025

作者： Tang, Jialiang Chen, Shuo Gong, Chen School of Computer Science and Engineering Nanjing University of Science and Technology China Key Laboratory of Intelligent Perception and Systems for High-Dimensional Information of Ministry of Education China Jiangsu Key Laboratory of Image and Video Understanding for Social Security China Center for Advanced Intelligence Project RIKEN Japan Department of Automation Institute of Image Processing and Pattern Recognition Shanghai Jiao Tong University China

ISBN: (纸本)157735897X

Data-free knowledge distillation aims to learn a compact student network from a pre-trained large teacher network without using the original training data of the teacher network. Existing collection-based and generation-based methods train student networks by collecting massive real examples and generating synthetic examples, respectively. However, they inevitably become weak in practical scenarios due to the difficulties in gathering or emulating sufficient real-world data. To solve this problem, we propose a novel method called Hybrid Data-Free Distillation (HiDFD), which leverages only a small amount of collected data as well as generates sufficient examples for training student networks. Our HiDFD comprises two primary modules, i.e., the teacher-guided generation and student distillation. The teacher-guided generation module guides a Generative Adversarial Network (GAN) by the teacher network to produce high-quality synthetic examples from very few real-world collected examples. Specifically, we design a feature integration mechanism to prevent the GAN from overfitting and facilitate the reliable representation learning from the teacher network. Meanwhile, we drive a category frequency smoothing technique via the teacher network to balance the generative training of each category. In the student distillation module, we explore a data inflation strategy to properly utilize a blend of real and synthetic data to train the student network via a classifier-sharing-based feature alignment technique. Intensive experiments across multiple benchmarks demonstrate that our HiDFD can achieve state-of-the-art performance using 120 times less collected data than existing methods. Copyright © 2025, Association for the Advancement of Artificia Intelligence (***). All rights reserved.

关键词： Personnel training

来源：评论

学校读者我要写书评

暂无评论

Variational Feature Disentanglement for Few-Shot Domain Adaptation

Variational Feature Disentanglement for Few-Shot Domain Adap...

引用

IEEE International Conference on image processing

作者： Weiduo Wang Yun Gu Jie Yang Department of Automation Institute of Image Processing and Pattern Recognition Shanghai Jiao Tong University China Institute of Medical Robotics Shanghai Jiao Tong University China Shanghai Center for Brain Science and Brain-Inspired Technology

In this paper, we focus on the few-shot domain adaptation problem. With limited training data in target domain, a new approach is emerging to acquire the transferable knowledge from the source domain. Previous methods aligned the embedding space between domains by reducing the pair-wise distance. However, these methods are reporting the misalignment and poor generalization. To solve this problem, we propose a variational feature disentanglement framework. The embedding features are explicitly disentangled into domaininvariant and domain-specific components. The distributions of domain-invariant variance are estimated and aligned by the variational inference. For further disentanglement, the domain-invariant and domain-specific components are separated by the orthogonal constraints of subspaces. The experiments on Digits dataset and VisDA-C dataset demonstrate that the proposed method can outperform the state-of-the-art methods.

关键词：

来源：评论

学校读者我要写书评

暂无评论

MBD-Net: Multi-Branch Dilated Convolutional Network With Cyst Discriminator for Renal Multi-Structure Segmentation

MBD-Net: Multi-Branch Dilated Convolutional Network With Cys...

引用

Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

作者： Yusheng Liu Yingjie Zhao Meihuan Wang Yichao Hao Xiuying Wang Lisheng Wang Department of Automation Institute of Image Processing and Pattern Recognition Shanghai Jiao Tong University Shanghai China College of Medicine and Biological Information Engineering Northeastern University Shenyang China School of Computer Science The University of Sydney Sydney NSW Australia

In surgery-based renal cancer treatment, one of the most essential tasks is the three-dimensional (3D) kidney parsing on computed tomography angiography (CTA) images. In this paper, we propose an end-to-end convolutional neural network-based framework to segment multiple renal structures, including kidneys, kidney tumors, arteries, and veins from arterial-phase CT images. Our method consists of two collaborative modules: First, we propose an encoding-decoding network, named Multi-Branch Dilated Convolutional Network (MBD-Net), consisting of residual, hybrid dilated convolutional, and reduced-dimensional convolutional structures, which improves the feature extraction ability with relatively fewer network parameters. Given that renal tumors and cysts have confusing geometric structures, we also design the Cyst Discriminator to effectively distinguish tumors from cysts without labeling information via gray-scale curves and radiographic features. We have quantitatively evaluated our approach on a publicly available dataset from MICCAI 2022 Kidney Parsing for Renal Cancer Treatment Challenge (KiPA2022), with mean Dice similarity coefficient (DSC) as 96.18%, 90.99%, 88.66% and 80.35% for the kidneys, kidney tumors, arteries, and veins respectively, winning the stable and top performance in the *** relevance—The proposed CNN-Based framework can automatically segment 3D kidneys, renal tumors, arteries, and veins for kidney parsing techniques, benefiting surgery-based renal cancer treatment.

关键词：

来源：评论

学校读者我要写书评

暂无评论

MobileUtr: Revisiting the relationship between light-weight CNN and Transformer for efficient medical image segmentation

arXiv

引用

arXiv 2023年

作者： Tang, Fenghe Nian, Bingkun Ding, Jianrui Quan, Quan Yang, Jie Liu, Wei Zhou, S. Kevin School of Biomedical Engineering Suzhou Institute for Advanced Research University of Science and Technology of China China Institute of Image Processing and Pattern Recognition Shanghai Jiao Tong University China School of Computer Science and Technology Harbin Institute of Technology China Institute of Computing Technology China

Due to the scarcity and specific imaging characteristics in medical images, light-weighting Vision Transformers (ViTs) for efficient medical image segmentation is a significant challenge, and current studies have not yet paid attention to this issue. This work revisits the relationship between CNNs and Transformers in lightweight universal networks for medical image segmentation, aiming to integrate the advantages of both worlds at the infrastructure design level. In order to leverage the inductive bias inherent in CNNs, we abstract a Transformer-like lightweight CNNs block (ConvUtr) as the patch embeddings of ViTs, feeding Transformer with denoised, non-redundant and highly condensed semantic information. Moreover, an adaptive Local-Global-Local (LGL) block is introduced to facilitate efficient local-to-global information flow exchange, maximizing Transformer's global context information extraction capabilities. Finally, we build an efficient medical image segmentation model (MobileUtr) based on CNN and Transformer. Extensive experiments on five public medical image datasets with three different modalities demonstrate the superiority of MobileUtr over the state-of-the-art methods, while boasting lighter weights and lower computational cost. Code is available at https://***/FengheTan9/MobileUtr. Copyright © 2023, The Authors. All rights reserved.

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

RBP-Former: Joint Prediction of RNA-protein Binding Sites on Full-length RNA Transcripts for Multiple RBPs

RBP-Former: Joint Prediction of RNA-protein Binding Sites on...

引用

IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

作者： Yichong Li Xiaojian Liu Fan Cheng Xiaoyong Pan Yang Yang Department of Computer Science and Engineering Shanghai Jiao Tong University Shanghai China Institute of Image Processing and Pattern Recognition Shanghai Jiao Tong University Shanghai China Key Laboratory of System Control and Information Processing Ministry of Education of China Shanghai China Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering Shanghai China

ISBN: (数字)9798350386226

ISBN: (纸本)9798350386233

RNA-binding proteins (RBPs) are essential for gene expression, and the complex RNA-protein interaction mechanisms require analysis of global RNA information. Therefore, accurate prediction of RBP binding sites on full-length RNA transcripts is crucial for understanding these mechanisms and their roles in diseases. While machine learning methods can predict RBP binding to RNA fragments, extending this to full-length transcripts presents challenges due to sequence length and data imbalance. In this paper, we introduce RBP-Former, a binding site joint prediction model designed specifically for full-length RNA transcripts that can be used for multiple RBPs. This model processes information at both coarse and fine-grained levels to fully exploit sequence data and its interactions with multiple RBPs. We develop multi-level imbalance learning strategies, achieving favorable results on imbalanced data. Our method outperforms existing methods in predicting binding sites on full-length RNA transcripts for multiple RBPs, demonstrating its effectiveness in handling imbalanced label and sample distributions.

关键词： Proteins Protein engineering Accuracy RNA Machine learning Predictive models Data models Gene expression Bioinformatics Diseases

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：