Real-time ego-motion tracking for endoscope is a significant task for efficient navigation and robotic automation of endoscopy. In this paper, a novel framework is proposed to perform real-time ego-motion tracking for...
详细信息
Weakly Supervised Semantic Segmentation (WSSS) with image-level labels aims to achieve pixel-level predictions using Class Activation Maps (CAMs). Recently, Contrastive Language-image Pre-training (CLIP) has been intr...
详细信息
The accelerated MRI reconstruction poses a challenging ill-posed inverse problem due to the significant undersampling in k-space. Deep neural networks, such as CNNs and ViT, have shown substantial performance improvem...
详细信息
computer-aided Whole Slide image (WSI) classification has the potential to enhance the accuracy and efficiency of clinical pathological diagnosis. It is commonly formulated as a Multiple Instance Learning (MIL) proble...
详细信息
Domain adaptation addresses the challenge of model performance degradation caused by domain gaps. In the typical setup for unsupervised domain adaptation, labeled data from a source domain and unlabeled data from a ta...
Domain adaptation addresses the challenge of model performance degradation caused by domain gaps. In the typical setup for unsupervised domain adaptation, labeled data from a source domain and unlabeled data from a target domain are used to train a target model. However, access to labeled source domain data, particularly in medical datasets, can be restricted due to privacy policies. As a result, research has increasingly shifted to source-free domain adaptation (SFDA), which requires only a pretrained model from the source domain and unlabeled data from the target domain data for adaptation. Existing SFDA methods often rely on domain-specific image style translation and self-supervision techniques to bridge the domain gap and train the target domain model. However, the quality of domain-specific style-translated images and pseudo-labels produced by these methods still leaves room for improvement. Moreover, training the entire model during adaptation can be inefficient under limited supervision. In this paper, we propose a novel SFDA framework to address these challenges. Specifically, to effectively mitigate the impact of domain gap in the initial training phase, we introduce preadaptation to generate a preadapted model, which serves as an initialization of target model and allows for the generation of high-quality enhanced pseudo-labels without introducing extra parameters. Additionally, we propose a data-dependent frequency prompt to more effectively translate target domain images into a source-like style. To further enhance adaptation, we employ a style-related layer fine-tuning strategy, specifically designed for SFDA, to train the target model using the prompted target domain images and pseudo-labels. Extensive experiments on cross-modality abdominal and cardiac SFDA segmentation tasks demonstrate that our proposed method outperforms existing state-of-the-art methods. Our code is available online.
Estimating rigid transformation using noisy correspondences is critical to feature-based point cloud ***,a series of studies have attempted to combine traditional robust model fitting with deep *** them,DHVR proposed ...
详细信息
Estimating rigid transformation using noisy correspondences is critical to feature-based point cloud ***,a series of studies have attempted to combine traditional robust model fitting with deep *** them,DHVR proposed a hough voting-based method,achieving new state-of-the-art ***,we find voting on rotation and translation simultaneously hinders achieving better ***,we proposed a new hough voting-based method,which decouples rotation and translation ***,we first utilize hough voting and a neural network to estimate *** based on good initialization on rotation,we can easily obtain accurate rigid *** experiments on 3DMatch and 3DLoMatch datasets show that our method achieves comparable performances over the state-of-the-art *** further demonstrate the generalization of our method by experimenting on KITTI dataset.
1 Introduction Deep neural networks have exhibited excellent performance in supervised tasks on point clouds,such as classification,segmentation[1]and registration[2].In supervised learning schemes,manual labeling of ...
详细信息
1 Introduction Deep neural networks have exhibited excellent performance in supervised tasks on point clouds,such as classification,segmentation[1]and registration[2].In supervised learning schemes,manual labeling of massive point clouds is needed for model ***,point clouds captured from different scenarios exist inevitable distribution discrepancy,and model trained from one domain always generalize badly in another *** reduce the doamin distribution discrepancy,many studies[3–6]have emerged for point cloud unsupervised domain adaptation(UDA)by learning domain-invariant features,where[5]proposed using adaptive nodes to align the local features between the source and the target domains[3,4],and[6]proposed utilizing self-supervised tasks to help capture highly transferable feature representations.
Protein surface serves as an important representation of protein structure,revealing how protein interacts with other biomolecules to perform its *** forms the basis for pharmaceutical and fundamental biological resea...
详细信息
Protein surface serves as an important representation of protein structure,revealing how protein interacts with other biomolecules to perform its *** forms the basis for pharmaceutical and fundamental biological research[1].Datadriven deep learning methods in protein surface representation face challenges of label scarcity,since labeled data are typically obtained through wet lab experiments.
High-quality whole-slide scanning is expensive, complex, and time-consuming, thus limiting the acquisition and utilization of high-resolution histopathology images in daily clinical work. Deep learning-based single-im...
High-quality whole-slide scanning is expensive, complex, and time-consuming, thus limiting the acquisition and utilization of high-resolution histopathology images in daily clinical work. Deep learning-based single-image super-resolution (SISR) techniques provide an effective way to solve this problem. However, the existing SISR models applied in histopathology images can only work in fixed integer scaling factors, decreasing their applicability. Though methods based on implicit neural representation (INR) have shown promising results in arbitrary-scale super-resolution (SR) of natural images, applying them directly to histopathology images is inadequate because they have unique fine-grained image textures different from natural images. Thus, we propose an Implicit Self-Texture Enhancement-based dual-branch framework (ISTE) for arbitrary-scale SR of histopathology images to address this challenge. The proposed ISTE contains a feature aggregation branch and a texture learning branch. We employ the feature aggregation branch to enhance the learning of the local details for SR images while utilizing the texture learning branch to enhance the learning of high-frequency texture details. Then, we design a two-stage texture enhancement strategy to fuse the features from the two branches to obtain the SR images. Experiments on publicly available datasets, including TMA, HistoSR, and the TCGA lung cancer datasets, demonstrate that ISTE outperforms existing fixed-scale and arbitrary-scale SR algorithms across various scaling factors. Additionally, extensive experiments have shown that the histopathology images reconstructed by the proposed ISTE are applicable to downstream pathology image analysis tasks.
Uncertainty estimation methods are essential for the application of artificial intelligence (AI) models in medicalimage segmentation, particularly in addressing reliability and feasibility challenges in clinical depl...
Uncertainty estimation methods are essential for the application of artificial intelligence (AI) models in medicalimage segmentation, particularly in addressing reliability and feasibility challenges in clinical deployment. Despite their significance, the adoption of uncertainty estimation methods in clinical practice remains limited due to the lack of a comprehensive evaluation framework tailored to their clinical usage. To address this gap, a simulation of uncertainty-assisted clinical workflows is conducted, highlighting the roles of uncertainty in model selection, sample screening, and risk visualization. Furthermore, uncertainty evaluation is extended to pixel, sample, and model levels to enable a more thorough assessment. At the pixel level, the Uncertainty Confusion Metric (UCM) is proposed, utilizing density curves to improve robustness against variability in uncertainty distributions and to assess the ability of pixel uncertainty to identify potential errors. At the sample level, the Expected Segmentation Calibration Error (ESCE) is introduced to provide more accurate calibration aligned with Dice, enabling more effective identification of low-quality samples. At the model level, the Harmonic Dice (HDice) metric is developed to integrate uncertainty and accuracy, mitigating the influence of dataset biases and offering a more robust evaluation of model performance on unseen data. Using this systematic evaluation framework, five mainstream uncertainty estimation methods are compared on organ and tumor datasets, providing new insights into their clinical applicability. Extensive experimental analyses validated the practicality and effectiveness of the proposed metrics. This study offers clear guidance for selecting appropriate uncertainty estimation methods in clinical settings, facilitating their integration into clinical workflows and ultimately improving diagnostic efficiency and patient outcomes.
暂无评论