The proceedings contain 23 papers. The special focus in this conference is on Deep Generative Models for Medical image Computing and Computer Assisted Intervention. The topics include: Characterizing the Features...
ISBN:
(纸本)9783031537660
The proceedings contain 23 papers. The special focus in this conference is on Deep Generative Models for Medical image Computing and Computer Assisted Intervention. The topics include: Characterizing the Features of Mitotic Figures Using a Conditional Diffusion Probabilistic Model;A 3D Generative Model of Pathological Multi-modal MR images and Segmentations;Rethinking a Unified Generative Adversarial Model for MRI Modality Completion;diffusion Models for Generative Histopathology;shape-Guided Conditional Latent Diffusion Models for Synthesising Brain Vasculature;pre-training with Diffusion Models for Dental Radiography Segmentation;ICoNIK: Generating Respiratory-Resolved Abdominal MR reconstructions Using Neural Implicit Representations in k-Space;ultrasound imagereconstruction with Denoising Diffusion Restoration Models;privacy Distillation: Reducing Re-identification Risk of Diffusion Models;Diffusion Model Based Knee Cartilage Segmentation in MRI;Semantic image Synthesis for Abdominal CT;CT reconstructionfrom Few Planar X-Rays with Application Towards Low-Resource Radiotherapy;Reference-Free Isotropic 3D EM reconstruction Using Diffusion Models;federated Multimodal and Multiresolution Graph Integration for Connectional Brain Template Learning;metrics to Quantify Global Consistency in Synthetic Medical images;MIM-OOD: Generative Masked image Modelling for Out-of-Distribution Detection in Medical images;towards Generalised Neural Implicit Representations for image Registration;investigating data Memorization in 3D Latent Diffusion Models for Medical image Synthesis;ViT-DAE: Transformer-Driven Diffusion Autoencoder for Histopathology image Analysis;anomaly Guided Generalizable Super-Resolution of Chest X-Ray images Using Multi-level Information Rendering;importance of Aligning Training Strategy with Evaluation for Diffusion Models in 3D Multiclass Segmentation.
Brain tumor segmentation extracts meaningful information from various modalities (T1, T1c, T2, and FLAIR), each offering unique insights into brain structure and pathology, essential for accurate tumor subregion delin...
详细信息
In computed tomography (CT) imaging, recent developments in reconstruction algorithm and scan configuration design have provided useful tools for imagereconstructionfromdata collected over a limited-angular range (...
详细信息
ISBN:
(数字)9781510649385
ISBN:
(纸本)9781510649385;9781510649378
In computed tomography (CT) imaging, recent developments in reconstruction algorithm and scan configuration design have provided useful tools for imagereconstructionfromdata collected over a limited-angular range (LAR). In this work, we aim to investigate the impact of angular sampling interval on the accuracy of reconstructionfrom LAR data. In specific, we employ a two-orthogonal-arc scan configuration, and collect datafrom a numerical chest phantom over an LAR with various angular intervals. We then investigate imagereconstruction by using the directional-total-variation (DTV) algorithm and evaluate reconstructions qualitatively and quantitatively. Results show that increased angular sampling interval results in degraded image quality. Results of the simulation study also indicate an appropriate interval for sufficient reconstruction accuracy under specific imaging conditions, which may provide insights for the upper-bound performance of reconstructions in practical use.
The proceedings contain 12 papers. The special focus in this conference is on Graphs in Biomedical image Analysis. The topics include: Graph Neural Networks: A Suitable Alternative to MLPs in Latent 3D Medic...
ISBN:
(纸本)9783031832420
The proceedings contain 12 papers. The special focus in this conference is on Graphs in Biomedical image Analysis. The topics include: Graph Neural Networks: A Suitable Alternative to MLPs in Latent 3D Medical image Classification?;graph Residual Noise Learner Network for Brain Connectivity Graph Prediction;prediction of Radiological Diagnostic Errors from Eye Tracking data Using Graph Neural Networks and Gaze-Guided Transformers;exploring Graphs as data Representation for Disease Classification in Ophthalmology;GAMMA-PD: Graph-Based Analysis of Multi-Modal Motor Impairment Assessments in Parkinson’s Disease;multi-resolution Histopathology Patch Graphs for Ovarian Cancer Subtyping;histoGraphCoarse: Strategizing Graph Coarsening Techniques for Efficient Analysis of Gigapixel Histopathological images;mesh Registration via Geometric Feature Homogenization and Offset Cross-Attention: Application to 3D Photogrammetry;SANGRIA: Surgical Video Scene Graph Optimization for Surgical Workflow Prediction;DVasMesh: Deep Structured Mesh reconstructionfrom Vascular images for Dynamics Modeling of Vessels.
Current deep learning-based manifold learning algorithms such as the variational autoencoder (VAE) require fully sampled data to learn the probability density of real-world datasets. However, fully sampled data is oft...
详细信息
Current deep learning-based manifold learning algorithms such as the variational autoencoder (VAE) require fully sampled data to learn the probability density of real-world datasets. However, fully sampled data is often unavailable in a variety of problems, including the recovery of dynamic and high-resolution magnetic resonance imaging (MRI). We introduce a novel variational approach to learn a manifold from undersampled data. The VAE uses a decoder fed by latent vectors, drawn from a conditional density estimated from the fully sampled images using an encoder. Since fully sampled images are not available in our setting, we approximate the conditional density of the latent vectors by a parametric model whose parameters are estimated from the undersampled measurements using back-propagation. We use the framework for the joint alignment and recovery of multi-slice free breathing and ungated cardiac MRI datafrom highly undersampled measurements. Experimental results demonstrate the utility of the proposed scheme in dynamic imaging alignment and reconstructions.
Multi-modal aerial view image translation involves converting aerial images from one modality to another while preserving basic details and features. These modalities encompass Synthetic Aperture Radar (SAR), Infrared...
详细信息
ISBN:
(纸本)9798350365474
Multi-modal aerial view image translation involves converting aerial images from one modality to another while preserving basic details and features. These modalities encompass Synthetic Aperture Radar (SAR), Infrared (IR), Visible Light (RGB), Electro-Optical (EO), and other image types. Recently, various methods have been proposed to tackle this task, but the focus tends to be on paired image research, overlooking the discrepancies found in aerial images of the same location captured at different times and angles, termed incomplete matching or multi-view image translation. Consequently, we propose MvAV-pix2pixHD to address this issue. For multi-view data sampling, we propose two methods: random sampling and time-priority sampling. Additionally, within the pix2pixHD framework, we introduce an inverse generator to ensure the basic semantic features of the generated images and incorporate three robust loss functions to constrain the authenticity of the generated images. We conduct extensive experiments on two multi-view image translation tasks in the Multi-modal Aerial View imagery Challenge: Translation (MAVIC-T). Experimental results demonstrate the superiority of our proposed method, and we achieved second place in the MAVIC-T competition in the 20th IEEE Workshop on Perception Beyond the Visible Spectrum of the CVPR 2024.
Tasks such as autonomous navigation, 3D reconstruction, and object recognition near the water surfaces are crucial in marine robotics applications. However, challenges arise due to dynamic disturbances, e.g., light re...
详细信息
ISBN:
(纸本)9798350377712;9798350377705
Tasks such as autonomous navigation, 3D reconstruction, and object recognition near the water surfaces are crucial in marine robotics applications. However, challenges arise due to dynamic disturbances, e.g., light reflections and refraction from the random air-water interface, irregular liquid flow, and similar factors, which can lead to potential failures in perception and navigation systems. Traditional computer vision algorithms struggle to differentiate between real and virtual image regions, significantly complicating tasks. A virtual image region is an apparent representation formed by the redirection of light rays, typically through reflection or refraction, creating the illusion of an object's presence without its actual physical location. This work proposes a novel approach for segmentation on real and virtual image regions, exploiting synthetic images combined with domain-invariant information, a Motion Entropy Kernel, and Epipolar Geometric Consistency. Our segmentation network does not need to be re-trained if the domain changes. We show this by deploying the same segmentation network in two different domains: simulation and the real world. By creating realistic synthetic images that mimic the complexities of the water surface, we provide fine-grained training data for our network (MARVIS) to discern between real and virtual images effectively. By motion & geometry-aware design choices and through comprehensive experimental analysis, we achieve state-of-the-art real-virtual image segmentation performance in unseen real world domain, achieving an IoU over 78% and a F-1-Score over 86% while ensuring a small computational footprint. MARVIS offers over 43 FPS (8 FPS) inference rates on a single GPU (CPU core). Our code and dataset are available here https://***/jiayi-wu-umd/MARVIS.
This study evaluates deep-learning and Shape from Silhouette (SfS) methods for 3D reconstruction of smoke plumes. It demonstrates the deep-learning method's superiority in cases with limited camera views and calib...
详细信息
This study evaluates deep-learning and Shape from Silhouette (SfS) methods for 3D reconstruction of smoke plumes. It demonstrates the deep-learning method's superiority in cases with limited camera views and calibration data, achieving high-quality reconstructions of semi-transparent smoke without precise calibration. The research emphasizes the significance of pre-processing and data appearance for neural network efficacy. By improving 3D reconstruction techniques, this work aids in advancing wildfire tracking and environmental analysis, offering a practical approach for real-world applications in fire science.
This paper presents an original approach to the imagereconstruction problem for spiral CT scanners where the multi-source and/or the Flying Focal Spot (FFS) technology is implemented. The geometry of those scanners c...
详细信息
ISBN:
(纸本)9783031425073;9783031425080
This paper presents an original approach to the imagereconstruction problem for spiral CT scanners where the multi-source and/or the Flying Focal Spot (FFS) technology is implemented. The geometry of those scanners causes problems for computed tomography systems based on traditional (FDK) reconstruction methods. Therefore, we propose an original rebinning strategy, where does not occur the problem of non-equiangular X-rays. It is possible to implement this approach in all three types of CT scanners (only Multi-Source, only Flying Focal Spot, mixed Multi-Source with Flying Focal Spot). This approach is divided into two blocks (rebinning and iterative reconstruction procedure). This method is based on statistical model-based iterative reconstruction (MBIR), where the reconstruction problem is formulated as a shift-invariant system (a continuous-to-continuous data model). Our method allows for reducing the X-ray dose absorbed by patients during examinations. The most significant feature of the proposed method is the possibility of parallel implementation using the various GPU graphic card - what we have done for the NVIDIA graphic card. This fact resulted in the acceleration of the calculation and the significantly shortened time for the first reconstructed image.
In the domain of histopathology analysis, existing representation learning methods for biomarkers prediction from whole slide images (WSIs) face challenges due to the complexity of tissue subtypes and label noise prob...
详细信息
In the domain of histopathology analysis, existing representation learning methods for biomarkers prediction from whole slide images (WSIs) face challenges due to the complexity of tissue subtypes and label noise problems. This paper proposed a novel partial-label contrastive representation learning approach to enhance the discrimination of histopathology image representations for fine-grained biomarkers prediction. We designed a partial-label contrastive clustering (PLCC) module for partial-label disambiguation and a dynamic clustering algorithm to sample the most representative features of each category to the clustering queue during the contrastive learning process. We conducted comprehensive experiments on three gene mutation prediction datasets, including USTC-EGFR, BRCA-HER2, and TCGA-EGFR. The results show that our method outperforms 9 existing methods in terms of Accuracy, AUC, and F1 Score. Specifically, our method achieved an AUC of 0.950 in EGFR mutation subtyping of TCGA-EGFR and an AUC of 0.853 in HER2 0/1+/2+/3+ grading of BRCA-HER2, which demonstrates its superiority in fine-grained biomarkers prediction from histopathology whole slide images.
暂无评论