The conversion of 2D images into 3D images has been a crucial scientific problem. In the industries of computer graphics and animations, medical imaging augmented and virtual reality, and others, reconstruction of 3D ...
详细信息
We propose an image-conditioned diffusion model to estimate high angular resolution diffusion weighted imaging (DWI) from a low angular resolution acquisition. Our model, which we call QID2, takes as input a set of lo...
详细信息
Healthcare industries face challenges when experiencing rare diseases due to limited samples. Artificial Intelligence (AI) communities overcome this situation to create synthetic data which is an ethical and privacy i...
详细信息
Depth completion is the task of generating dense depth images from sparse depth measurements, e.g., LiDARs. Existing unguided approaches fail to recover dense depth images with sharp object boundaries due to depth ble...
详细信息
ISBN:
(纸本)9781665493468
Depth completion is the task of generating dense depth images from sparse depth measurements, e.g., LiDARs. Existing unguided approaches fail to recover dense depth images with sharp object boundaries due to depth bleeding, especially from extremely sparse measurements. State-ofthe-art guided approaches require additional processing for spatial and temporal alignment of multi-modal inputs, and sophisticated architectures for data fusion, making them non-trivial for customized sensor setup. To address these limitations, we propose an unguided approach based on UNet that is invariant to sparsity of inputs. Boundary consistency in reconstruction is explicitly enforced through auxiliary learning on a synthetic dataset with dense depth and depth contour images as targets, followed by fine-tuning on a real-world dataset. With our network architecture and simple implementation approach, we achieve competitive results among unguided approaches on KITTI benchmark and show that the reconstructed image has sharp boundaries and is robust even towards extremely sparse LiDAR measurements.
Domain adaptation is a practicable tool in real world application where there exists data scarcity. Multi-source domain adaptation attracts increasing attention due to its ability to enrich transfer knowledge by combi...
详细信息
The ever-growing field of medical diagnostics is witnessing a paradigm shift with the integration of quantum computing and deep neural networks (DNN). This paper presents a novel approach that harnesses the computatio...
详细信息
Accurate depth reconstruction is vital for numerous applications including autonomous vehicles, virtual reality, and robot perception. However, the depth imaging is challenging because of limited hardware operations, ...
详细信息
ISBN:
(纸本)9781665404358
Accurate depth reconstruction is vital for numerous applications including autonomous vehicles, virtual reality, and robot perception. However, the depth imaging is challenging because of limited hardware operations, resource-constrained limitations, and incompletedata measurements. To address such shortcomings, this paper introduces an imaging model for efficient depth image estimation fromincomplete depth pixels using non-local low-rank (NLLR) and total variation (TV) representations. The motivation is that NLLR is used to model global similar structure among depth patches, and the TV is incorporated to capture the correlations among local depth pixels. We reformulate the problem of depth reconstruction as a regularized least squares minimization problem with the non-local LR and TV regularizers. Furthermore, this paper proposes an iterative algorithm using the alternating direction method of multipliers (ADMM) to solve the optimization model, yielding an estimate of the depth map from far reduced data points. Experimental results on benchmark datasets validate the efficiency of the proposed approach.
Novel view synthesis aims to generate new view images of a given view image collection. Recent attempts address this problem relying on 3D geometry priors (e.g., shapes, sizes, and positions) learned from multi-view i...
详细信息
ISBN:
(纸本)9798350353006
Novel view synthesis aims to generate new view images of a given view image collection. Recent attempts address this problem relying on 3D geometry priors (e.g., shapes, sizes, and positions) learned from multi-view images. However, such methods encounter the following limitations: 1) they require a set of multi-view images as training data for a specific scene (e.g., face, car or chair), which is often unavailable in many real-world scenarios;2) they fail to ex-tract the geometry priors from single-view images due to the lack of multi-view supervision. In this paper, we propose a Geometry-enhanced NeRF (G-NeRF), which seeks to enhance the geometry priors by a geometry-guided multi-view synthesis approach, followed by a depth-aware train-ing. In the synthesis process, inspired that existing 3D GAN models can unconditionally synthesize high-fidelity multi-view images, we seek to adopt off-the-shelf 3D GAN models, such as EG3D, as a free source to provide geometry priors through synthesizing multi-view data. Simultaneously, to further improve the geometry quality of the synthetic data, we introduce a truncation method to effectively sample latent codes within 3D GAN models. To tackle the absence of multi-view supervision for single-view images, we design the depth-aware training approach, incorporating a depth-aware discriminator to guide geometry priors through depth maps. Experiments demonstrate the effectiveness of our method in terms of both qualitative and quantitative results.
image inpainting is a widely used technique for reconstructing damaged or missing portions of images, with applications in image editing, object removal, and image restoration. This paper proposes a novel approach to ...
详细信息
Unsupervised anomaly detection methods are at the forefront of industrial anomaly detection efforts and have made notable progress. Previous work primarily used 2D information as input, while multi-modal industrial an...
详细信息
暂无评论