The visual stimulus-oriented brain activity reconstruction method based on BCI technology has the advantages of high transmission rate and easy operation. However, the problem of losing feature information due to vect...
详细信息
Most of the scanning multispectral datafrom domestic land observation satellites only provide the imaging geometry of the scene center, missing the geometry of other pixels. This paper proposes a method for the recon...
详细信息
Protecting sensitive images is nowadays a key issue in information security domain. As such, numerous techniques have emerged to securely transmit or store such multimedia data, as encryption, steganography or secret ...
详细信息
ISBN:
(纸本)9781665469647
Protecting sensitive images is nowadays a key issue in information security domain. As such, numerous techniques have emerged to securely transmit or store such multimedia data, as encryption, steganography or secret sharing. Most of today's secret image sharing methods relie on the polynomialbased scheme proposed by Shamir. However, some of the shared images distributed to the participants may be noised between their creation and their use to retrieve the secret image. Noise can be added to a shared image during transmission, storage or JPEG compression for example. However, to our knowledge, to date no analysis has been made on the impact of using a noised shared image in the reconstruction process of a secret image. In this paper, we propose a method to correct the errors during the reconstruction of a secret image using noised shared images.
With the increasing availability of remote sensing data and the development of machine learning-based collaborative interpretation techniques, remote sensing image transmission needs to serve both human and machine vi...
详细信息
The rapidly developing non-line-of-sight (NLOS) imaging technology in recent years is capable of intelligent visual perception of concealed targets, holding broad application prospects in security, emergency rescue, a...
详细信息
Thousands of people suffer from cranial injuries every year. They require personalized implants that need to be designed and manufactured before the reconstruction surgery. The manual design is expensive and time-cons...
详细信息
We revisit the well-studied image segmentation problem from a soft labeling perspective: instead of estimating integer labels per pixel indicating a finite set of classes, each pixel is assigned a real number that con...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
We revisit the well-studied image segmentation problem from a soft labeling perspective: instead of estimating integer labels per pixel indicating a finite set of classes, each pixel is assigned a real number that conveys the level of uncertainty in the estimated class label. Soft labels are useful, for example, for subsequent human editing or composition. Specifically, given a set of pre-computed super-pixel labels and feature vectors per pixel, we formulate a convex optimization objective regularized by signal-dependent gradient graph Laplacian regularizers (GGLR), which promotes piecewise planar (PWP) signal reconstruction. Unlike a previous well-known soft segmentation scheme that requires expensive computation of the first 100 eigenvectors, our optimization can be solved efficiently in linear time via conjugate gradient (CG). Experimental results show that our method produces satisfactory soft labels per pixel for images in two public datasets at a reduced computation cost compared to the previous soft segmentation scheme.
Due to the low signal-to-noise ratio and limited resolution of functional MRI data, and the high complexity of natural images, reconstructing a visual stimulus from human brain fMRI measurements is a challenging task....
详细信息
Due to the low signal-to-noise ratio and limited resolution of functional MRI data, and the high complexity of natural images, reconstructing a visual stimulus from human brain fMRI measurements is a challenging task. In this work, we propose a novel approach for this task, which we call Cortex2image, to decode visual stimuli with high semantic fidelity and rich fine-grained detail. In particular, we train a surface-based convolutional network model that maps from brain response to semantic image features first (Cortex2Semantic). We then combine this model with a high-quality image generator (Instance-Conditioned GAN) to train another mapping from brain response to fine-grained image features using a variational approach (Cortex2Detail). imagereconstructions obtained by our proposed method achieve state-of-the-art semantic fidelity, while yielding good fine-grained similarity with the ground-truth stimulus. Our code is available on https://***/zijin-gu/***.
3D face reconstruction plays a major role in many human-robot interaction systems, from automatic face authentication to human-computer interface-based entertainment. To improve robustness against occlusions and noise...
详细信息
ISBN:
(纸本)9798350323658
3D face reconstruction plays a major role in many human-robot interaction systems, from automatic face authentication to human-computer interface-based entertainment. To improve robustness against occlusions and noise, 3D face reconstruction networks are often trained on a set of in-the-wild face images preferably captured along different viewpoints of the subject. However, collecting the required large amounts of 3D annotated face data is expensive and time-consuming. To address the high annotation cost and due to the importance of training on a useful set, we propose an Active Learning (AL) framework that actively selects the most informative and representative samples to be labeled. To the best of our knowledge, this paper is the first work on tackling active learning for 3D face reconstruction to enable a label-efficient training strategy. In particular, we propose a Reinforcement Active Learning approach in conjunction with a clustering-based pooling strategy to select informative view-points of the subjects. Experimental results on 300W-LP and AFLW2000 datasets demonstrate that our proposed method is able to 1) efficiently select the most influencing view-points for labeling and outperforms several baseline AL techniques and 2) further improve the performance of a 3D Face reconstruction network trained on the full dataset.
Nowadays, robotics, AR, and 3D modeling applications attract considerable attention to single-view depth estimation (SVDE) as it allows estimating scene geometry from a single RGB image. Recent works have demonstrated...
详细信息
ISBN:
(数字)9781665469463
ISBN:
(纸本)9781665469463
Nowadays, robotics, AR, and 3D modeling applications attract considerable attention to single-view depth estimation (SVDE) as it allows estimating scene geometry from a single RGB image. Recent works have demonstrated that the accuracy of an SVDE method hugely depends on the diversity and volume of the training data. However, RGB-D datasets obtained via depth capturing or 3D reconstruction are typically small, synthetic datasets are not photorealistic enough, and all these datasets lack diversity. The large-scale and diverse data can be sourced from stereo images or stereo videos from the web. Typically being uncalibrated, stereo data provides disparities up to unknown shift (geometrically incompletedata), so stereo-trained SVDE methods cannot recover 3D geometry. It was recently shown that the distorted point clouds obtained with a stereo-trained SVDE method can be corrected with additional point cloud modules (PCM) separately trained on the geometrically complete data. On the contrary, we propose GP(2), General-Purpose and Geometry-Preserving training scheme, and show that conventional SVDE models can learn correct shifts themselves without any post-processing, benefiting from using stereo data even in the geometry-preserving setting. Through experiments on different dataset mixtures, we prove that GP(2)-trained models outperform methods relying on PCM in both accuracy and speed, and report the state-of-the-art results in the general-purpose geometry-preserving SVDE. Moreover, we show that SVDE models can learn to predict geometrically correct depth even when geometrically complete data comprises the minor part of the training set.
暂无评论