Semantic communication has emerged as the break-through beyond the Shannon theorem by transmitting and receiving semantic information instead of data bits or symbols regardless of its content. This paper proposes a tw...
详细信息
We address a challenging lifelong few- shot image generation task for the first time. In this situation, a generative model learns a sequence of tasks using only a few samples per task. Consequently, the learned model...
ISBN:
(纸本)9798350307184
We address a challenging lifelong few- shot image generation task for the first time. In this situation, a generative model learns a sequence of tasks using only a few samples per task. Consequently, the learned model encounters both catastrophic forgetting and overfitting problems at a time. Existing studies on lifelong GANs have proposed modulation-based methods to prevent catastrophic forgetting. However, they require considerable additional parameters and cannot generate high-fidelity and diverse images from limited data. On the other hand, the existing few-shot GANs suffer from severe catastrophic forgetting when learning multiple tasks. To alleviate these issues, we propose a framework called Lifelong Few-Shot GAN (LFS-GAN) that can generate high-quality and diverse images in lifelong few-shot image generation task. Our proposed framework learns each task using an efficient task-specific modulator - Learnable Factorized Tensor (LeFT). LeFT is rank-constrained and has a rich representation ability due to its unique reconstruction technique. Furthermore, we propose a novel mode seeking loss to improve the diversity of our model in low-data circumstances. Extensive experiments demonstrate that the proposed LFS-GAN can generate high-fidelity and diverse images without any forgetting and mode collapse in various domains, achieving state-of-the-art in lifelong few-shot image generation task. Surprisingly, we find that our LFS-GAN even outperforms the existing few-shot GANs in the few-shot image generation task. The code is available at Github.
As an active microwave imaging technology, synthetic aperture radar (SAR) has extensive application prospects in remote sensing, surveying, and mapping. How to reconstruct SAR images from the subsampled echo data has ...
详细信息
ISBN:
(纸本)9781665436694
As an active microwave imaging technology, synthetic aperture radar (SAR) has extensive application prospects in remote sensing, surveying, and mapping. How to reconstruct SAR images from the subsampled echo data has always been a challenging issue. Compressed sensing technology attempts to use the sparse characteristics of SAR images to recover the SAR images from the compressed samples. However, in practical applications, SAR images do not always meet the characteristic of sparsity. In this paper, we propose an imaging method from sub-Nyquist sampled data based on deep priors of SAR images. First, we use a generative flow network to model the deep prior information of the images based on the existing SAR imagedatasets. Then, the pre-trained network modeling deep prior information of images is embedded in a typical compressed sensing method, i.e., the Iterative Shrinkage-Thresholding Algorithm (ISTA), to replace the sparse shrinkage function of it. Meanwhile, in order to improve the accuracy and convergence speed of reconstruction, we used the subsampled echo data to fine-tune the iterative parameters of the embedded ISTA method. The results of the experiments show that through the use of the deep prior information of the images, this method can accurately reconstruct non-sparse SAR images from the subsampled echoes, even if only a few echo samples are available.
Synthesis and reconstruction of 3D human head has gained increasing interests in computer vision and computer graphics recently. Existing state-of-the-art 3D generative adversarial networks (GANs) for 3D human head sy...
详细信息
ISBN:
(纸本)9798350301298
Synthesis and reconstruction of 3D human head has gained increasing interests in computer vision and computer graphics recently. Existing state-of-the-art 3D generative adversarial networks (GANs) for 3D human head synthesis are either limited to near-frontal views or hard to preserve 3D consistency in large view angles. We propose PanoHead, the first 3D-aware generative model that enables high-quality view-consistent image synthesis of full heads in 360 degrees. with diverse appearance and detailed geometry using only in-the-wild unstructured images for training. At its core, we lift up the representation power of recent 3D GANs and bridge the data alignment gap when training from in-the-wild images with widely distributed views. Specifically, we propose a novel two-stage self-adaptive image alignment for robust 3D GAN training. We further introduce a tri-grid neural volume representation that effectively addresses front-face and backhead feature entanglement rooted in the widely-adopted triplane formulation. Our method instills prior knowledge of 2D image segmentation in adversarial learning of 3D neural scene structures, enabling compositable head synthe- sis in diverse backgrounds. Benefiting from these designs, our method significantly outperforms previous 3D GANs, generating high-quality 3D heads with accurate geometry and diverse appearances, even with long wavy and afro hairstyles, renderable from arbitrary poses. Furthermore, we show that our system can reconstruct full 3D heads from single input images for personalized realistic 3D avatars.
The advanced deep learning-based Autoencoding techniques have enabled the introduction of efficient Unsupervised Anomaly Detection (UAD) approaches. Several autoencoder-based approaches have been used to solve UAD tas...
详细信息
Motivated by the fact that forward and backward passes of a deep network naturally form symmetric mappings between input and output representations, we introduce a simple yet effective self-supervised vision model pre...
详细信息
In our latest development of an open-source simulation framework, we introduce the innovative concept of "food-atoms" for simulating mechanical phenomena in food systems, covering a broad spectrum from produ...
详细信息
image processing is a method for enhancing unprocessed images from cameras on aircraft, spacecraft, and satellites as well as images taken regularly for a variety of uses. In general, the following strategies can be u...
详细信息
Phase-retrieval approaches seek to recover an original signal using only the modulus of its Fourier transform, which is often much easier to detect than its phase, but normal iterative techniques frequently fail when ...
详细信息
ISBN:
(数字)9798350377323
ISBN:
(纸本)9798350377330
Phase-retrieval approaches seek to recover an original signal using only the modulus of its Fourier transform, which is often much easier to detect than its phase, but normal iterative techniques frequently fail when only a portion of the modulus information is available [1]. We demonstrate that a neural network can be trained to perform phase retrieval with partial information, and we analyze the benefits and drawbacks of this technique. In practice we consider a reconstruction problem where we wish to recover the image of a concealed object from an estimate of its autocorrelation, obtained from the analysis of the spatial correlations of the speckle image. We perform the image retrieval task on a synthetic dataset and under different conditions, where the autocorrelation information may be either fully available or, more interestingly, partially removed [2].
The proceedings contain 128 papers. The topics discussed include: a study on multimodal approach for early detection of dementia using deep learning;market analysis of various types of rapid testing glucose level equi...
ISBN:
(纸本)9798350383287
The proceedings contain 128 papers. The topics discussed include: a study on multimodal approach for early detection of dementia using deep learning;market analysis of various types of rapid testing glucose level equipments;optimal feature selection using enhanced bacterial foraging optimization for healthcare application based big data;transformer based lightweight model for punctuation restoration and truecasing;fuzzy based energy management strategy for battery and fuel cell hybrid vehicles;upgrade better accuracy by analyzing the behavior of classifying algorithm to detect intrusion in network traffic;enhanced pothole detection in road condition assessment using YOLOv8;towards efficient encrypted medical image retrieval from heterogeneous multi cloud environment;and enhancing medical imaging resolution: exploring SRGAN for high-quality medical imagereconstruction.
暂无评论