In recent years, deep learning has achieved promising success for multimedia quality assessment, especially for image quality assessment (IQA). However, since there exist more complex temporal characteristics in video...
详细信息
Image quality assessment (IQA) aims to estimate human perception based image visual quality. Although existing deep neural networks (DNNs) have shown significant effectiveness for tackling the IQA problem, it still ne...
详细信息
We propose an image coding scheme that compresses image into semantically scalable bitstream using deep neural networks. This scheme is expected to support intelligent analysis when the bitstream is partially decoded,...
详细信息
ISBN:
(数字)9781728133201
ISBN:
(纸本)9781728133218
We propose an image coding scheme that compresses image into semantically scalable bitstream using deep neural networks. This scheme is expected to support intelligent analysis when the bitstream is partially decoded, as well as high-fidelity reconstruction of image when the bitstream is completely decoded. We implement such a semantically scalable image coding scheme based on semantic map. In the proposed scheme, the original image is firstly semantically segmented and the semantic map is compressed as the base layer. Then, the original image is segmented into several individual objects according to the semantic map, and each object is coded separately. A recurrent neural network-based encoder is used to compress these objects at several quality levels. At the decoder side, the semantic map can be directly applied for intelligent analysis. A generative adversarial network is used to synthesize a rough image using the semantic map. If user is interested in a certain object, more bits can be transmitted to enhance the quality of the object. Experimental results show that the proposed method achieves comparable compression performance with JPEG2000 at high bit rates, while facilitates intelligent analysis at low bit rates.
Quality assessment of omnidirectional images has become increasingly urgent due to the rapid growth of virtual reality applications. Different from traditional 2D images and videos, omnidirectional contents can provid...
详细信息
Most existing image restoration networks are designed in a disposable way and catastrophically forget previously learned distortions when trained on a new distortion removal task. To alleviate this problem, we raise t...
详细信息
Hybrid-distorted image restoration (HD-IR) is dedicated to restore real distorted image that is degraded by multiple distortions. Existing HD-IR approaches usually ignore the inherent interference among hybrid distort...
详细信息
Generative adversarial nets (GANs) have been successfully applied in many fields like image generation, inpainting, super-resolution, and drug discovery, etc. By now, the inner process of GANs is far from being unders...
详细信息
Generative adversarial nets (GANs) have been successfully applied in many fields like image generation, inpainting, super-resolution, and drug discovery, etc. By now, the inner process of GANs is far from being understood. To get a deeper insight into the intrinsic mechanism of GANs, in this paper, a method for interpreting the latent space of GANs by analyzing the correlation between latent variables and the corresponding semantic contents in generated images is proposed. Unlike previous methods that focus on dissecting models via feature visualization, the emphasis of this work is put on the variables in latent space, i.e. how the latent variables affect the quantitative analysis of generated results. Given a pre-trained GAN model with weights fixed, the latent variables are intervened to analyze their effect on the semantic content in generated images. A set of controlling latent variables can be derived for specific content generation, and the controllable semantic content manipulation is achieved. The proposed method is testified on the datasets Fashion-MNIST and UT Zappos50K, experiment results show its effectiveness.
Synthetic aperture radar (SAR) tomography (TomoSAR) has attracted remarkable interest for its ability in achieving three-dimensional reconstruction along the elevation direction from multiple observations. In recent y...
详细信息
Rain removal is important for many computer vision applications, such as surveillance, autonomous car, etc. Traditionally, rain removal is regarded as a signal removal problem which usually causes over-smoothing by re...
详细信息
Imagery geometry models (IGMs) of the high-resolution satellite images (HRSIs) are always of great interest in the photogrammetry and remote sensing community for the raising new kinds of sensors and imaging systems. ...
详细信息
暂无评论