检索结果-内蒙古大学图书馆

Deep multi-scale features learning for distorted image quality assessment

学校读者我要写书评

暂无评论

arXiv 2020年

作者： Zhou, Wei Chen, Zhibo CAS Key Laboratory of Technology in Geo-Spatial Information Processing and Application System University of Science and Technology of China Hefei230027 China

Image quality assessment (IQA) aims to estimate human perception based image visual quality. Although existing deep neural networks (DNNs) have shown significant effectiveness for tackling the IQA problem, it still needs to improve the DNN-based quality assessment models by exploiting efficient multi-scale features. In this paper, motivated by the human visual system (HVS) combining multi-scale features for perception, we propose to use pyramid features learning to build a DNN with hierarchical multi-scale features for distorted image quality prediction. Our model is based on both residual maps and distorted images in luminance domain, where the proposed network contains spatial pyramid pooling and feature pyramid from the network structure. Our proposed network is optimized in a deep end-to-end supervision manner. To validate the effectiveness of the proposed method, extensive experiments are conducted on four widely-used image quality assessment databases, demonstrating the superiority of our algorithm. © 2020, CC BY.

关键词： Deep neural networks

Towards Semantically Scalable Image Coding using Semantic Map

学校读者我要写书评

暂无评论

Towards Semantically Scalable Image Coding using Semantic Ma...

IEEE International Symposium on Circuits and systems (ISCAS)

作者： Ning Yan Dong Liu Houqiang Li Feng Wu Zhiwei Xiong Zheng-Jun Zha CAS Key Laboratory of Technology in Geo-Spatial Information Processing and Application System University of Science and Technology of China Hefei 230027 China

ISBN: (数字)9781728133201

ISBN: (纸本)9781728133218

We propose an image coding scheme that compresses image into semantically scalable bitstream using deep neural networks. This scheme is expected to support intelligent analysis when the bitstream is partially decoded, as well as high-fidelity reconstruction of image when the bitstream is completely decoded. We implement such a semantically scalable image coding scheme based on semantic map. In the proposed scheme, the original image is firstly semantically segmented and the semantic map is compressed as the base layer. Then, the original image is segmented into several individual objects according to the semantic map, and each object is coded separately. A recurrent neural network-based encoder is used to compress these objects at several quality levels. At the decoder side, the semantic map can be directly applied for intelligent analysis. A generative adversarial network is used to synthesize a rough image using the semantic map. If user is interested in a certain object, more bits can be transmitted to enhance the quality of the object. Experimental results show that the proposed method achieves comparable compression performance with JPEG2000 at high bit rates, while facilitates intelligent analysis at low bit rates.

关键词： Semantics Image coding Image reconstruction Transform coding Encoding Image segmentation Scalability

Blind omnidirectional image quality assessment with viewport oriented graph convolutional networks

学校读者我要写书评

暂无评论

arXiv 2020年

作者： Xu, Jiahua Zhou, Wei Chen, Zhibo CAS Key Laboratory of Technology Geo-Spatial Information Processing and Application System University of Science and Technology of China Hefei230027 China

Quality assessment of omnidirectional images has become increasingly urgent due to the rapid growth of virtual reality applications. Different from traditional 2D images and videos, omnidirectional contents can provide consumers with freely changeable viewports and a larger field of view covering the 360° × 180° spherical surface, which makes the objective quality assessment of omnidirectional images more challenging. In this paper, motivated by the characteristics of the human vision system (HVS) and the viewing process of omnidirectional contents, we propose a novel Viewport oriented Graph Convolution Network (VGCN) for blind omnidirectional image quality assessment (IQA). Generally, observers tend to give the subjective rating of a 360-degree image after passing and aggregating different viewports information when browsing the spherical scenery. Therefore, in order to model the mutual dependency of viewports in the omnidirectional image, we build a spatial viewport graph. Specifically, the graph nodes are first defined with selected viewports with higher probabilities to be seen, which is inspired by the HVS that human beings are more sensitive to structural information. Then, these nodes are connected by spatial relations to capture interactions among them. Finally, reasoning on the proposed graph is performed via graph convolutional networks. Moreover, we simultaneously obtain global quality using the entire omnidirectional image without viewport sampling to boost the performance according to the viewing experience. Experimental results demonstrate that our proposed model outperforms state-of-the-art full-reference and no-reference IQA metrics on two public omnidirectional IQA databases. Copyright © 2020, The Authors. All rights reserved.

关键词： Convolution

LIRA: Lifelong image restoration from unknown blended distortions

学校读者我要写书评

暂无评论

arXiv 2020年

作者： Liu, Jianzhao Lin, Jianxin Li, Xin Zhou, Wei Liu, Sen Chen, Zhibo CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System University of Science and Technology of China Hefei230027 China

Most existing image restoration networks are designed in a disposable way and catastrophically forget previously learned distortions when trained on a new distortion removal task. To alleviate this problem, we raise the novel lifelong image restoration problem for blended distortions. We first design a base fork-join model in which multiple pre-trained expert models specializing in individual distortion removal task work cooperatively and adaptively to handle blended distortions. When the input is degraded by a new distortion, inspired by adult neurogenesis in human memory system, we develop a neural growing strategy where the previously trained model can incorporate a new expert branch and continually accumulate new knowledge without interfering with learned knowledge. Experimental results show that the proposed approach can not only achieve state-of-the-art performance on blended distortions removal tasks in both PSNR/SSIM metrics, but also maintain old expertise while learning new restoration tasks. Copyright © 2020, The Authors. All rights reserved.

关键词： Image reconstruction

learning disentangled feature representation for Hybrid-distorted image restoration

学校读者我要写书评

暂无评论

arXiv 2020年

作者： Li, Xin Jin, Xin Lin, Jianxin Yu, Tao Liu, Sen Wu, Yaojun Zhou, Wei Chen, Zhibo CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System University of Science and Technology of China Hefei230027 China

Hybrid-distorted image restoration (HD-IR) is dedicated to restore real distorted image that is degraded by multiple distortions. Existing HD-IR approaches usually ignore the inherent interference among hybrid distortions which compromises the restoration performance. To decompose such interference, we introduce the concept of Disentangled Feature Learning to achieve the feature-level divide-and-conquer of hybrid distortions. Specifically, we propose the feature disentanglement module (FDM) to distribute feature representations of different distortions into different channels by revising gain-control-based normalization. We also propose a feature aggregation module (FAM) with channel-wise attention to adaptively filter out the distortion representations and aggregate useful content information from different channels for the construction of raw image. The effectiveness of the proposed scheme is verified by visualizing the correlation matrix of features and channel responses of different distortions. Extensive experimental results also prove superior performance of our approach compared with the latest HD-IR schemes. Copyright © 2020, The Authors. All rights reserved.

关键词： Image reconstruction

Interpreting the Latent Space of GANs via Correlation Analysis for Controllable Concept Manipulation

学校读者我要写书评

暂无评论

Interpreting the Latent Space of GANs via Correlation Analys...

International Conference on Pattern Recognition

作者： Ziqiang Li Rentuo Tao Hongjing Niu Mingdao Yue Bin Li CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application Systems University of Science and Technology of China Hefei Anhui China College of Mechanical and Electrical Engineering Suzhou University Suzhou Anhui China

Generative adversarial nets (GANs) have been successfully applied in many fields like image generation, inpainting, super-resolution, and drug discovery, etc. By now, the inner process of GANs is far from being understood. To get a deeper insight into the intrinsic mechanism of GANs, in this paper, a method for interpreting the latent space of GANs by analyzing the correlation between latent variables and the corresponding semantic contents in generated images is proposed. Unlike previous methods that focus on dissecting models via feature visualization, the emphasis of this work is put on the variables in latent space, i.e. how the latent variables affect the quantitative analysis of generated results. Given a pre-trained GAN model with weights fixed, the latent variables are intervened to analyze their effect on the semantic content in generated images. A set of controlling latent variables can be derived for specific content generation, and the controllable semantic content manipulation is achieved. The proposed method is testified on the datasets Fashion-MNIST and UT Zappos50K, experiment results show its effectiveness.

关键词： Drugs Visualization Analytical models Correlation Statistical analysis Image synthesis Semantics

TomoSAR-ALISTA: Efficient TomoSAR Imaging via Deep Unfolded Network

学校读者我要写书评

暂无评论

arXiv 2022年

作者： Wang, Muhan Zhang, Zhe Wang, Yue Gao, Silin Qiu, Xiaolan Key Laboratory of Technology in Geo-spatial Information Processing and Application System Chinese Academy of Sciences Beijing100190 China Key Laboratory of Intelligent Aerospace Big Data Application Technology Suzhou215123 China Suzhou Aerospace Information Research Institute Suzhou215123 China School of Electronic Electrical and Communication Engineering University of Chinese Academy of Sciences Beijing100049 China Aerospace Information Research Institute Chinese Academy of Sciences Beijing100094 China Electrical and Computer Engineering Department George Mason University FairfaxVA22030 United States

Synthetic aperture radar (SAR) tomography (TomoSAR) has attracted remarkable interest for its ability in achieving three-dimensional reconstruction along the elevation direction from multiple observations. In recent years, compressed sensing (CS) technique has been introduced into TomoSAR considering for its super-resolution ability with limited samples. Whereas, the CS-based methods suffer from several drawbacks, including weak noise resistance, high computational complexity and complex parameter fine-tuning. Among the different CS algorithms, iterative soft-thresholding algorithm (ISTA) is widely used as a robust reconstruction approach, however, the parameters in the ISTA algorithm are manually chosen, which usually requires a time-consuming fine-tuning process to achieve the best performance. Aiming at efficient TomoSAR imaging, a novel sparse unfolding network named analytic learned ISTA (ALISTA) is proposed towards the TomoSAR imaging problem in this paper, and the key parameters of ISTA are learned from training data via deep learning to avoid complex parameter fine-tuning and significantly relieves the training burden. In addition, experiments verify that it is feasible to use traditional CS algorithms as training labels, which provides a tangible supervised training method to achieve better 3D reconstruction performance even in the absence of labeled data in real applications. Copyright © 2022, The Authors. All rights reserved.

关键词： Compressed sensing

A decomposed dual-cross generative adversarial network for image rain removal 29

学校读者我要写书评

暂无评论

A decomposed dual-cross generative adversarial network for i...

29th British Machine Vision Conference, BMVC 2018

作者： Jin, Xin Chen, Zhibo Lin, Jianxin Chen, Jiale Zhou, Wei Shan, Chaowei CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System University of Science and Technology of China Hefei230027 China

Rain removal is important for many computer vision applications, such as surveillance, autonomous car, etc. Traditionally, rain removal is regarded as a signal removal problem which usually causes over-smoothing by removing texture details in non-rain background regions. This paper considers the issue of rain removal from a completely different perspective, to treat rain removal as a signal decomposition problem. Specifically, we decompose the rain image into two components, namely non-rain background image and rain streaks image. Then, we introduce an adversarial training mechanism to synthesize non-rain background image and rain streaks image in a Dual-Cross manner, which makes the two adversarial branches interact with each other, archiving a win-win result ultimately. The proposed Decomposed Dual-Cross Generative Adversarial Network (DDC-GAN) shows significantly performance improvement compared with state-of-the-art methods on both synthetic and real-world images in terms of qualitative and quantitative measures (over 3dB gains in PSNR). © 2018. The copyright of this document resides with its authors.

关键词： Rain

THE REMOTE SENSING IMAGE geoMETRICAL MODEL of BP NEURAL NETWORK

学校读者我要写书评

暂无评论

THE REMOTE SENSING IMAGE GEOMETRICAL MODEL of BP NEURAL NETW...

2020 International Conference on geomatics in the Big Data Era, ICGBD 2020

作者： Yue, C.Y. Sun, T. Xie, J.F. Beijing Institute of Space Mechanics and Electricity Beijing China Beijing Key Laboratory of Advanced Optical Remote Sensing Technology Beijing China Key Laboratory of Technology in Geo-spatial Information Processing and Application System Aerospace Information Research Institute Chinese Academy of Sciences Beijing China Land Satellite Remote Sensing Application Center Ministry of Natural Resources of P. R. China Beijing China

Imagery geometry models (IGMs) of the high-resolution satellite images (HRSIs) are always of great interest in the photogrammetry and remote sensing community for the raising new kinds of sensors and imaging systems. Especially the generalized sensor models (GSMs) have been widely used for positioning of satellite images, and the accuracy are already validated. Since Back propagation (BP) neural network is a better choice for the two key reasons of the replacement of physical sensor models by generalized sensor models, numerous mathematical estimations for every specialized sensor, and secret equations of the IGMs. Experiments are carried out to test the approximation accuracy of the new generalized sensor model. And the experimental results show that, the BP neural network is of extremely high accuracy for satellite imagery photogrammetric restitution. © 2020 C. Y. Yue et al.

关键词： Backpropagation