Search Results - Inner Mongolia University Library

IEEE International Geoscience and Remote Sensing Symposium

Authors: Lei Liu, Zongxu Pan, Xiaolan Qiu, Lingxiao Peng (Key Laboratory of Technology in Geo-spatial Information Processing and Application System, Chinese Academy of Sciences, Beijing, China)

Target classification is an important part of automatic target recognition (ATR) systems. Deep learning methods achieve state-of-the-art performance in SAR target classification. Simulation is a useful data augmentation method when the number of real training samples is insufficient. This article discusses how to release the full potential of simulated samples used to improve the performance of a SAR target classifier. The proposed method is based on the cycle-consistent adversarial network (CycleGAN), which can transfer simulated samples so that they are more similar to real samples in the image domain. Experiments show that adding simulated samples directly to the training dataset does not improve performance. However, adding the transferred simulated samples to the training set yields an increase of about 10% in accuracy in the designed SAR airplane classification experiment, compared with training without data augmentation.

Keywords: Training; Testing; Synthetic aperture radar; Azimuth; Gallium nitride; Data models; Airplanes
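As a rough illustration of the CycleGAN idea the abstract relies on, the sketch below computes a cycle-consistency term: a sample mapped to the other domain and back should return close to itself. The mappings `g` (simulated to real-like) and `f` (real to simulated-like) are hypothetical toy functions standing in for trained generators, and the L1 form is the commonly used one, not necessarily the exact loss used in this paper.

```python
def l1_distance(a, b):
    """Mean absolute difference between two equal-length sequences."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def cycle_consistency_loss(x_sim, x_real, g, f):
    """Sum of the forward and backward cycle terms.

    g maps simulated samples toward the real domain,
    f maps real samples toward the simulated domain.
    """
    forward = l1_distance(f(g(x_sim)), x_sim)     # sim -> real -> sim
    backward = l1_distance(g(f(x_real)), x_real)  # real -> sim -> real
    return forward + backward


# Toy stand-ins for the two generators (assumptions for illustration only).
def g(v):
    return [2 * t + 1 for t in v]


def f_exact(v):
    return [(t - 1) / 2 for t in v]  # exact inverse of g


def f_biased(v):
    return [(t - 1) / 2 + 0.5 for t in v]  # imperfect inverse


loss_exact = cycle_consistency_loss([0.0, 1.0, 2.0], [1.0, 3.0], g, f_exact)
loss_biased = cycle_consistency_loss([0.0, 1.0, 2.0], [1.0, 3.0], g, f_biased)
```

With the exact inverse the term vanishes, while the biased mapping is penalized; in a full CycleGAN this term is combined with adversarial losses for both domains.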

SUPER-RESOLUTION OF REMOTE SENSING IMAGES BASED ON TRANSFERRED GENERATIVE ADVERSARIAL NETWORK

IEEE International Geoscience and Remote Sensing Symposium

Authors: Wen Ma, Zongxu Pan, Jiayi Guo, Bin Lei (Key Laboratory of Technology in Geo-spatial Information Processing and Application System, Chinese Academy of Sciences, Beijing, China)

Single image super-resolution (SR) has been widely studied in recent years as a crucial technique for remote sensing applications. This paper proposes an SR method for remote sensing images based on a transferred generative adversarial network (TGAN). Unlike previous GAN-based SR approaches, the novelty of our method lies in two aspects. First, the batch normalization layers are removed to reduce memory consumption and computational burden, and to raise accuracy. Second, our model is trained in a transfer-learning fashion to cope with the insufficiency of training data, which is the crux of applying deep learning methods to remote sensing applications. The model is first trained on the external dataset DIV2K and then fine-tuned on the remote sensing dataset. Our experimental results demonstrate that the proposed method is superior to SRCNN and SRGAN in both objective evaluation and subjective visual quality.

Keywords: Remote sensing images; Super-resolution; Generative adversarial network; Transfer learning

Curved-path SAR geolocation error analysis based on BP algorithm

38th Annual IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2018

Authors: Liu, Junbin; Qiu, Xiaolan; Huang, Lijia; Ding, Chibiao; Liu, Ming (University of Chinese Academy of Sciences, China; Key Laboratory of Technology in Geo-spatial Information Processing and Application System, CAS, China; Institute of Electronics, Chinese Academy of Sciences, China; National Disaster Reduction Center, Ministry of Civil Affairs, NDRCC, China)

ISBN: (print) 9781538671504

The theoretical modeling and analysis of SAR location error play an important role in SAR system design and error source budgeting. Existing SAR geolocation error models are mainly implicit, which makes them difficult to analyze, especially in the curved-path case. In this paper, a theoretical explicit model of the relationship between image geolocation error and path measurement error is established for curved-path SAR, based on the backprojection (BP) imaging algorithm. Simulations verify the correctness of the model. The explicit model and the analysis results provide an effective reference for understanding and budgeting the system-level geometric location error for curved-path SAR, such as geoSAR. © 2018 IEEE

Keywords: Geology; Imaging; Synthetic aperture radar; Trajectory; Mathematical model; Analytical models; Simulation

InSAR DEM Reconstruction Based on Backprojection Algorithm in Two Converse Flights

Asian and Pacific Conference on Synthetic Aperture Radar (APSAR)

Authors: Xiaoning Hu, Maosheng Xiang, Bingnan Wang, Xikai Fu (University of Chinese Academy of Sciences; National Key Laboratory of Science and Technology on Microwave Imaging, Institute of Electronics, Chinese Academy of Sciences, Beijing, China; Key Laboratory of Technology in Geo-spatial Information Processing and Application System, Institute of Electronics, Chinese Academy of Sciences, Beijing, China)

ISBN: (digital) 9781728129129

ISBN: (print) 9781728129136

Interferometric synthetic aperture radar (InSAR) can be used to extract digital elevation models (DEMs) with high accuracy. However, the side-looking geometry of synthetic aperture radar (SAR) may cause geometric distortions such as shadow and layover in mountainous terrain, which reduce the quality of the generated DEM. Fusing InSAR data from two or more different aspects can address this problem. We propose an InSAR DEM reconstruction method based on the backprojection (BP) algorithm in two converse flights. This method exploits the fact that the BP algorithm performs geocoding during the imaging process, which simplifies the fusion of multi-aspect InSAR data. In addition, an iterative DEM extraction method is introduced to improve DEM accuracy. Experimental results verify the effectiveness of the proposed method.

SDM: Semantic Distortion Measurement for Video Encryption

International Conference on Automatic Face and Gesture Recognition

Authors: Yongquan Hu, Wei Zhou, Shuxin Zhao, Zhibo Chen, Weiping Li (CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System, University of Science and Technology of China, Hefei, China)

Semantic information is important in video encryption. However, existing image quality assessment (IQA) methods, such as the peak signal-to-noise ratio (PSNR), are still widely applied to measure encryption security. Generally, these traditional IQA methods evaluate image quality from the perspective of the visual signal rather than semantic information. In this paper, we propose a novel semantic-level full-reference image quality assessment (FR-IQA) method named Semantic Distortion Measurement (SDM) to measure the degree of semantic distortion for video encryption. Then, based on a semantic saliency dataset, we verify that the proposed SDM method outperforms state-of-the-art algorithms. Furthermore, we construct a Region Of Semantic Saliency (ROSS) video encryption system to demonstrate the effectiveness of the proposed SDM method in a practical application.

Keywords: Semantics; Encryption; Distortion; Distortion measurement; Visualization; Object segmentation
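For context on the signal-level baseline named in the abstract, the following is a minimal sketch of PSNR using its standard definition, 10·log10(peak² / MSE). The function name `psnr` and the flat pixel-list input are assumptions made for illustration; this is not the SDM method, whose details the abstract does not give.

```python
import math


def psnr(reference, distorted, peak=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel sequences."""
    if len(reference) != len(distorted) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all pixel positions.
    mse = sum((r - d) ** 2 for r, d in zip(reference, distorted)) / len(reference)
    if mse == 0:
        return math.inf  # identical signals
    return 10.0 * math.log10(peak ** 2 / mse)


identical = psnr([0, 0], [0, 0])
half_wrong = psnr([255, 0], [0, 0])
```

Because the score depends only on pixel-wise differences, it cannot tell whether the semantic content of an encrypted frame is still recognizable, which is the gap the abstract says SDM is meant to address.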

Multiscale Progressive Image Compression Network Guided by Learnable Just Noticeable Distortion

IEEE Visual Communications and Image Processing (VCIP)

Authors: Xin Jin, Runchun Ye, Zhibo Chen (CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System, University of Science and Technology of China, Hefei, China)

ISBN: (print) 9781538644591; 9781538644584

One key challenge for learning-based image compression is that adaptive bit allocation is crucial for compression effectiveness but can hardly be trained into a neural network. In this work, we present an end-to-end trainable image compression framework, named Multi-scale Progressive Network (MPN), to achieve spatially variant bit allocation and rate control through the guidance of a novel learnable just noticeable distortion (JND) map. Specifically, MPN's encoder achieves multi-scale feature representation through a three-branch structure. Each branch employs an independent feature extraction strategy for its specific receptive field, and the branches merge progressively under the guidance of the corresponding learnable JND maps generated by our proposed Bit-Allocation sub-Network (BAN). This makes MPN focus on the areas that attract the human visual system (HVS) and preserve more image texture during compression. Finally, a hybrid objective function is introduced to make MPN more efficient and to mimic the discriminative characteristics of the HVS. Experiments show that MPN significantly outperforms traditional JPEG, JPEG 2000, and a few state-of-the-art learning-based methods in terms of the multi-scale structural similarity (MS-SSIM) index, and can produce much better visual results with rich textures, sharp edges, and fewer artifacts.

Keywords: Image coding; Feature extraction; Transform coding; Distortion; Image reconstruction; Bit rate; Visualization

Convolutional Neural Network-Based Residue Super-Resolution for Video Coding

IEEE Visual Communications and Image Processing (VCIP)

Authors: Kang Liu, Dong Liu, Houqiang Li, Feng Wu (CAS Key Laboratory of Technology in Geo-Spatial Information Processing and Application System, University of Science and Technology of China, Hefei, China)

ISBN: (print) 9781538644591; 9781538644584

Inspired by the progress of image and video super-resolution (SR) achieved by convolutional neural networks (CNNs), we propose a CNN-based residue SR method for video coding. Unlike previous works that operate in the pixel domain, i.e., down- and up-sampling of the image or video frame, we propose to perform down- and up-sampling in the residue domain. Specifically, for each block, we perform motion estimation and compensation to obtain the residual signal at the original resolution, then down-sample the residue and compress it at low resolution, and finally perform residue SR using a trained CNN model. We design a new CNN for residue SR with the help of the motion-compensated prediction signal. We integrate the residue SR method into the High Efficiency Video Coding (HEVC) scheme, providing mode decision at the level of the coding tree unit. Experimental results show that our method achieves on average 4.0% and 2.8% BD-rate reduction under the low-delay P and low-delay B configurations, respectively.

Keywords: Encoding; Signal resolution; Video coding; Spatial resolution; Delays; Convolution
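The residue-domain pipeline described in the abstract can be sketched on a one-dimensional block as follows. This is a toy illustration under stated assumptions: pair averaging stands in for the down-sampler, sample repetition stands in for the trained CNN, the lossy compression of the low-resolution residue is omitted, and all function names are hypothetical.

```python
def downsample2(signal):
    """Halve the resolution by averaging adjacent pairs (length must be even)."""
    return [(signal[i] + signal[i + 1]) / 2 for i in range(0, len(signal), 2)]


def upsample2(signal):
    """Double the resolution by sample repetition (stand-in for the trained CNN)."""
    return [v for v in signal for _ in range(2)]


def reconstruct_block(original, prediction):
    """Down- and up-sample the prediction residue, then add it back."""
    # Residue at the original resolution: what motion compensation missed.
    residue = [o - p for o, p in zip(original, prediction)]
    low_res = downsample2(residue)   # this is what would be compressed
    restored = upsample2(low_res)    # residue SR step
    return [p + r for p, r in zip(prediction, restored)]


block = reconstruct_block([10, 12, 20, 22], [9, 11, 18, 20])
```

In this example the residue is piecewise constant, so the round trip is lossless; for a general residue the up-sampler cannot recover the lost detail exactly, which is where a learned SR model would be expected to help.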

LEARNED SCALABLE IMAGE COMPRESSION WITH BIDIRECTIONAL CONTEXT DISENTANGLEMENT NETWORK

arXiv, 2018

Authors: Zhang, Zhizheng; Chen, Zhibo; Lin, Jianxin; Li, Weiping (CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System, University of Science and Technology of China, Hefei, China)

In this paper, we propose a learned scalable/progressive image compression scheme based on deep neural networks (DNN), named Bidirectional Context Disentanglement Network (BCD-Net). For learning hierarchical representations, we first adopt bit-plane decomposition to decompose the information coarsely before the deep-learning-based transformation. However, the information carried by different bit-planes is not only unequal in entropy but also of different importance for reconstruction. We thus take the hidden features corresponding to different bit-planes as the context and design a network topology with bidirectional flows to disentangle the contextual information for more effective compressed representations. Our proposed scheme enables us to obtain compressed codes with scalable rates via one-pass encoding-decoding. Experimental results demonstrate that our proposed model outperforms the state-of-the-art DNN-based scalable image compression methods in both PSNR and MS-SSIM metrics. In addition, our proposed model achieves better performance in the MS-SSIM metric than conventional scalable image codecs. The effectiveness of our technical components is also verified through sufficient ablation experiments. Copyright © 2018, The Authors. All rights reserved.

Keywords: Deep neural networks
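The bit-plane decomposition mentioned in the abstract can be illustrated with a short sketch. It covers only the generic decomposition step for 8-bit pixel values, assuming a flat list of integers as input; the function names are hypothetical and nothing here reflects the network itself.

```python
def bit_planes(pixels, bits=8):
    """Split unsigned integers into bit planes, most significant plane first."""
    return [[(p >> b) & 1 for p in pixels] for b in range(bits - 1, -1, -1)]


def from_bit_planes(planes, keep=None):
    """Rebuild pixel values from the first `keep` planes (all planes by default)."""
    bits = len(planes)
    keep = bits if keep is None else keep
    out = [0] * len(planes[0])
    for i, plane in enumerate(planes[:keep]):
        weight = 1 << (bits - 1 - i)  # place value of this plane
        for j, bit in enumerate(plane):
            out[j] += bit * weight
    return out


pixels = [0, 37, 128, 255]
planes = bit_planes(pixels)
full = from_bit_planes(planes)            # all 8 planes
coarse = from_bit_planes(planes, keep=4)  # only the 4 most significant planes
```

Keeping only the leading planes gives a coarse approximation (here `[0, 32, 128, 240]`), which shows why planes differ in their importance for reconstruction and why the decomposition lends itself to scalable rates.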

Generative Adversarial Network-Based Frame Extrapolation for Video Coding

IEEE Visual Communications and Image Processing (VCIP)

Authors: Jianping Lin, Dong Liu, Houqiang Li, Feng Wu (CAS Key Laboratory of Technology in Geo-Spatial Information Processing and Application System, University of Science and Technology of China, Hefei, China)

ISBN: (print) 9781538644591; 9781538644584

Motion estimation and motion compensation are fundamental in video coding to remove the temporal redundancy between video frames. Current video coding schemes usually adopt block-based motion estimation and compensation using simple translational or affine motion models, which cannot efficiently characterize complex motions in natural video signals. In this paper, we propose a frame extrapolation method for motion estimation and compensation. Specifically, based on several previous frames, our method directly extrapolates the current frame using a trained deep network model. The deep network we adopt is a redesigned Video Coding oriented LAplacian Pyramid of Generative Adversarial Networks (VC-LAPGAN). The extrapolated frame is then used as an additional reference frame. Experimental results show that the VC-LAPGAN is capable of estimating and compensating for complex motions, and of extrapolating frames with high visual quality. Using the VC-LAPGAN, our method achieves on average 2.0% BD-rate reduction relative to High Efficiency Video Coding (HEVC) under the low-delay P configuration.

Keywords: Video coding; Motion estimation; Training; Computational modeling; Extrapolation; Laplace equations; Convolutional codes

Learning based Facial Image Compression with Semantic Fidelity Metric

arXiv, 2018

Authors: Chen, Zhibo; He, Tianyu (CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System, University of Science and Technology of China, Hefei, China)

Surveillance and security scenarios usually require a highly efficient facial image compression scheme for face recognition and identification. However, both traditional general-purpose image codecs and specialized facial image compression schemes only refine the codec heuristically and separately according to a face verification accuracy metric. We propose a Learning based Facial Image Compression (LFIC) framework with a novel Regionally Adaptive Pooling (RAP) module whose parameters can be automatically optimized according to gradient feedback from an integrated hybrid semantic fidelity metric, including a successful exploration of applying a Generative Adversarial Network (GAN) directly as a metric in the image compression scheme. The experimental results verify the framework’s efficiency by demonstrating bitrate savings of 71.41%, 48.28%, and 52.67% over JPEG2000, WebP, and neural network-based codecs, respectively, under the same face verification accuracy distortion metric. We also evaluate LFIC’s superior performance gain compared with the latest specialized facial image codecs. Visual experiments also offer some interesting insight into how LFIC can automatically capture the information in critical areas based on semantic distortion metrics for optimized compression, which is quite different from the heuristic optimization used in traditional image compression algorithms. Copyright © 2018, The Authors. All rights reserved.

Keywords: Image compression
