检索结果-内蒙古大学图书馆

Deep Scalable Image Compression via Hierarchical Feature Decorrelation

学校读者我要写书评

暂无评论

Deep Scalable Image Compression via Hierarchical Feature Dec...

Picture Coding Symposium, PCS

作者： Zongyu Guo Zhizheng Zhang Zhibo Chen CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System University of Science and Technology of China Hefei China

The following topics are dealt with: video coding; data compression; image coding; convolutional neural nets; decoding; learning (artificial intelligence); motion compensation; video codecs; image reconstruction; filt...

关键词：

Video-based point cloud compression artifact removal

学校读者我要写书评

暂无评论

arXiv 2021年

作者： Akhtar, Anique Gao, Wen Li, Li Li, Zhu Jia, Wei Liu, Shan Department of Computer Science and Electrical Engineering University of Missouri-Kansas City Kansas CityMO64110 United States Tencent America 661 Bryant St. Palo AltoCA94301 United States Department of Computer Science and Electrical Engineering University of Missouri-Kansas City Kansas CityMO64110 United States CAS Key Laboratory of Technology in Geo-Spatial Information Processing and Application System University of Science and Technology of China Hefei230027 China

—Photo-realistic point cloud capture and transmission are the fundamental enablers for immersive visual communication. The coding process of dynamic point clouds, especially video-based point cloud compression (V-PCC) developed by the MPEG standardization group, is now delivering state-of-the-art performance in compression efficiency. V-PCC is based on the projection of the point cloud patches to 2D planes and encoding the sequence as 2D texture and geometry patch sequences. However, the resulting quantization errors from coding can introduce compression artifacts, which can be very unpleasant for the quality of experience (QoE). In this work, we developed a novel out-of-the-loop point cloud geometry artifact removal solution that can significantly improve reconstruction quality without additional bandwidth cost. Our novel framework consists of a point cloud sampling scheme, an artifact removal network, and an aggregation scheme. The point cloud sampling scheme employs a cube-based neighborhood patch extraction to divide the point cloud into patches. The geometry artifact removal network then processes these patches to obtain artifact-removed patches. The artifact-removed patches are then merged together using an aggregation scheme to obtain the final artifact-removed point cloud. We employ 3D deep convolutional feature learning for geometry artifact removal that jointly recovers both the quantization direction and the quantization noise level by exploiting projection and quantization prior. The simulation results demonstrate that the proposed method is highly effective and can considerably improve the quality of the reconstructed point cloud. Copyright © 2021, The Authors. All rights reserved.

关键词： geometry

3D reconstruction and error analysis of multi-view space-borne SAR images under different configurations

学校读者我要写书评

暂无评论

3D reconstruction and error analysis of multi-view space-bor...

IET International Radar Conference 2018, IRC 2018

作者： Wang, Chao Qiu, Xiaolan Li, Fangfang Lei, Bin University of the Chinese Academy of Sciences Beijing China Institute of Electrics Chinese Academy of Sciences Beijing China Key Laboratory of Technology in Geo-spatial Information Processing and Application System Institute of Electrics Chinese Academy of Sciences Beijing China

Synthetic aperture radar (SAR) has a good ability to detect the microwave scattering characteristics of the target and has a good capability of slant range Doppler positioning. Using multi-view SAR images in combination with image matching and positioning equations, the 3D position of the target can be obtained. In the past, multi-view 3D reconstruction mainly used side-looking images with parallel trajectories but different incident angles. That method is not universal for different configurations and lacks analysis of the relationship between solution and parameter error. This study aims at the problem of multi-view SAR 3D reconstruction. The authors establish general 3D reconstruction equations that can be used for non-parallel track and non-side-looking and the solution method is deduced. Based on this, an analysis method of error sensitivity is proposed and the dependence relationship between the reconstruction error and configuration, velocity, position, and slant range is analysed. The correctness of this method is verified by simulation experiments, which provides guidance for selection of configuration and sensor accuracy when applying 3D reconstruction. © 2019 Institution of Engineering and technology. All rights reserved.

关键词： Synthetic aperture radar

A coarse-to-fine framework for learned color enhancement with non-local attention

学校读者我要写书评

暂无评论

arXiv 2019年

作者： Shan, Chaowei Zhang, Zhizheng Chen, Zhibo CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System University of Science and Technology of China Hefei230027 China

Automatic color enhancement is aimed to adaptively adjust photos to expected styles and tones. For current learned methods in this field, global harmonious perception and local details are hard to be well-considered in a single model simultaneously. To address this problem, we propose a coarse-tofine framework with non-local attention for color enhancement in this paper. Within our framework, we propose to divide enhancement process into channel-wise enhancement and pixel-wise refinement performed by two cascaded Convolutional Neural Networks (CNNs). In channel-wise enhancement, our model predicts a global linear mapping for RGB channels of input images to perform global style adjustment. In pixel-wise refinement, we learn a refining mapping using residual learning for local adjustment. Further, we adopt a non-local attention block to capture the long-range dependencies from global information for subsequent fine-grained local refinement. We evaluate our proposed framework on the commonly using benchmark and conduct sufficient experiments to demonstrate each technical component within it. Copyright © 2019, The Authors. All rights reserved.

关键词： Pixels

No-reference light field image quality assessment based on micro-lens image

学校读者我要写书评

暂无评论

arXiv 2019年

作者： Luo, Ziyuan Zhou, Wei Shi, Likun Chen, Zhibo CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System University of Science and Technology of China Hefei230027 China

Light field image quality assessment (LF-IQA) plays a significant role due to its guidance to Light Field (LF) contents acquisition, processing and application. The LF can be represented as 4-D signal, and its quality depends on both angular consistency and spatial quality. However, few existing LF-IQA methods concentrate on effects caused by angular inconsistency. Especially, no-reference methods lack effective utilization of 2-D angular information. In this paper, we focus on measuring the 2-D angular consistency for LF-IQA. The Micro-Lens Image (MLI) refers to the angular domain of the LF image, which can simultaneously record the angular information in both horizontal and vertical directions. Since the MLI contains 2-D angular information, we propose a No-Reference Light Field image Quality assessment model based on MLI (LF-QMLI). Specifically, we first utilize Global Entropy Distribution (GED) and Uniform Local Binary Pattern descriptor (ULBP) to extract features from the MLI, and then pool them together to measure angular consistency. In addition, the information entropy of Sub-Aperture Image (SAI) is adopted to measure spatial quality. Extensive experimental results show that LF-QMLI achieves the state-of-the-art performance. Copyright © 2019, The Authors. All rights reserved.

关键词： Image quality

A STUDY ON THE FREQUENCY AND AZIMUTH COHERENCE OF HIGH-RESOLUTION SAR IMAGE

学校读者我要写书评

暂无评论

A STUDY ON THE FREQUENCY AND AZIMUTH COHERENCE OF HIGH-RESOL...

IEEE International geoscience and Remote Sensing Symposium

作者： Wenji Xing Xiaolan Qiu Chibiao Ding The Key Laboratory of Technology in Geo-spatial Information Processing and Application System Institute of Electronics Chinese Academy of Sciences Beijing China

High-resolution SAR has large transmitting bandwidth and wide synthetic aperture. How to understand and take advantage of the variation characteristics of SAR scattering characteristics with angle and frequency is a topic that worth studying. This article establishes a coherence matrix of sub-band and sub-aperture SAR images, and analyzes its ability to classify scattering mechanism. Experiments are conducted using the TerraSAR-X high-resolution data of different scenarios, and some meaningful results are got, which may provide some support to the analysis and application of high-resolution SAR data.

关键词： Coherence Entropy Azimuth Radar polarimetry Synthetic aperture radar Image resolution

A COOPERATIVE MULTITEMPORAL SEGMENTATION METHOD FOR SAR AND OPTICAL IMAGES CHANGE DETECTION

学校读者我要写书评

暂无评论

A COOPERATIVE MULTITEMPORAL SEGMENTATION METHOD FOR SAR AND ...

IEEE International geoscience and Remote Sensing Symposium

作者： Ling Wan Yuming Xiang Hongjian You Key Laboratory of Technology in Geo-spatial Information Processing and Application System Institute of Electronics Chinese Academy of Sciences Beijing China

This paper proposes an extension version of our previous work MS-CC to achieve optical and SAR images change detection. The proposed method introduces a cooperative multitemporal segmentation, whose merging process considers the heterogeneity of SAR and optical images as parallel information, making sure that the multitemporal information can be fully utilized without interfering with each other. Then, the change detection strategy based on compound classification is carried out on the segmentation results, obtaining the multi-scale change detection maps. Experimental validation is conducted with GoaFen3 and Google Earth data.

关键词： Image segmentation Optical imaging Radar polarimetry Optical sensors Compounds Interference Stacking

Deep learning-based video coding: A review and a case study

学校读者我要写书评

暂无评论

arXiv 2019年

作者： Liu, Dong Li, Yue Lin, Jianping Li, Houqiang Wu, Feng CAS Key Laboratory of Technology in Geo-Spatial Information Processing and Application System University of Science and Technology of China Hefei230027 China

The past decade has witnessed great success of deep learning technology in many disciplines, especially in computer vision and image processing. However, deep learning-based video coding remains in its infancy. This paper reviews the representative works about using deep learning for image/video coding, which has been an actively developing research area since the year of 2015. We divide the related works into two categories: new coding schemes that are built primarily upon deep networks (deep schemes), and deep network-based coding tools (deep tools) that shall be used within traditional coding schemes or together with traditional coding tools. For deep schemes, pixel probability modeling and auto-encoder are the two approaches, that can be viewed as predictive coding scheme and transform coding scheme, respectively. For deep tools, there have been several proposed techniques using deep learning to perform intra-picture prediction, inter-picture prediction, cross-channel prediction, probability distribution prediction, transform, post- or in-loop filtering, down- and up-sampling, as well as encoding optimizations. According to the newest reports, deep schemes have achieved comparable or even higher compression efficiency than the state-of-the-art traditional schemes, such as High Efficiency Video Coding (HEVC) based scheme, for image coding;deep tools have demonstrated the compression capability beyond HEVC for video coding. However, deep schemes have not yet reached the current height of HEVC for video coding, and deep tools remain largely unexplored at many aspects including the tradeoff between compression efficiency and encoding/decoding complexity, the optimization for perceptual naturalness or semantic quality, the speciality and universality, the federated design of multiple deep tools, and so on. In the hope of advocating the research of deep learning-based video coding, we present a case study of our developed prototype video codec, namely Deep Learning Vi

关键词： Video signal processing

W-net: Simultaneous segmentation of multi-anatomical retinal structures using a multi-task deep neural network

学校读者我要写书评

暂无评论

arXiv 2020年

作者： Zhao, Hongwei Peng, Chengtao Liu, Lei Li, Bin School of Information Science and Technology University of Science and Technology of China Hefei Anhui230022 China Department of Precision Machinery and Instrumentation University of Science and Technology of China HefeiAnhui230022 China CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System University of Science and Technology of China Hefei Anhui230026 China

Segmentation of multiple anatomical structures is of great importance in medical image analysis. In this study, we proposed a W-net to simultaneously segment both the optic disc (OD) and the exudates in retinal images based on the multi-task learning (MTL) scheme. We introduced a class-balanced loss and a multi-task weighted loss to alleviate the imbalanced problem and to improve the robustness and generalization property of the W-net. We demonstrated the effectiveness of our approach by applying five-fold cross-validation experiments on two public datasets e_ophtha_EX and DiaRetDb1. We achieved F1-score of 94.76% and 95.73% for OD segmentation, and 92.80% and 94.14% for exudates segmentation. To further prove the generalization property of the proposed method, we applied the trained model on the DRIONS-DB dataset for OD segmentation and on the MESSIDOR dataset for exudate segmentation. Our results demonstrated that by choosing the optimal weights of each task, the MTL based W-net outperformed separate models trained individually on each task. Code and pre-trained models will be available at: https://***/FundusResearch/MTL_for_OD_and_***. Copyright © 2020, The Authors. All rights reserved.

关键词： Deep neural networks