Search Results - Inner Mongolia University Library

arXiv, 2019

Authors: Zhou, Wei; Shi, Likun; Chen, Zhibo. CAS Key Laboratory of Technology in Geo-Spatial Information Processing and Application System, University of Science and Technology of China, Hefei 230027

Light field image (LFI) quality assessment is becoming increasingly important, as it helps to better guide the acquisition, processing and application of immersive media. However, due to the inherent high-dimensional characteristics of LFI, LFI quality assessment becomes a multi-dimensional problem that requires consideration of the quality degradation in both spatial and angular dimensions. Therefore, we propose a novel Tensor oriented No-reference Light Field image Quality evaluator (Tensor-NLFQ) based on tensor theory. Specifically, since the LFI is regarded as a low-rank 4D tensor, the principal components of four oriented sub-aperture view stacks are obtained via Tucker decomposition. Then, the Principal Component Spatial Characteristic (PCSC) is designed to measure the spatial-dimensional quality of LFI considering its global naturalness and local frequency properties. Finally, the Tensor Angular Variation Index (TAVI) is proposed to measure angular consistency quality by analyzing the structural similarity distribution between the first principal component and each view in the view stack. Extensive experimental results on four publicly available LFI quality databases demonstrate that the proposed Tensor-NLFQ model outperforms state-of-the-art 2D, 3D, multi-view, and LFI quality assessment algorithms. Copyright © 2019, The Authors. All rights reserved.

Keywords: Tensors

Reinforced Bit Allocation under Task-Driven Semantic Distortion Metrics

arXiv, 2019

Authors: Shi, Jun; Chen, Zhibo. CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System, University of Science and Technology of China, Hefei, China

Rapidly growing intelligent applications require optimized bit allocation in image/video coding to support specific task-driven scenarios such as detection, classification, segmentation, etc. Some learning-based frameworks have been proposed for this purpose due to their inherent end-to-end optimization mechanisms. However, it is still quite challenging to integrate these task-driven metrics seamlessly into the traditional hybrid coding framework. To the best of our knowledge, this paper is the first work to address this challenge with a reinforcement learning (RL) approach. Specifically, we formulate the bit allocation problem as a Markovian Decision Process (MDP) and train RL agents to automatically decide the quantization parameter (QP) of each coding tree unit (CTU) for HEVC intra coding, according to the task-driven semantic distortion metrics. This bit allocation scheme can maximize the semantic level fidelity of the task, such as classification accuracy, while minimizing the bit-rate. We also employ gradient class activation map (Grad-CAM) and Mask R-CNN tools to extract task-related importance maps to help the agents make decisions. Extensive experimental results demonstrate the superior performance of our approach by achieving 43.1% to 73.2% bit-rate saving over the anchor of HEVC under the equivalent task-related distortions. Copyright © 2019, The Authors. All rights reserved.

Keywords: Reinforcement learning

Improving Semantic Segmentation via Label Propagation and Temporal Consistency

IEEE International Conference on Signal, Information and Data Processing (ICSIDP)

Authors: Feiyu Qin; Lumeng Cao; Xuejin Chen. CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System, University of Science and Technology of China, Hefei, China

ISBN (digital): 9781728123455

ISBN (print): 9781728123462

Semantic segmentation is a fundamental task in indoor scene understanding. Most previous supervised approaches rely on densely annotated image data sets. Due to the limited number of images with segmentation labels, the performance of existing networks is greatly limited. In this paper, we exploit temporal correlation in video frames to improve the performance and robustness of segmentation networks. Two effective learning strategies are proposed to propagate the information from a few labeled frames to their immediate neighbor frames. First, we scale up the training dataset for supervised semantic segmentation networks by generating pseudo ground-truth for neighboring frames from a labeled frame using filtered homography transformation. Furthermore, we introduce a self-supervised loss function to ensure temporal consistency between the segmentation results of adjacent frames. The experimental results demonstrate that our proposed method outperforms state-of-the-art techniques for semantic segmentation on the NYU-Depth V2 dataset.

Edge-Guided Panoramic Video Stitching with Limited Overlap

IEEE International Conference on Signal, Information and Data Processing (ICSIDP)

Authors: Chaoyu Xie; Xuejin Chen. CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System, University of Science and Technology of China, Hefei, China

ISBN (digital): 9781728123455

ISBN (print): 9781728123462

Video stitching remains a challenging problem in computer vision. In this paper, we propose a novel edge-guided method to stitch multiple videos that have small overlapped regions. Our algorithm consists of three steps: (1) spherical projection of the input video frames based on camera calibration, (2) edge detection and edge-guided feature matching for video registration, and (3) seam optimization to eliminate distortions and ghosts in the composited panoramic videos. The experimental results and user studies demonstrate that our method is robust to videos that have small overlapped regions and produces more visually pleasing panoramic videos than state-of-the-art techniques.

Deep Scalable Image Compression via Hierarchical Feature Decorrelation

Picture Coding Symposium, PCS

Authors: Zongyu Guo; Zhizheng Zhang; Zhibo Chen. CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System, University of Science and Technology of China, Hefei, China

The following topics are dealt with: video coding; data compression; image coding; convolutional neural nets; decoding; learning (artificial intelligence); motion compensation; video codecs; image reconstruction; filt...

Video-based point cloud compression artifact removal

arXiv, 2021

Authors: Akhtar, Anique; Gao, Wen; Li, Li; Li, Zhu; Jia, Wei; Liu, Shan. Department of Computer Science and Electrical Engineering, University of Missouri-Kansas City, Kansas City, MO 64110, United States; Tencent America, 661 Bryant St., Palo Alto, CA 94301, United States; Department of Computer Science and Electrical Engineering, University of Missouri-Kansas City, Kansas City, MO 64110, United States; CAS Key Laboratory of Technology in Geo-Spatial Information Processing and Application System, University of Science and Technology of China, Hefei 230027, China

Photo-realistic point cloud capture and transmission are the fundamental enablers for immersive visual communication. The coding process of dynamic point clouds, especially video-based point cloud compression (V-PCC) developed by the MPEG standardization group, is now delivering state-of-the-art performance in compression efficiency. V-PCC is based on the projection of the point cloud patches to 2D planes and encoding the sequence as 2D texture and geometry patch sequences. However, the resulting quantization errors from coding can introduce compression artifacts, which can be very unpleasant for the quality of experience (QoE). In this work, we developed a novel out-of-the-loop point cloud geometry artifact removal solution that can significantly improve reconstruction quality without additional bandwidth cost. Our novel framework consists of a point cloud sampling scheme, an artifact removal network, and an aggregation scheme. The point cloud sampling scheme employs a cube-based neighborhood patch extraction to divide the point cloud into patches. The geometry artifact removal network then processes these patches to obtain artifact-removed patches. The artifact-removed patches are then merged together using an aggregation scheme to obtain the final artifact-removed point cloud. We employ 3D deep convolutional feature learning for geometry artifact removal that jointly recovers both the quantization direction and the quantization noise level by exploiting projection and quantization prior. The simulation results demonstrate that the proposed method is highly effective and can considerably improve the quality of the reconstructed point cloud. Copyright © 2021, The Authors. All rights reserved.

Keywords: geometry

Multi-Channel Spaceborne SAR Imaging Method for Maritime Scenarios

Sensor Array and Multichannel Signal Processing Workshop

Authors: Xiaolan Qiu; Junying Yang; Mingyang Shang; Lihua Zhong; Chibiao Ding. Key Laboratory of Technology in GeoSpatial Information Processing and Application Systems, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, China; National Key Laboratory of Microwave Imaging Technology, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, China

ISBN (digital): 9781728119465

ISBN (print): 9781728119472

The spaceborne SAR is required to fulfill the increasing demands for improved spatial resolution and wider swath coverage in recent years. The azimuth multi-channel SAR system is a typical technique adopted for realizing high-resolution and wide-swath (HRWS) simultaneously. The flexibility of this system also provides favorable conditions for moving target detection. Aiming at the problems of moving target detection, velocity estimation, and imaging of SAR maritime scenes in the spaceborne multi-channel system, this paper proposes a set of processes based on coarse imaging, detection, velocity estimation and refocusing. The experimental results of the simulation verify the effectiveness of the method.

Keywords: Azimuth; Conferences; Imaging; Estimation; Object detection; Radar polarimetry; Doppler effect

SAR data processing for GF3

12th European Conference on Synthetic Aperture Radar, EUSAR 2018

Authors: Bing, Han; Lihua, Zhong; Jiayin, Liu; Xiaolan, Qiu; Yuxin, Hu; Bin, Lei. Key Laboratory of Technology in Geo-spatial Information Processing and Application Systems, Institute of Electronics, Chinese Academy of Sciences, Beijing, China

ISBN (print): 9783800746361

The Gaofen-3 (GF3) data processor was developed as a workstation-based GF3 synthetic aperture radar (SAR) data processing system. The processor consists of two subsystems of the GF3 ground segment, which are referred to as the data ingesting subsystem (DDS) and the product generation subsystem (PGS). The primary purpose of the DDS is to record and catalogue GF3 raw data with transferring format, and the PGS is to produce slant range or geocoded imagery from the signal data. This paper provides an overview of the GF3 data processor, including descriptions of the system architecture, the imagery generating procedures for different imaging modes and the output format. Some processing results will also be given here. © VDE VERLAG GMBH • Berlin • Offenbach.

Keywords: Synthetic aperture radar

W-net: Simultaneous segmentation of multi-anatomical retinal structures using a multi-task deep neural network

arXiv, 2020

Authors: Zhao, Hongwei; Peng, Chengtao; Liu, Lei; Li, Bin. School of Information Science and Technology, University of Science and Technology of China, Hefei, Anhui 230022, China; Department of Precision Machinery and Instrumentation, University of Science and Technology of China, Hefei, Anhui 230022, China; CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System, University of Science and Technology of China, Hefei, Anhui 230026, China

Segmentation of multiple anatomical structures is of great importance in medical image analysis. In this study, we proposed a W-net to simultaneously segment both the optic disc (OD) and the exudates in retinal images based on the multi-task learning (MTL) scheme. We introduced a class-balanced loss and a multi-task weighted loss to alleviate the imbalanced problem and to improve the robustness and generalization property of the W-net. We demonstrated the effectiveness of our approach by applying five-fold cross-validation experiments on two public datasets e_ophtha_EX and DiaRetDb1. We achieved F1-score of 94.76% and 95.73% for OD segmentation, and 92.80% and 94.14% for exudates segmentation. To further prove the generalization property of the proposed method, we applied the trained model on the DRIONS-DB dataset for OD segmentation and on the MESSIDOR dataset for exudate segmentation. Our results demonstrated that by choosing the optimal weights of each task, the MTL based W-net outperformed separate models trained individually on each task. Code and pre-trained models will be available at: https://***/FundusResearch/MTL_for_OD_and_***. Copyright © 2020, The Authors. All rights reserved.

Keywords: Deep neural networks

3D reconstruction and error analysis of multi-view space-borne SAR images under different configurations

IET International Radar Conference 2018, IRC 2018

Authors: Wang, Chao; Qiu, Xiaolan; Li, Fangfang; Lei, Bin. University of the Chinese Academy of Sciences, Beijing, China; Institute of Electrics, Chinese Academy of Sciences, Beijing, China; Key Laboratory of Technology in Geo-spatial Information Processing and Application System, Institute of Electrics, Chinese Academy of Sciences, Beijing, China

Synthetic aperture radar (SAR) has a good ability to detect the microwave scattering characteristics of the target and has a good capability of slant range Doppler positioning. Using multi-view SAR images in combination with image matching and positioning equations, the 3D position of the target can be obtained. In the past, multi-view 3D reconstruction mainly used side-looking images with parallel trajectories but different incident angles. That method is not universal for different configurations and lacks analysis of the relationship between solution and parameter error. This study aims at the problem of multi-view SAR 3D reconstruction. The authors establish general 3D reconstruction equations that can be used for non-parallel track and non-side-looking configurations, and the solution method is deduced. Based on this, an analysis method of error sensitivity is proposed and the dependence relationship between the reconstruction error and configuration, velocity, position, and slant range is analysed. The correctness of this method is verified by simulation experiments, which provides guidance for selection of configuration and sensor accuracy when applying 3D reconstruction. © 2019 Institution of Engineering and Technology. All rights reserved.

Keywords: Synthetic aperture radar
