检索结果-内蒙古大学图书馆

IEEE International geoscience and Remote Sensing Symposium

作者： Niangang Jiao Feng wang Hongjian You Kun Hu Mudan Yang Key Laboratory of Technology in Geo-Spatial Information Processing and Application Systems Institute of Electronics Beijing China

geo-positioning accuracy improvement is one of the most important step of remote sensing image preprocessing. Traditional methods require a large number of ground control points (GCPs) which consuming lots of manpower and financial resources. With the resolution up to 0.8m, the original geo-positioning accuracy of the Chinese Gaofen (GF-2) multi-angle imagery is about 90m which means a limited application in geometric processing. In this paper, we propose a new method to improve the geometric performance of the multi-angle satellite imagery based on the geometric error sources of this experimental dataset without GCPs. Under the condition of weak intersection of our test dataset, we use a DEM-assisted approach to acquire a more accurate initial position accuracy of all tie points, and all extracted data is clustered by the Density based spatial clustering of applications with noise (DBSCAN) algorithm in order to eliminate points or impages with large positioning error automatically. Then, the error-based block adjustment model are proposed and investigated to improved the geometric performance of the experimental dataset. Based on our proposed method, 142 multi-angle GF-2 satellite images covering the western Beijing area are experimented and the root mean square error (RMSE) of the geometric accuracy is improved up to about 12m in plane and 6m in height, which shows a significantly improvement in geo-positioning accuracy of these multi-angle GF-2 remote sensing imagery.

关键词： Satellites Image resolution Mathematical model Unified modeling language Remote sensing Clustering algorithms Cameras

来源：评论

学校读者我要写书评

暂无评论

Tensor oriented no-reference light field image quality assessment

arXiv

引用

arXiv 2019年

作者： Zhou, Wei Shi, Likun Chen, Zhibo Cas Key Laboratory of Technology in Geo-Spatial Information Processing and Application System University of Science and Technology of China Hefei230027

Light field image (LFI) quality assessment is becoming more and more important, which helps to better guide the acquisition, processing and application of immersive media. However, due to the inherent high dimensional characteristics of LFI, the LFI quality assessment turns into a multi-dimensional problem that requires consideration of the quality degradation in both spatial and angular dimensions. Therefore, we propose a novel Tensor oriented No-reference Light Field image Quality evaluator (Tensor-NLFQ) based on tensor theory. Specifically, since the LFI is regarded as a low-rank 4D tensor, the principle components of four oriented sub-aperture view stacks are obtained via Tucker decomposition. Then, the Principal Component spatial Characteristic (PCSC) is designed to measure the spatial-dimensional quality of LFI considering its global naturalness and local frequency properties. Finally, the Tensor Angular Variation Index (TAVI) is proposed to measure angular consistency quality by analyzing the structural similarity distribution between the first principal component and each view in the view stack. Extensive experimental results on four publicly available LFI quality databases demonstrate that the proposed Tensor-NLFQ model outperforms state-of-the-art 2D, 3D, multi-view, and LFI quality assessment algorithms. Copyright © 2019, The Authors. All rights reserved.

关键词： Tensors

来源：评论

学校读者我要写书评

暂无评论

No-Reference Light Field Image Quality Assessment Based on Micro-Lens Image

No-Reference Light Field Image Quality Assessment Based on M...

引用

Picture Coding Symposium, PCS

作者： Ziyuan Luo Wei Zhou Likun Shi Zhibo Chen CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System University of Science and Technology of China Hefei China

Light field image quality assessment (LF-IQA) plays a significant role due to its guidance to Light Field (LF) contents acquisition, processing and application. The LF can be represented as 4-D signal, and its quality depends on both angular consistency and spatial quality. However, few existing LF-IQA methods concentrate on effects caused by angular inconsistency. Especially, no-reference methods lack effective utilization of 2D angular information. In this paper, we focus on measuring the 2-D angular consistency for LF-IQA. The Micro-Lens Image (MLI) refers to the angular domain of the LF image, which can simultaneously record the angular information in both horizontal and vertical directions. Since the MLI contains 2D angular information, we propose a No-Reference Light Field image Quality assessment model based on MLI (LF-QMLI). Specifically, we first utilize Global Entropy Distribution (GED) and Uniform Local Binary Pattern descriptor (ULBP) to extract features from the MLI, and then pool them together to measure angular consistency. In addition, the information entropy of SubAperture Image (SAI) is adopted to measure spatial quality. Extensive experimental results show that LF-QMLI achieves the state-of-the-art performance.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Reinforced Bit Allocation under Task-Driven Semantic Distortion Metrics

arXiv

引用

arXiv 2019年

作者： Shi, Jun Chen, Zhibo CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System University of Science and Technology of China Hefei China

Rapid growing intelligent applications require optimized bit allocation in image/video coding to support specific task-driven scenarios such as detection, classification, segmentation, etc. Some learning-based frameworks have been proposed for this purpose due to their inherent end-to-end optimization mechanisms. However, it is still quite challenging to integrate these task-driven metrics seamlessly into traditional hybrid coding framework. To the best of our knowledge, this paper is the first work trying to solve this challenge based on reinforcement learning (RL) approach. Specifically, we formulate the bit allocation problem as a Markovian Decision Process (MDP) and train RL agents to automatically decide the quantization parameter (QP) of each coding tree unit (CTU) for HEVC intra coding, according to the task-driven semantic distortion metrics. This bit allocation scheme can maximize the semantic level fidelity of the task, such as classification accuracy, while minimizing the bit-rate. We also employ gradient class activation map (Grad-CAM) and Mask R-CNN tools to extract task-related importance maps to help the agents make decisions. Extensive experimental results demonstrate the superior performance of our approach by achieving 43.1% to 73.2% bit-rate saving over the anchor of HEVC under the equivalent task-related distortions. Copyright © 2019, The Authors. All rights reserved.

关键词： Reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

Deep Grammatical Multi-classifier for Continuous Sign Language Recognition

Deep Grammatical Multi-classifier for Continuous Sign Langua...

引用

IEEE International Conference on Multimedia Big Data (BigMM)

作者： Chengcheng Wei Wengang Zhou Junfu Pu Houqiang Li CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System University of Science and Technology of China Hefei China

In this paper, we propose a novel deep architecture with multiple classifiers for continuous sign language recognition. Representing the sign video with a 3D convolutional residual network and a bidirectional LSTM, we formulate continuous sign language recognition as a grammatical-rule-based classification problem. We first split a text sentence of sign language into isolated words and n-grams, where an n-gram is a sequence of consecutive n words in a sentence. Then, we propose a word-independent classifiers (WIC) module and an n-gram classifier (NGC) module to identify the words and n-grams in a sentence, respectively. A greedy decoding algorithm is employed to integrate words and n-grams into the sentence based on the confidence scores provided by both modules. Our method is evaluated on a Chinese continuous sign language recognition benchmark, and the experimental results demonstrate its effectiveness and superiority.

关键词： Videos Assistive technology Gesture recognition Feature extraction Task analysis Decoding Cats

来源：评论

学校读者我要写书评

暂无评论

Deep Scalable Image Compression via Hierarchical Feature Decorrelation

Deep Scalable Image Compression via Hierarchical Feature Dec...

引用

Picture Coding Symposium, PCS

作者： Zongyu Guo Zhizheng Zhang Zhibo Chen CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System University of Science and Technology of China Hefei China

The following topics are dealt with: video coding; data compression; image coding; convolutional neural nets; decoding; learning (artificial intelligence); motion compensation; video codecs; image reconstruction; filt...

关键词：

来源：评论

学校读者我要写书评

暂无评论

Improving Semantic Segmentation via Label Propagation and Temporal Consistency

Improving Semantic Segmentation via Label Propagation and Te...

引用

Signal, information and Data processing (ICSIDP), IEEE International Conference on

作者： Feiyu Qin Lumeng Cao Xuejin Chen CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System University of Science and Technology of China Hefei China

ISBN: (数字)9781728123455

ISBN: (纸本)9781728123462

Semantic segmentation is a fundamental task in indoor scene understanding. Most previous supervised approaches rely on densely annotated image data sets. Due to the limited amount of images with segmentation labels, the performance of existing networks is greatly limited. In this paper, we exploit temporal correlation in video frames to improve the performance and robustness of segmentation networks. Two effective learning strategies are proposed to propagate the information from a few labeled frames to their immediate neighbor frames. First, we scale up training dataset for supervised semantic segmentation networks by generating pseudo ground-truth for neighboring frames from a labeled frame using filtered homography transformation. Furthermore, we introduce a self-supervised loss function to ensure temporal consistency between the segmentation results of adjacent frames. The experimental results demonstrate that our proposed method outperforms state-of-the-art techniques for semantic segmentation on NYU-Depth V2 dataset.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Quality Assessment of Stereoscopic 360-degree Images from Multi-viewports

Quality Assessment of Stereoscopic 360-degree Images from Mu...

引用

Picture Coding Symposium, PCS

作者： Jiahua Xu Ziyuan Luo Wei Zhou Wenyuan Zhang Zhibo Chen CAS Key Laboratory of Technology in Geo-Spatial Information Processing and Application System University of Science and Technology of China Hefei China

Objective quality assessment of stereoscopic panoramic images becomes a challenging problem owing to the rapid growth of 360-degree contents. Different from traditional 2D image quality assessment (IQA), more complex aspects are involved in 3D omnidirectional IQA, especially unlimited field of view (FoV) and extra depth perception, which brings difficulty to evaluate the quality of experience (QoE) of 3D omnidirectional images. In this paper, we propose a multi-viewport based full-reference stereo 360 IQA model. Due to the freely changeable viewports when browsing in the head-mounted display, our proposed approach processes the image inside FoV rather than the projected one such as equirectangular projection (ERP). In addition, since overall QoE depends on both image quality and depth perception, we utilize the features estimated by the difference map between left and right views which can reflect disparity. The depth perception features along with binocular image qualities are employed to further predict the overall QoE of 3D 360 images. The experimental results on our public Stereoscopic OmnidirectionaL Image quality assessment Database (SOLID) show that the proposed method achieves a significant improvement over some well-known IQA metrics and can accurately reflect the overall QoE of perceived images.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Robust Beamformer based on Magnitude Response Constraint and Sparse Constraint

Robust Beamformer based on Magnitude Response Constraint and...

引用

2019 IEEE International Conference on Signal, information and Data processing, ICSIDP 2019

作者： Lei, Songlin Qiu, Xiaolan DIng, Chibiao Zhang, Yueting Aerospace Information Research Institute Chinese Academy of Sciences Beijing China University of Chinese Academy of Sciences Beijing100049 China Key Laboratory of Technology in Geo-spatial Information Processing and Application System CAS China Institute of Electronics Chinese Academy of Sciences Beijing100190 China

ISBN: (纸本)9781728123455

Beamformer with magnitude response constraint can flexibly control the response region by specified beamwidth and response ripple, which has a significant performance against steering vector mismatch. However, a high sidelobe level of the beam is accompanied, resulting in performance degradation. To solve this problem, a novel robust beamformer based on magnitude response constraint and sparse constraint is proposed. This method adds the sparse constraint, that is, Lp-norm to the beamformer with magnitude response constraint, then the non-convex cost function can be formulated as a semidefinite programming (SDP) problem, finally the matrix decomposition theory is used to get the array weight vector. Simulation results demonstrate that the proposed method can not only produce large controlled region against steering vector mismatch and reduce the sidelobe level of the beampattern, but also achieve good performance in Signal to Interference plus Noise Ratio (SINR) enhancement. © 2019 IEEE.

关键词： Vectors

来源：评论

学校读者我要写书评

暂无评论

Edge-Guided Panoramic Video Stitching with Limited Overlap

Edge-Guided Panoramic Video Stitching with Limited Overlap

引用

Signal, information and Data processing (ICSIDP), IEEE International Conference on

作者： Chaoyu Xie Xuejin Chen CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System University of Science and Technology of China Hefei China

ISBN: (数字)9781728123455

ISBN: (纸本)9781728123462

Video stitching remains a challenging problem in computer vision. In this paper, we propose a novel edge-guided method to stitch multiple videos that have small overlapped regions. Our algorithm consists of three steps: (1) spherical projection of the input video frames based on camera calibration, (2) edge detection and edge-guided feature matching for video registration, and (3) seam optimization to eliminate distortions and ghosts in the composited panoramic videos. The experimental results and user studies demonstrate that our method is robust to videos that have small overlapped regions and produces more visually pleasing panoramic videos than state-of-the-art techniques.

关键词：

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：