The dominant scattering mechanism is of great significance for ground object classification and target detection. It can also be used to verify the quality of polarimetric data by checking the dominant scattering mechanism of known ground objects. To improve application performance, this paper studies the dominant scattering mechanism of typical ground objects in GF-3 imagery based on a large number of data slices. The GF-3 fully polarimetric data slices are classified according to the MODIS global classification map, and a GF-3 slice library of typical ground objects is constructed. Using a large number of GF-3 samples, we carry out a statistical analysis of the dominant scattering mechanism separation results for typical ground objects (buildings, woodland, cultivated land, grassland and water bodies) by means of the h/alpha/A decomposition. The quantitative results reveal the polarimetric scattering features of different ground objects and provide a reference for fully polarimetric SAR applications.
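As a rough illustration of the h/alpha/A parameters mentioned above, the sketch below computes entropy, mean alpha angle and anisotropy using the standard Cloude-Pottier definitions. It assumes the three eigenvalues of the coherency matrix and the alpha angle of each eigenvector have already been obtained; the function name `h_alpha_a` and its inputs are hypothetical and are not taken from the paper.

```python
import math
from typing import Sequence, Tuple


def h_alpha_a(eigenvalues: Sequence[float],
              alphas_deg: Sequence[float]) -> Tuple[float, float, float]:
    """Entropy H, mean alpha (degrees) and anisotropy A for one pixel.

    eigenvalues: the three non-negative eigenvalues of the coherency matrix.
    alphas_deg:  alpha angle of each matching eigenvector, in degrees.
    Both inputs are assumed to be precomputed (hypothetical interface).
    """
    if len(eigenvalues) != 3 or len(alphas_deg) != 3:
        raise ValueError("exactly three eigenvalues and three alpha angles are required")

    # Sort eigenvalue/alpha pairs by eigenvalue, largest first.
    pairs = sorted(zip(eigenvalues, alphas_deg), key=lambda t: t[0], reverse=True)
    total = sum(lam for lam, _ in pairs)
    if total <= 0:
        raise ValueError("eigenvalues must sum to a positive value")

    # Pseudo-probabilities of the three scattering mechanisms.
    probs = [lam / total for lam, _ in pairs]

    # Entropy uses a base-3 logarithm so that H lies in [0, 1].
    entropy = -sum(p * math.log(p, 3) for p in probs if p > 0)

    # Mean alpha is the probability-weighted average of the alpha angles.
    mean_alpha = sum(p * alpha for p, (_, alpha) in zip(probs, pairs))

    # Anisotropy compares the second and third eigenvalues.
    lam2, lam3 = pairs[1][0], pairs[2][0]
    anisotropy = (lam2 - lam3) / (lam2 + lam3) if (lam2 + lam3) > 0 else 0.0

    return entropy, mean_alpha, anisotropy
```

With three equal eigenvalues the entropy is 1 and the anisotropy is 0, which corresponds to fully random scattering; a single non-zero eigenvalue gives an entropy of 0.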
This study introduces an automatic method for change detection of multi-sensor remote-sensing images (e.g. optical and synthetic aperture radar (SAR) images). As object-based image analysis can effectively reduce the ...
A comprehensive comparison of the trends and drivers of global surface and canopy urban heat islands (termed Is and Ic trends, respectively) is critical for better designing urban heat mitigation strategies. However, ...
In this paper, we propose a novel deep architecture with multiple classifiers for continuous sign language recognition. Representing the sign video with a 3D convolutional residual network and a bidirectional LSTM, we formulate continuous sign language recognition as a grammatical-rule-based classification problem. We first split a text sentence of sign language into isolated words and n-grams, where an n-gram is a sequence of n consecutive words in a sentence. Then, we propose a word-independent classifiers (WIC) module and an n-gram classifier (NGC) module to identify the words and n-grams in a sentence, respectively. A greedy decoding algorithm is employed to integrate words and n-grams into the sentence based on the confidence scores provided by both modules. Our method is evaluated on a Chinese continuous sign language recognition benchmark, and the experimental results demonstrate its effectiveness and superiority.
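The sentence-splitting step described above can be sketched as follows. This is a minimal illustration of extracting every run of n consecutive words from a tokenized sentence; the function name `split_ngrams` and the example glosses are hypothetical, and the classifier and greedy decoding modules of the paper are not reproduced here.

```python
from typing import List, Tuple


def split_ngrams(words: List[str], n: int) -> List[Tuple[str, ...]]:
    """Return every run of n consecutive words, in sentence order."""
    if n <= 0:
        raise ValueError("n must be a positive integer")
    # A sentence shorter than n words yields an empty list.
    return [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]


# Illustrative sentence, already split into isolated words.
sentence = ["I", "want", "water"]
unigrams = split_ngrams(sentence, 1)  # the isolated words
bigrams = split_ngrams(sentence, 2)   # overlapping pairs of adjacent words
```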
Light field image (LFI) quality assessment is becoming more and more important, which helps to better guide the acquisition, processing and application of immersive media. However, due to the inherent high dimensional...
Light field image quality assessment (LF-IQA) plays a significant role in guiding the acquisition, processing and application of Light Field (LF) content. The LF can be represented as a 4-D signal, and its quality depends on both angular consistency and spatial quality. However, few existing LF-IQA methods concentrate on the effects caused by angular inconsistency. In particular, no-reference methods lack effective utilization of 2-D angular information. In this paper, we focus on measuring the 2-D angular consistency for LF-IQA. The Micro-Lens Image (MLI) refers to the angular domain of the LF image, which can simultaneously record the angular information in both horizontal and vertical directions. Since the MLI contains 2-D angular information, we propose a No-Reference Light Field image Quality assessment model based on MLI (LF-QMLI). Specifically, we first utilize the Global Entropy Distribution (GED) and the Uniform Local Binary Pattern descriptor (ULBP) to extract features from the MLI, and then pool them together to measure angular consistency. In addition, the information entropy of the SubAperture Image (SAI) is adopted to measure spatial quality. Extensive experimental results show that LF-QMLI achieves state-of-the-art performance.
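For readers unfamiliar with information entropy as an image statistic, the sketch below computes the Shannon entropy of a flat sequence of pixel intensities. It assumes the plain histogram-based definition; the abstract does not specify how the entropy features of LF-QMLI are actually computed, and the function name `shannon_entropy` is hypothetical.

```python
import math
from collections import Counter
from typing import Iterable


def shannon_entropy(pixels: Iterable[int]) -> float:
    """Shannon entropy, in bits, of the intensity histogram of the pixels."""
    counts = Counter(pixels)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("at least one pixel is required")
    # Sum -p * log2(p) over the intensities that actually occur.
    return -sum((c / total) * math.log2(c / total) for c in counts.values())
```

A constant image has an entropy of 0 bits, while an image split evenly between two intensities has an entropy of 1 bit.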
Objective quality assessment of stereoscopic panoramic images has become a challenging problem owing to the rapid growth of 360-degree content. Unlike traditional 2D image quality assessment (IQA), 3D omnidirectional IQA involves more complex aspects, especially the unlimited field of view (FoV) and extra depth perception, which make it difficult to evaluate the quality of experience (QoE) of 3D omnidirectional images. In this paper, we propose a multi-viewport based full-reference stereo 360 IQA model. Because viewports can be changed freely when browsing in a head-mounted display, our proposed approach processes the image inside the FoV rather than the projected one, such as the equirectangular projection (ERP). In addition, since overall QoE depends on both image quality and depth perception, we utilize features estimated from the difference map between the left and right views, which can reflect disparity. The depth perception features along with binocular image qualities are employed to further predict the overall QoE of 3D 360 images. The experimental results on our public Stereoscopic OmnidirectionaL Image quality assessment Database (SOLID) show that the proposed method achieves a significant improvement over some well-known IQA metrics and can accurately reflect the overall QoE of perceived images.
Rapidly growing intelligent applications require optimized bit allocation in image/video coding to support specific task-driven scenarios such as detection, classification, segmentation, etc. Some learning-based framewo...
ISBN (digital): 9781728123455
ISBN (print): 9781728123462
Semantic segmentation is a fundamental task in indoor scene understanding. Most previous supervised approaches rely on densely annotated image data sets. Because few images with segmentation labels are available, the performance of existing networks is greatly limited. In this paper, we exploit temporal correlation in video frames to improve the performance and robustness of segmentation networks. Two effective learning strategies are proposed to propagate the information from a few labeled frames to their immediate neighbor frames. First, we scale up the training dataset for supervised semantic segmentation networks by generating pseudo ground truth for neighboring frames from a labeled frame using a filtered homography transformation. Furthermore, we introduce a self-supervised loss function to ensure temporal consistency between the segmentation results of adjacent frames. The experimental results demonstrate that our proposed method outperforms state-of-the-art techniques for semantic segmentation on the NYU-Depth V2 dataset.
This study investigated the optimal tracking performance (OTP) of multi-input multi-output (MIMO), discrete-time networked control systems (NCSs). The limits of tracking performance (TP) under the influences of bandwidth, encoding-decoding, and additive coloured Gaussian noise (ACGN) are derived using the techniques of coprime decomposition and all-pass decomposition. The results reveal the negative impact of non-minimum phase (NMP) zeros and unstable poles of the plant, as well as network communication constraints, on the TP of NCSs. Finally, a numerical simulation is presented to verify our conclusions.