—Inspired by the recent advances of image super-resolution using convolutional neural network (CNN), we propose a CNN-based block up-sampling scheme for intra frame coding. A block can be down-sampled before being co...
详细信息
Two approximations, center-beam approximation and reference digital elevation model (DEM) approximation, are used in synthetic aperture radar (SAR) motion compensation procedures. They usually introduce residual m...
详细信息
Two approximations, center-beam approximation and reference digital elevation model (DEM) approximation, are used in synthetic aperture radar (SAR) motion compensation procedures. They usually introduce residual motion compensation errors for airborne single-antenna SAR imaging and SAR interferometry. In this paper, we investigate the effects of residual uncompensated motion errors, which are caused by the above two approximations, on the performance of airborne along-track interferometric SAR (ATI-SAR). The residual uncompensated errors caused by center-beam approximation in the absence and in the presence of elevation errors are derived, respectively. Airborne simulation parameters are used to verify the correctness of the analysis and to show the impacts of residual uncompensated errors on the interferometric phase errors for ATI-SAR. It is shown that the interferometric phase errors caused by the center-beam approximation with an accurate DEM could be neglected, while the interferometric phase errors caused by the center-beam approximation with an inaccurate DEM cannot be neglected when the elevation errors exceed a threshold. This research provides theoretical bases for the error source analysis and signal processing of airborne ATI-SAR.
Recently high-level pose features (HLPF) have been shown to be efficient for action recognition in joint-annotated tasks. However, the relative positions between pairs of joints in actual situations and the spatio-tem...
详细信息
ISBN:
(纸本)9781509015535
Recently high-level pose features (HLPF) have been shown to be efficient for action recognition in joint-annotated tasks. However, the relative positions between pairs of joints in actual situations and the spatio-temporal information are not considered in constructing HLPF. To tackle their problems, we propose a set of novel high-level pose features (NHLPF). Specifically, considering that the distances between adjacent pairs of joints usually remain unchanged, we propose a horizontally relative position feature and a vertically relative position feature. In addition, a joint inner product feature is proposed to code the spatialinformation among each triplet of joints. To code temporal information, we calculate the trajectories of the above-mentioned three types of features as corresponding trajectory features. Furthermore, to combine the spatial and temporal information, we present a joint energy change feature, which is designed using observations of the magnitude and direction of the force between joints. We evaluate our NHLPF on a benchmark dataset. The results show that NHPLF are superior features for action recognition.
With the explosive growth in the number of mobile terminals, the demand for visual communication with mobility is increasing. However, traditional solutions for mobility over IP network cannot always meet the demand o...
详细信息
ISBN:
(纸本)9781509053179
With the explosive growth in the number of mobile terminals, the demand for visual communication with mobility is increasing. However, traditional solutions for mobility over IP network cannot always meet the demand of satisfying visual communication. Named Data Networking (NDN) is a new communication model that aims to replace IP model brings a different background to mobile visual communication problems. In this paper, we take advantage of the NDN model to realize seamless mobile visual communication. We introduce a delegate with calculation functions and a globally unique identifier (GUID) which can provide native identity indication into the NDN mechanism. The use of GUID benefits real-time applications like visual communication and further works with the delegate to decrease unnecessary routing update. We also specify the naming rule and design a FIB+ to support seamless mobile visual communication. To test the performance of our solutions, we build a proof-of-concept prototype and run experiments on it. The experiments demonstrate that our solution can provide real-time video communication with seamless mobility experience.
Part-based trackers have achieved promising performance in many tracking tasks. However, most part-based trackers use the same feature representation for all parts and simply combine them together to form an integral ...
详细信息
ISBN:
(纸本)9781509053179
Part-based trackers have achieved promising performance in many tracking tasks. However, most part-based trackers use the same feature representation for all parts and simply combine them together to form an integral representation for the tracking target. It may not guarantee that all parts of the tracking target can well distinguish the foreground from the background. Better performance is expected by exploring different feature representations on different parts of the tracking target. In this paper, following the framework of the classic Compressive Tracker (CT), we model each part of the target adaptively by using a multi-dimensional color representation. By using color name, we select the color feature presentation that best distinguishes the foreground from background. In order to better handle deformation and illumination change, we use multi-Gaussian to model different appearance changes of the tracking target. Both qualitative and quantitative evaluations demonstrate that the proposed method makes a consistent performance improvement compared with the conventional Compressive Tracker on tracking benchmark dataset. Besides, it also outperforms many state-of-the-art trackers while running at averagely 20 frames per second (FPS).
In this paper, we consider video communication over fading channel, where the perfect instantaneous channel state information (CSI) is available at both sender and receiver. Most of existing coding schemes are ineffic...
详细信息
ISBN:
(纸本)9781479953424
In this paper, we consider video communication over fading channel, where the perfect instantaneous channel state information (CSI) is available at both sender and receiver. Most of existing coding schemes are inefficient in this communication scenario. The reason is that for digital coding scheme, it has high coding efficiency but unavoidably leads to the cliff effect;while for analog scheme, it has graceful video quality variation with channel varying, but has low coding efficiency. Hence, to integrate the advantages of digital coding and analog coding, we propose a hybrid digital-analog (HDA) scheme. In our scheme, we have adopted adaptive power allocation and adaptive forward error coding (FEC) in digital part to accommodate instantaneous channel quality. The evaluation results show that the proposed HDA scheme outperforms ParCast (a state-of-the-art analog scheme) 0.3~2.2dB under the channel Signal-to-Noise Ratio (SNR) from 3dB to 20dB.
InSAR interferogram quality assessment is a key step for the using of interferogram map. Traditionally, the interferogram is qualitatively assessed visually and quantitatively assessed by the number of residues. Howev...
详细信息
InSAR interferogram quality assessment is a key step for the using of interferogram map. Traditionally, the interferogram is qualitatively assessed visually and quantitatively assessed by the number of residues. However, the important structure information is hardly quantifiable. This paper presents a novel index to evaluate the quality of InSAR interferogram based on connected area. After discomposing the fringes into independent connected areas, we analyze the statistical ratio of an area to its margin. Then we use the ratio as an index to quantitatively evaluate the interferogram. In the end, the presented index is used for the filtered interferogram of popular filters, and the results fit the visual judging.
Ionosphere is an important factor in highresolution spaceborne synthetic aperture radar(SAR) and geosynchronous(geo) SAR. An approach based on point target deviation between range sub-images is proposed in this pa...
详细信息
Ionosphere is an important factor in highresolution spaceborne synthetic aperture radar(SAR) and geosynchronous(geo) SAR. An approach based on point target deviation between range sub-images is proposed in this paper to estimate and correct the ionosphere. Due to the dispersion effect, the ionosphere causes propagation delay deviation in range subimages with different carrier frequency. This deviation can be used to estimate the total electron content(TEC) along the propagation path, and then the ionospheric effects can be corrected according to the signal model deduced out in this paper. The simulation results show that our approach is valid and robust.
Content-aware image retargeting has attracted substantial research interests in the related research community. However, so far there is still no method can preserve important image contents and structure well without...
详细信息
ISBN:
(纸本)9781479999897
Content-aware image retargeting has attracted substantial research interests in the related research community. However, so far there is still no method can preserve important image contents and structure well without introducing deformation. To address this problem, we propose a Saliency & Structure Preserving Multi-operator (SSPM) method. SSPM classifies images into three categories utilizing SIFT density to improve performance of saliency preservation, helping to mitigate negative influence from center-bias property of most existing saliency detection models. SSPM also employs different principles to improve structure preservation performance, including Earth Mover's Distance (EMD) and Gray-Level Cooccurrence Matrix (GLCM) to get optimal operator sequences for smart content-aware image retargeting. SSPM method not only can well preserve salient contents and structure, but also can greatly improve deformation resilience. Experimental results demonstrated that our method outperforms state-of-art image retargeting methods.
This paper addresses the issue on how to more effectively coordinate the depth with RGB aiming at boosting the performance of RGB-D object detection. Particularly, we investigate two primary ideas under the CNN model:...
详细信息
This paper addresses the issue on how to more effectively coordinate the depth with RGB aiming at boosting the performance of RGB-D object detection. Particularly, we investigate two primary ideas under the CNN model: property derivation and property fusion. Firstly, we propose that the depth can be utilized not only as a type of extra information besides RGB but also to derive more visual properties for comprehensively describing the objects of interest. So a two-stage learning framework consisting of property derivation and fusion is constructed. Here the properties can be derived either from the provided color/depth or their pairs (e.g. the geometry contour adopted in this paper). Secondly, we explore the fusion method of different properties in feature learning, which is boiled down to, under the CNN model, from which layer the properties should be fused together. The analysis shows that different semantic properties should be learned separately and combined before passing into the final classifier. Actually, such a detection way is in accordance with the mechanism of the primary neural cortex (V1) in brain. We experimentally evaluate the proposed method on the challenging dataset, and have achieved state-of-the-art performance.
暂无评论