—The explosive increase and ubiquitous accessibility of visual data on the Web have led to the prosperity of research activity in image search or retrieval. With the ignorance of visual content as a ranking clue, met...
详细信息
With the advantage in compact representation and efficient comparison, binary hashing has been extensively investigated for approximate nearest neighbor search. In this paper, we propose a novel and general hashing fr...
详细信息
Users’ download history is a primary data source for analyzing user interests. Recent work has shown that user interests are indeed time varying, and accurate profiling of user interest drifts requires the temporal d...
详细信息
—Inspired by the recent advances of image super-resolution using convolutional neural network (CNN), we propose a CNN-based block up-sampling scheme for intra frame coding. A block can be down-sampled before being co...
详细信息
Convolutional neural networks (CNNs) have achieved stateof-the-art results on many visual recognition tasks. However, current CNN models still exhibit a poor ability to be invariant to spatial transformations of image...
详细信息
With the explosive growth in the number of mobile terminals, the demand for visual communication with mobility is increasing. However, traditional solutions for mobility over IP network cannot always meet the demand o...
详细信息
ISBN:
(纸本)9781509053179
With the explosive growth in the number of mobile terminals, the demand for visual communication with mobility is increasing. However, traditional solutions for mobility over IP network cannot always meet the demand of satisfying visual communication. Named Data Networking (NDN) is a new communication model that aims to replace IP model brings a different background to mobile visual communication problems. In this paper, we take advantage of the NDN model to realize seamless mobile visual communication. We introduce a delegate with calculation functions and a globally unique identifier (GUID) which can provide native identity indication into the NDN mechanism. The use of GUID benefits real-time applications like visual communication and further works with the delegate to decrease unnecessary routing update. We also specify the naming rule and design a FIB+ to support seamless mobile visual communication. To test the performance of our solutions, we build a proof-of-concept prototype and run experiments on it. The experiments demonstrate that our solution can provide real-time video communication with seamless mobility experience.
Recently high-level pose features (HLPF) have been shown to be efficient for action recognition in joint-annotated tasks. However, the relative positions between pairs of joints in actual situations and the spatio-tem...
详细信息
ISBN:
(纸本)9781509015535
Recently high-level pose features (HLPF) have been shown to be efficient for action recognition in joint-annotated tasks. However, the relative positions between pairs of joints in actual situations and the spatio-temporal information are not considered in constructing HLPF. To tackle their problems, we propose a set of novel high-level pose features (NHLPF). Specifically, considering that the distances between adjacent pairs of joints usually remain unchanged, we propose a horizontally relative position feature and a vertically relative position feature. In addition, a joint inner product feature is proposed to code the spatialinformation among each triplet of joints. To code temporal information, we calculate the trajectories of the above-mentioned three types of features as corresponding trajectory features. Furthermore, to combine the spatial and temporal information, we present a joint energy change feature, which is designed using observations of the magnitude and direction of the force between joints. We evaluate our NHLPF on a benchmark dataset. The results show that NHPLF are superior features for action recognition.
In this paper, we consider video communication over fading channel, where the perfect instantaneous channel state information (CSI) is available at both sender and receiver. Most of existing coding schemes are ineffic...
详细信息
ISBN:
(纸本)9781479953424
In this paper, we consider video communication over fading channel, where the perfect instantaneous channel state information (CSI) is available at both sender and receiver. Most of existing coding schemes are inefficient in this communication scenario. The reason is that for digital coding scheme, it has high coding efficiency but unavoidably leads to the cliff effect;while for analog scheme, it has graceful video quality variation with channel varying, but has low coding efficiency. Hence, to integrate the advantages of digital coding and analog coding, we propose a hybrid digital-analog (HDA) scheme. In our scheme, we have adopted adaptive power allocation and adaptive forward error coding (FEC) in digital part to accommodate instantaneous channel quality. The evaluation results show that the proposed HDA scheme outperforms ParCast (a state-of-the-art analog scheme) 0.3~2.2dB under the channel Signal-to-Noise Ratio (SNR) from 3dB to 20dB.
Scattering structure features of targets is of great importance for Synthetic Aperture Radar (SAR) image analysis. In this paper, a novel algorithm for aircraft recognition in high resolution apron area of SAR images ...
详细信息
ISBN:
(纸本)9781509033331
Scattering structure features of targets is of great importance for Synthetic Aperture Radar (SAR) image analysis. In this paper, a novel algorithm for aircraft recognition in high resolution apron area of SAR images is proposed. The algorithm combines the strength of gradient saliency map and scattering structure features to improve accuracy and efficiency. Specially, Constant False-Alarm Rate (CFAR) algorithm is carried out to segment images. Then, a new efficient object locating method based on directional local gradient map is proposed to detect aircraft targets. Then, the candidate slices as well as template slices are modeled using Gaussian Mixture Model (GMM), which will be treated as structure features. In the recognition stage, a novel similarity measurement algorithm based on Kullback-Leibler Divergence for GMM models is proposed for classification. We conduct experiments on the dataset with 3.0m resolution and the recognition results demonstrate the accuracy of our proposed method.
Part-based trackers have achieved promising performance in many tracking tasks. However, most part-based trackers use the same feature representation for all parts and simply combine them together to form an integral ...
详细信息
ISBN:
(纸本)9781509053179
Part-based trackers have achieved promising performance in many tracking tasks. However, most part-based trackers use the same feature representation for all parts and simply combine them together to form an integral representation for the tracking target. It may not guarantee that all parts of the tracking target can well distinguish the foreground from the background. Better performance is expected by exploring different feature representations on different parts of the tracking target. In this paper, following the framework of the classic Compressive Tracker (CT), we model each part of the target adaptively by using a multi-dimensional color representation. By using color name, we select the color feature presentation that best distinguishes the foreground from background. In order to better handle deformation and illumination change, we use multi-Gaussian to model different appearance changes of the tracking target. Both qualitative and quantitative evaluations demonstrate that the proposed method makes a consistent performance improvement compared with the conventional Compressive Tracker on tracking benchmark dataset. Besides, it also outperforms many state-of-the-art trackers while running at averagely 20 frames per second (FPS).
暂无评论