The High Efficiency Video Coding (HEVC) with the transform bypass mode is simple but inefficient for lossless coding. For this reason, we propose a novel transform to further eliminate the redundancy between residues ...
详细信息
ISBN:
(纸本)9781479934331
The High Efficiency Video Coding (HEVC) with the transform bypass mode is simple but inefficient for lossless coding. For this reason, we propose a novel transform to further eliminate the redundancy between residues of different blocks in intra prediction. Dependent on intra prediction modes, the proposed transform is adaptable to exploit correlations of residues formed by different modes. In order to accurately obtain parameters of the transform matrix, an approach similar to the Wiener filtering method is adopted. Experimental results show that on top of the lossless coding mode in HEVC, our method offers the performance with a 7.4% bit-rate reduction on average for All Intra Main configuration. Compared with other representative algorithms, our proposal still shows an improvement in the compression ratio, without substantial increases of computational complexity in the encoder or decoder.
Image compression had been extensively studied for reducing coding rate yet producing acceptable visual quality. However, there are many application scenarios where the compressed images are used for automatic recogni...
详细信息
ISBN:
(纸本)9781479973408
Image compression had been extensively studied for reducing coding rate yet producing acceptable visual quality. However, there are many application scenarios where the compressed images are used for automatic recognition rather than human viewing, thus the visual quality is no longer critical for compression. SIFT features have demonstrated their utility in many recognition scenarios and SIFT-preserving compression is developed recently. In this paper, we firstly study the SIFT-preserving compression of license plate images for recognition accuracy rather than visual quality. According to extracted SIFT features, each image is divided into SIFT coding-units and non-SIFT coding-units. Each coding-unit is assigned with a different quality parameter when using JPEG for compression. We compare our proposed scheme with the standard JPEG that uses a unified quality parameter. Experimental results with manually tuned parameters show that on average 14% bit-rate can be saved by our scheme, without any loss of recognition accuracy.
Query difficulty estimation (QDE) attempts to automatically predict the performance of the search results returned for a given query. QDE has been widely investigated in text document retrieval for many years. However...
详细信息
ISBN:
(纸本)9781479947607
Query difficulty estimation (QDE) attempts to automatically predict the performance of the search results returned for a given query. QDE has been widely investigated in text document retrieval for many years. However, few research works have been explored in image retrieval. State-of-the-art QDE methods in image retrieval mainly investigate the statistical characteristics (coherence, robustness, etc.) of the returned images to derive a value for indicating the query difficulty degree. To the best of our knowledge, little research has been done to directly estimate the real retrieval performance of the search results, such as average precision, instead of only an indicator. In this paper, we propose a novel query difficulty estimation approach which automatically estimate the average precision of the image search results. Specifically, we first select a set of query relevant and query irrelevant images for each query via pseudo relevance feedback. Then an efficient and effective voting scheme is proposed to estimate the relevance label of each image in the search results. Based on the images' relevance labels, the average precision of the search results returned for the given query is derived. The experimental results on a benchmark image search dataset demonstrate the effectiveness of the proposed method.
In this article, we present a novel non-local video denoising scheme using low-rank representation and total variation regularization. The proposed scheme attempts to make full use of the intrinsic properties that the...
详细信息
ISBN:
(纸本)9781479934331
In this article, we present a novel non-local video denoising scheme using low-rank representation and total variation regularization. The proposed scheme attempts to make full use of the intrinsic properties that the grouping similar patches not only lie in a low-rank subspace but are also sparse in total variation (TV) domain. For a group of similar patches, we formulate video denoising problem into a concise model that combines nuclear norm, TV regularization and l_1 norm. The experiments demonstrate that the proposed scheme is capable of handling multi-type noise including dense Gaussian noise and random-valued sparse noise, while maintaining the texture information meantime. The results show that our scheme achieves noticeable performance improvement over the state-of-the-art video denoising methods.
Existing approaches for image annotation generally demand training data with exact image labels or human-generated tags, which are often difficult to obtain. In this paper we present a novel model that utilizes the ri...
详细信息
Multiple scattering may render synthetic aperture radar (SAR) image interpretation difficult, particularly when it comes to imaging of man-made structures, which can be modeled as composite scatterers. To isolate diff...
详细信息
Multiple scattering may render synthetic aperture radar (SAR) image interpretation difficult, particularly when it comes to imaging of man-made structures, which can be modeled as composite scatterers. To isolate different scattering mechanism, we designed an airborne SAR experiment based on the high resolution, sub-metric to decimetric range, capabilities full-polarimetric SAR system, CARSS (Chinese Airborne Remote Sensing system), developing by IECAS. The imaging results are quite accord with the theoretical analysis. With polarimetric target decomposition, we can simply distinguish the different scattering mechanism. The idea to interpret the man-made targets as the combination of simple scattering mechanisms is supported by the experiment results.
With the widespread use of mobile devices, the location-based service (LBS) applications become increasingly popular, which introduces the new security challenge to protect user's location privacy. On one hand, a ...
详细信息
This article mainly deals with the control and stability problems of networked Hammerstein with nonlinear input. A novel predictive controller design method is proposed to offset the effect of network delay and data d...
详细信息
ISBN:
(纸本)9781479947249
This article mainly deals with the control and stability problems of networked Hammerstein with nonlinear input. A novel predictive controller design method is proposed to offset the effect of network delay and data dropout. The controller gain which depends on the time delay of the feedback channel is time-variant. Since we assume that the state is not measurable, the control signal is based on the state estimated by the observer. As for the nonlinear part of the input,We assume it satisfies a sector constraint and treat it as a input inaccuracy. Theoretical results are presented for the closed-loop stability by modeling the system as time-delay Hammerstein system with nonlinear inputs. A second-order Hammerstein system is implemented to show the enhanced performance of this control method.
3D reconstruction from multiple-view images has drawn a lot of attentions in computer graphics and computer vision communities. Traditional techniques usually end at discrete 3D point clouds computed from feature corr...
详细信息
3D reconstruction from multiple-view images has drawn a lot of attentions in computer graphics and computer vision communities. Traditional techniques usually end at discrete 3D point clouds computed from feature correspondence. However, geometric structure remains ambiguous in these unstructured point clouds, especially for man-made objects like buildings, indoor scenes. This paper proposes a new method to automatically reconstruct the main geometric structure of the scene composed of planar faces. First, dense 3D point clouds are reconstructed by applying patch-based multi-view stereo (PMVS) algorithm [1]. Then 3D planar primitives are extracted using a RANSAC-based approach [2]. We present a novel method to analyze the adjacency relations of the planar primitives to estimate the 3D intersection lines on the corresponding faces. Junctions and polygonal faces are computed from the 3D intersection lines along with complementary image information to compose the topology structure. Finally, texture for each face is extracted from the image under the best view. Experimental results demonstrate the feasibility of our system by successfully reconstructing the main structure of a wide range of scenes and constructing a texture-mapped piecewise-planar 3D model from images in multiple views.
Social networks nowadays have become an important form of communication in which users can post their current status or share their lives by mobile phones or the Web. In this paper, we develop an effective and efficie...
详细信息
Social networks nowadays have become an important form of communication in which users can post their current status or share their lives by mobile phones or the Web. In this paper, we develop an effective and efficient model to estimate continuous tie strength between users for friend recommendation with the heterogeneous data from social media community. We categorize those multimodal data into two classes: interaction data (e.g., comments, marking favorite photos) and similarity data(e.g., common friends, groups, tags, geo, visual). We propose to use asymmetric relationship in the interaction data for tie strength estimation instead of using the conventional symmetric ones. Furthermore, by exploring the behavior of users in a social media community, we find that the tie strength between users can be approximately modeled as a linear function of their social connections. Based on this observation, we propose an effective and highly efficient user specific linear model for the tie strength estimation. The experiments on a popular social network show promising results and demonstrate the effectiveness of our proposed method.
暂无评论