Image compression had been extensively studied for reducing coding rate yet producing acceptable visual quality. However, there are many application scenarios where the compressed images are used for automatic recogni...
详细信息
ISBN:
(纸本)9781479973408
Image compression had been extensively studied for reducing coding rate yet producing acceptable visual quality. However, there are many application scenarios where the compressed images are used for automatic recognition rather than human viewing, thus the visual quality is no longer critical for compression. SIFT features have demonstrated their utility in many recognition scenarios and SIFT-preserving compression is developed recently. In this paper, we firstly study the SIFT-preserving compression of license plate images for recognition accuracy rather than visual quality. According to extracted SIFT features, each image is divided into SIFT coding-units and non-SIFT coding-units. Each coding-unit is assigned with a different quality parameter when using JPEG for compression. We compare our proposed scheme with the standard JPEG that uses a unified quality parameter. Experimental results with manually tuned parameters show that on average 14% bit-rate can be saved by our scheme, without any loss of recognition accuracy.
Many recent state-of-the-art image retrieval approaches are based on Bag-of-Visual-Words model and represent an image with a set of visual words by quantizing local SIFT(scale invariant feature transform) features. ...
详细信息
Many recent state-of-the-art image retrieval approaches are based on Bag-of-Visual-Words model and represent an image with a set of visual words by quantizing local SIFT(scale invariant feature transform) features. Feature quantization reduces the discriminative power of local features and unavoidably causes many false local matches between images, which degrades the retrieval accuracy. To filter those false matches, geometric context among visual words has been popularly explored for the verification of geometric consistency. However, existing studies with global or local geometric verification are either computationally expensive or achieve limited accuracy. To address this issue, in this paper, we focus on partialduplicate Web image retrieval, and propose a scheme to encode the spatial context for visual matching verification. An efficient affine enhancement scheme is proposed to refine the verification results. Experiments on partial-duplicate Web image search, using a database of one million images, demonstrate the effectiveness and efficiency of the proposed *** on a 10-million image database further reveals the scalability of our approach.
In this article, we present a novel non-local video denoising scheme using low-rank representation and total variation regularization. The proposed scheme attempts to make full use of the intrinsic properties that the...
详细信息
ISBN:
(纸本)9781479934331
In this article, we present a novel non-local video denoising scheme using low-rank representation and total variation regularization. The proposed scheme attempts to make full use of the intrinsic properties that the grouping similar patches not only lie in a low-rank subspace but are also sparse in total variation (TV) domain. For a group of similar patches, we formulate video denoising problem into a concise model that combines nuclear norm, TV regularization and l_1 norm. The experiments demonstrate that the proposed scheme is capable of handling multi-type noise including dense Gaussian noise and random-valued sparse noise, while maintaining the texture information meantime. The results show that our scheme achieves noticeable performance improvement over the state-of-the-art video denoising methods.
With the widespread use of mobile devices, the location-based service (LBS) applications become increasingly popular, which introduces the new security challenge to protect user's location privacy. On one hand, a ...
详细信息
This article mainly deals with the control and stability problems of networked Hammerstein with nonlinear input. A novel predictive controller design method is proposed to offset the effect of network delay and data d...
详细信息
ISBN:
(纸本)9781479947249
This article mainly deals with the control and stability problems of networked Hammerstein with nonlinear input. A novel predictive controller design method is proposed to offset the effect of network delay and data dropout. The controller gain which depends on the time delay of the feedback channel is time-variant. Since we assume that the state is not measurable, the control signal is based on the state estimated by the observer. As for the nonlinear part of the input,We assume it satisfies a sector constraint and treat it as a input inaccuracy. Theoretical results are presented for the closed-loop stability by modeling the system as time-delay Hammerstein system with nonlinear inputs. A second-order Hammerstein system is implemented to show the enhanced performance of this control method.
3D reconstruction from multiple-view images has drawn a lot of attentions in computer graphics and computer vision communities. Traditional techniques usually end at discrete 3D point clouds computed from feature corr...
详细信息
3D reconstruction from multiple-view images has drawn a lot of attentions in computer graphics and computer vision communities. Traditional techniques usually end at discrete 3D point clouds computed from feature correspondence. However, geometric structure remains ambiguous in these unstructured point clouds, especially for man-made objects like buildings, indoor scenes. This paper proposes a new method to automatically reconstruct the main geometric structure of the scene composed of planar faces. First, dense 3D point clouds are reconstructed by applying patch-based multi-view stereo (PMVS) algorithm [1]. Then 3D planar primitives are extracted using a RANSAC-based approach [2]. We present a novel method to analyze the adjacency relations of the planar primitives to estimate the 3D intersection lines on the corresponding faces. Junctions and polygonal faces are computed from the 3D intersection lines along with complementary image information to compose the topology structure. Finally, texture for each face is extracted from the image under the best view. Experimental results demonstrate the feasibility of our system by successfully reconstructing the main structure of a wide range of scenes and constructing a texture-mapped piecewise-planar 3D model from images in multiple views.
Social networks nowadays have become an important form of communication in which users can post their current status or share their lives by mobile phones or the Web. In this paper, we develop an effective and efficie...
详细信息
Social networks nowadays have become an important form of communication in which users can post their current status or share their lives by mobile phones or the Web. In this paper, we develop an effective and efficient model to estimate continuous tie strength between users for friend recommendation with the heterogeneous data from social media community. We categorize those multimodal data into two classes: interaction data (e.g., comments, marking favorite photos) and similarity data(e.g., common friends, groups, tags, geo, visual). We propose to use asymmetric relationship in the interaction data for tie strength estimation instead of using the conventional symmetric ones. Furthermore, by exploring the behavior of users in a social media community, we find that the tie strength between users can be approximately modeled as a linear function of their social connections. Based on this observation, we propose an effective and highly efficient user specific linear model for the tie strength estimation. The experiments on a popular social network show promising results and demonstrate the effectiveness of our proposed method.
Multiple scattering may render synthetic aperture radar (SAR) image interpretation difficult, particularly when it comes to imaging of man-made structures, which can be modeled as composite scatterers. To isolate diff...
详细信息
Multiple scattering may render synthetic aperture radar (SAR) image interpretation difficult, particularly when it comes to imaging of man-made structures, which can be modeled as composite scatterers. To isolate different scattering mechanism, we designed an airborne SAR experiment based on the high resolution, sub-metric to decimetric range, capabilities full-polarimetric SAR system, CARSS (Chinese Airborne Remote Sensing System), developing by IEcas. The imaging results are quite accord with the theoretical analysis. With polarimetric target decomposition, we can simply distinguish the different scattering mechanism. The idea to interpret the man-made targets as the combination of simple scattering mechanisms is supported by the experiment results.
Subpixel-based downsampling has shown its advantages over pixel-based downsampling in terms of preserving more spatial details along edges and generating sharper images, at the cost of certain amount of color-fringing...
详细信息
Subpixel-based downsampling has shown its advantages over pixel-based downsampling in terms of preserving more spatial details along edges and generating sharper images, at the cost of certain amount of color-fringing artifacts in the downsampled image. To balance the sharpness and color-fringing artifacts, some algorithms are proposed to design optimal anti-aliasing (AA) filters, which are either image independent, or computationally too expensive. And all of the existing AA filters are designed for fixed downsampling factor, which makes them impractical for real applications. In this paper we propose two fast algorithms to design AA filter for arbitrary factor subpixel downsampling based on frequency analysis of the input image. The proposed algorithms generate image dependent AA filter which is as good as the state-of-the-art algorithm, but much faster.
As a new Internet architecture, Named Data Networking (NDN) decouples location from the data itself to achieve security, scalability, and mobility. Although router-side data caching used in NDN reduces data acquisitio...
详细信息
As a new Internet architecture, Named Data Networking (NDN) decouples location from the data itself to achieve security, scalability, and mobility. Although router-side data caching used in NDN reduces data acquisition delay, it introduces a new copyright protection challenge: how to prevent unauthorized users to retrieve data cached in routers that are out of the control of its publisher? Current approaches that rely on a common encryption key among authorized users cannot protect copyright well since if one authorized user secretly leaks the key, we cannot tell who has leaked the key out. In this paper, we present a split-based scheme to solve this copyright protection problem for large-sized data. The data is split into a large part that could be cached in routers for all users to retrieve, and a small part that is unique for each authorized user. This scheme exploits the fact that in the bit-wise OR operation, both bit 0 and bit 1 can OR with 1 to generate the same result of bit 1. The analysis of our scheme shows that it has a good performance in terms of copyright protection, data retrieval efficiency, and overhead.
暂无评论