With proliferation of social media, image has become ubiquitous giving rise to the demand and importance of image semantic analysis and retrieval to access information quickly on social media. However, even with humon...
详细信息
With proliferation of social media, image has become ubiquitous giving rise to the demand and importance of image semantic analysis and retrieval to access information quickly on social media. However, even with humongous information available, there are certain categories of images which are important for certain applications but are very scarce. Convolutional neural network is an effective method to extract high-level semantic features for image database retrieval. To overcome the problem of over-fitting when the number of training samples in dataset is limited, this paper proposes an image database retrieval algorithm based on the framework of transfer learning and feature fusion. Based on the fine-tuning of the pre-trained Convolutional Neural Network (CNN), the proposed algorithm first extracts the semantic features of the images. Principal Component Analysis (PCA) is then applied for dimension reduction and to reduce the computational complexity. Last, the semantic feature extracted from the CNN is fused with traditional low-level visual feature to improve the retrieval accuracy further. Experimental results demonstrated the effectiveness of the proposed method for image database retrieval.
Data retrieval serves a critical role in the development of multimedia applications. However, due to the exponential growth of multimedia data, high-speed and efficient indexing is becoming more and more difficult tha...
详细信息
ISBN:
(纸本)9781538629376
Data retrieval serves a critical role in the development of multimedia applications. However, due to the exponential growth of multimedia data, high-speed and efficient indexing is becoming more and more difficult than ever. In this paper, we propose a novel approach to speed up the retrieval process by adopting a distributed computing paradigm through the Apache Spark framework Utilizing search trees in a Big Data ecosystem leads to fast and cost-effective media databaseretrievals by caching indexing structures into memory and aggregating ranked results with flexibilities for users to specify the importance of search cues. We conducted computational experiments on large-scaled vector files for remote sensing imagedatabase and synthesized pollen imagedatabase to demonstrate the effectiveness and scalability of our system with reasonably high accuracy.
Content-Based imageretrieval (CBIR) systems have been developed to support the imageretrieval based on image properties, such as color, shape and texture. In this paper, we are concerned with shape-based image retri...
详细信息
ISBN:
(纸本)9781479946471
Content-Based imageretrieval (CBIR) systems have been developed to support the imageretrieval based on image properties, such as color, shape and texture. In this paper, we are concerned with shape-based imageretrieval. In this context, we propose a new descriptor invariant to common geometrical transformations and robust to noise to describe shapes, it is an extension of the descriptor of Berretti and al., it is based on the curvature zero-crossing points from a Gaussian smoothed boundary to obtain primitives, called tokens. We propose to describe each token by its inverse of the curvature corresponding to the radius of osculating circle, its orientation and its normalized surface. Several experiments were conducted on the widely used MPEG-7 database. The performance was measured in terms of recall and precision. The obtained results show the promising performance of our method and its superiority over descriptor proposed by Berreti and al.
We introduce a novel rich local descriptor of an image point, we call the (labeled) distance set, which is determined by the spatial arrangement of image features around that point. We describe a two-dimensional (2-D)...
详细信息
We introduce a novel rich local descriptor of an image point, we call the (labeled) distance set, which is determined by the spatial arrangement of image features around that point. We describe a two-dimensional (2-D) visual object by the set of (labeled) distance sets associated with the feature points of that object. Based on a dissimilarity measure between (labeled) distance sets and a dissimilarity measure between sets of (labeled) distance sets, we address two problems that are often encountered in object recognition: object segmentation, for which we formulate a distance sets shape filter, and shape matching. The use of the shape filter is illustrated on printed and handwritten character recognition and detection of traffic signs in complex scenes. The shape comparison procedure is illustrated on handwritten character classification, COIL-20 database object recognition and MPEG-7 silhouette databaseretrieval.
An integrated approach to shape and color-based imageretrieval, where the cues color and shape are both utilized in a local rather than a global way, is presented in this paper. An experimental retrieval system has b...
详细信息
An integrated approach to shape and color-based imageretrieval, where the cues color and shape are both utilized in a local rather than a global way, is presented in this paper. An experimental retrieval system has been developed, which enables the user to search a color imagedatabase intuitively by presenting simple sketches. In order to be able to perform an elastic matching, which is especially needed in sketch-based imageretrieval, objects in the images are represented by Hidden Markov Models. The use of streams (sets of features that are assumed to be statistically independent) within the HMM framework allows the integration of shape and color derived features into a single model, thereby allowing to control the influence of the different streams via stream weights. The approach has been evaluated on a color imagedatabase containing 120 different isolated objects with arbitrary orientation and showed good retrieval results with several users. Furthermore, the use of HMMs allows efficient pruning and thus a fast retrieval even with large databases.
Curvature scale space (CSS) image is a multi-scale organisation of the inflection points of a closed planar curve as it is smoothed. It consists of several arch shape contours, each related to a concavity or a convexi...
详细信息
Curvature scale space (CSS) image is a multi-scale organisation of the inflection points of a closed planar curve as it is smoothed. It consists of several arch shape contours, each related to a concavity or a convexity of the curve. The maxima of these contours have already been used as shape descriptors to find similar shapes in large imagedatabases. In this article, we address the problem of shallow concavities. These may give rise to large contours in the CSS image. These contours may then match those corresponding to deep and wide concavities during the matching process. The phenomenon can be explained by recalling the fact that Gaussian smoothing leads to an approximation of geometric heat equation deformation. We have introduced a method to enrich the CSS image and create different contours for different types of concavities. We tested the proposed method on a database of 1100 images of marine creatures. A significant improvement was observed in the performance of the system on shapes with shallow segments. (C) 2000 Elsevier Science B.V. All rights reserved.
We propose a perceptually based system for pattern retrieval and matching. There is a need for such an "intelligent" retrieval system in applications such as digital museums and libraries, design, architectu...
详细信息
We propose a perceptually based system for pattern retrieval and matching. There is a need for such an "intelligent" retrieval system in applications such as digital museums and libraries, design, architecture, and digital stock photography. The central idea of the work is that similarity judgment has to be modeled along perceptual dimensions. Hence, we detect basic visual categories that people use in judgment of similarity, and design a computational model that accepts patterns as input, and depending on the query, produces a set of choices that follow human behavior in pattern matching. There are two major research aspects to our work. The first one addresses the issue of how humans perceive and measure similarity within the domain of color patterns. To understand and describe this mechanism we performed a subjective experiment. The experiment yielded five perceptual criteria used in comparison between color patterns (vocabulary), as well as a set of rules governing the use of these criteria in similarity judgment (grammar). The second research aspect is the actual implementation of the perceptual criteria and rules in an imageretrieval system. Following the processing typical for human vision, we design a system to: 1)extract perceptual features from the vocabulary and 2) perform the comparison between the patterns according to the grammar rules. The modeling of human perception of color patterns is new-starting with a new color codebook design, compact color representation, and texture description through multiple scale edge distribution along different directions. Moreover, we propose new color and texture distance functions that correlate with human performance. The performance of the system is illustrated with numerous examples from imagedatabases from different application domains.
This paper presents a new approach to content based retrieval in imagedatabases. The basic new idea in the proposed technique is to organize the quantized and truncated wavelet coefficients of an image into a suitabl...
详细信息
ISBN:
(纸本)0819435902
This paper presents a new approach to content based retrieval in imagedatabases. The basic new idea in the proposed technique is to organize the quantized and truncated wavelet coefficients of an image into a suitable tree structure. The tree structure respects the natural hierarchy imposed on the coefficients by the successive resolution levels. Ail the trees relative to the images in a database are organized into a trie. This structure helps in the error tolerant retrieval of queries. The results obtained show that this approach is promising provided that a suitable distance function between trees is adapted.
In many applications, the user of an imagedatabase system points to an image, and wishes to retrieve similar images from the database. Computer vision researchers aim to capture image information in feature vectors w...
详细信息
In many applications, the user of an imagedatabase system points to an image, and wishes to retrieve similar images from the database. Computer vision researchers aim to capture image information in feature vectors which describe shape, texture and color properties of the image. These vectors are indexed or compared to one another during query processing to find images from the database. This paper is concerned with the problem of shape similarity retrieval in imagedatabases. Curvature scale space (CSS) image representation along with a small number of global parameters are used for this purpose. The CSS image consists of several arch-shape contours representing the inflection points of the shape as it is smoothed. The maxima of these contours are used to represent a shape. The method is then tested on a database of 1100 images of marine creatures. A classified subset of this database is used to evaluate the method and compare it with other methods. The results show the promising performance of the method and its superiority over Fourier descriptors and moment invariants.
We propose the use of co-occurrence matrices/histograms (of relative distance, relative angle) between pairs of orientation tokens for silhouette recognition and texture discrimination. The orientation tokens are defi...
详细信息
We propose the use of co-occurrence matrices/histograms (of relative distance, relative angle) between pairs of orientation tokens for silhouette recognition and texture discrimination. The orientation tokens are defined as the tangent vectors to the boundary of the silhouette, or the gradient vectors for grey images. The efficiency of the method is demonstrated with the help of three different series of experiments with real data. (C) 1998 Elsevier Science B.V. All rights reserved.
暂无评论