imageretrieval systems that compare the query image exhaustively with each individual image in the database are not scalable to large databases. A scalable search system should ensure that the search time does not in...
详细信息
ISBN:
(纸本)0819431273
imageretrieval systems that compare the query image exhaustively with each individual image in the database are not scalable to large databases. A scalable search system should ensure that the search time does not increase linearly with the number of images in the database. We present a clustering based indexing technique, where the images in the database are grouped into clusters of images with similar color content using a hierarchical clustering algorithm. At search time the query image is not compared with all the images in the database, but only with a small subset. Experiments show that this clustering based approach offers a superior response time with a high retrieval accuracy. Experiments with different database sizes indicate that for a given retrieval accuracy the search time does not increase linearly with the database size.
This paper describes a rotation, translation, and scaling (RTS) invariant color image indexing technique for imaging database systems. The features used for image indexing are color based, which are extracted using th...
详细信息
ISBN:
(纸本)0819431273
This paper describes a rotation, translation, and scaling (RTS) invariant color image indexing technique for imaging database systems. The features used for image indexing are color based, which are extracted using the principal component analysis, Hotelling transform, and moment invariants. This synthesized feature extraction technique is devised to be computationally efficient for on-line fast imagestorage and retrieval using color information Since the database indexing relies on a the use of average (mean) color vector and seven moment invariants of an image, the index storage requirement of the method is only a ten-dimensional (10-D) vector. This index storage efficiency is very desirable for many imaging database applications. A new similarity measure is also proposed based on the Tanimoto measure of recognizing similar patterns for fast imageretrieval in large database systems. The underlying similarity measure is computationally effective since the vector inner product is the only operation needed for its computation. Four databases are experimented in the computer simulation of the algorithm to demonstrate the RTS property of the imageretrieval. It is determined experimentally that the proposed method is not affected by substantial changes in the database images due to rotation, translation, and scaling. This is attributed to the fact that the moment features used for retrieval are not predefined sets;rather, they are obtained directly from the images submitted for recording or searching. This makes the algorithm very robust and attractive for many applications of the imagestorage and retrieval systems.
video editors are frequently required to access sections of a video sequence which contain a particular scene. This may be regarded as an imageretrieval-by-content problem where the user wishes to select images from ...
详细信息
ISBN:
(纸本)081941767X
video editors are frequently required to access sections of a video sequence which contain a particular scene. This may be regarded as an imageretrieval-by-content problem where the user wishes to select images from within a large database according to a measure of similarity to a target. We present an intelligent video editing system based on a neural network coding scheme. The transformation learnt by the neural network maps each image into a very compact index which supports rapid fuzzy matching of videoimages. The neural network is trained using a learning law which produces an information preserving transform. Trained in this way, the node learns features which characterize the distribution of scenes within the video sequence. Each image frame in the sequence is coded with respect to these features. We show how the system performs on a typical sequence of newsreel footage and discuss the factors affecting the performance of both the training and the retrieval mechanism.
Anchoring is a technique for representing objects by their distances to a few well chosen landmarks, or anchors. Objects are mapped to distance-based feature vectors, which can be used for content-based retrieval, cla...
详细信息
ISBN:
(纸本)0819448214
Anchoring is a technique for representing objects by their distances to a few well chosen landmarks, or anchors. Objects are mapped to distance-based feature vectors, which can be used for content-based retrieval, classification, clustering, and relevance feedback of images, audio, and video. The anchoring transformation typically reduces dimensionality and replaces expensive similarity computations in the original domain with simple distance computations in the anchored feature domain, while guaranteeing lack of false dismissals. Anchoring is therefore surprisingly simple, yet effective, and flavors of it have seen application in speech recognition, audio classification, protein homology detection, and shape matching. In this paper, we describe the anchoring technique in some detail and study methods for anchor selection, both from an analytical, as well as empirical, standpoint. Most work to date has largely ignored this problem by fixing the anchors to be the entire set of objects or by using greedy selection from among the set of objects. We generalize previous work by considering anchors from outside of the object space, and by deriving an analytical upper bound on the distance-approximation error of the method.
In recent years, databases have evolved from storing pure textual information to storing multimedia information - text, audio, video, and images. With such databases comes the need for a richer set of search keys that...
详细信息
ISBN:
(纸本)081941767X
In recent years, databases have evolved from storing pure textual information to storing multimedia information - text, audio, video, and images. With such databases comes the need for a richer set of search keys that include keywords, shapes, sounds, examples, sketches, color, texture and motion. In this paper we address the problem of imageretrieval where keys are object shapes or user sketches. In our scheme, shape features are extracted from each image as it is stored. The image is first segmented and points of high curvature are extracted. Regions surrounding the points of high curvature are used to compute feature values by comparing the regions with a number of references. The references themselves are picked out from the set of orthonormal wavelet basis vectors. An ordered set of distance measures between each local region and the wavelet references form a feature vector. When a user queries the database through a sketch, the feature vectors for high curvature points on the sketch are determined. An efficient nearest neighbor search then yields a set of images which contain objects that match the user's sketch closely. The process is completely automated. Initial experimental results are presented.
We have been developing an imageretrieval system, called MIPS (Multiscalar image Processing and retrieval System), for use in uncontrolled environments. On insertion into the image database, the images are automatica...
详细信息
ISBN:
(纸本)0819427527
We have been developing an imageretrieval system, called MIPS (Multiscalar image Processing and retrieval System), for use in uncontrolled environments. On insertion into the image database, the images are automatically segmented into homogeneous regions. Generic features are computed and stored for each segment. Specifically, we maintain not only geometric and photometric attributes but also simple spatial information for each extracted region. This approach asks the user to construct queries in terms of the given primitives, i.e. regions and their spatial relations. Preliminary results show that the success of the system depends on how well the images can be modelled by homogeneous regions, on how useful the generic features are for the given application, and on the knowledge that the user puts into the formulation of the queries. A fully automatic segmentation algorithm is of paramount importance. We have designed an algorithm called Perceptual Region Growing that combines region growing, edge detection, and perceptual organization principles, without resorting to any kind of high level knowledge or interactive user intervention. Decision thresholds and quality measures are directly derived from the image data, based on image statistics. Search through critical parameter spaces is the key idea to cope with noise in uncontrolled environments. The dynamics of the region growing process is constantly monitored and exploited.
This paper introduces a model of spatio-temporal database that we are developing to query interesting events in video sequences. The database that we are designing is pushing the state of the art for a number of field...
详细信息
This paper introduces a model of spatio-temporal database that we are developing to query interesting events in video sequences. The database that we are designing is pushing the state of the art for a number of fields, and there are many issues that are still waiting a satisfactory solution. In this paper we present our (albeit still partial) answer to some of these problems, and the future directions of our work. Our design is divided in two layers: a Logbook which operates as a short time repository of unsummarized and unprocessed data, and a long term spatio-temporal database which stores and queries summarized data.
video hypermedia systems enable users to retrieve information related to an object by selecting it directly in a video sequence. In video hypermedia systems, users must locate an anchor position according to the motio...
详细信息
ISBN:
(纸本)0819427527
video hypermedia systems enable users to retrieve information related to an object by selecting it directly in a video sequence. In video hypermedia systems, users must locate an anchor position according to the motion of the object. But it is very laborious to locate an anchor to its suitable position according to the motion of the object. We have proposed a new automatic object tracking method and implemented it to the system. A feature of this method is that it includes various automatic error correction algorithms. We evaluated this system on effectiveness on reducing human operations. As a result, the number of operations reduced to 30.3% of the former method, and the time of operations reduced to 60.1% of the former method.
Motion and structure analysis in video sequences can lead to efficient descriptions of objects and their motions. Interesting events in videos can be detected using such an analysis--for instance independent object mo...
详细信息
ISBN:
(纸本)0819414808
Motion and structure analysis in video sequences can lead to efficient descriptions of objects and their motions. Interesting events in videos can be detected using such an analysis--for instance independent object motion when the camera itself is moving, figure-ground segregation based on the saliency of a structure compared to its surroundings. In this paper we present a method for 3D motion and structure analysis that uses a planar surface in the environment as a reference coordinate system to describe a video sequence. The motion in the video sequence is described as the motion of the reference plane, and the parallax motion of all the non-planar components of the scene. It is shown how this method simplifies the otherwise hard general 3D motion analysis problem. In addition, a natural coordinate system in the environment is used to describe the scene which can simplify motion based segmentation. This work is a part of an ongoing effort in our group towards video annotation and analysis for indexing and retrieval. Results from a demonstration system being developed are presented.
In this paper we address the problem of choosing appropriate features to describe the content of still pictures or video sequences including audio. As the computational analysis of these features is often time-consumi...
详细信息
In this paper we address the problem of choosing appropriate features to describe the content of still pictures or video sequences including audio. As the computational analysis of these features is often time-consuming it is useful to identify a minimal set allowing for an automatic classification of some class or genre. Further it can be shown that deleting the coherence of the features characterizing some class is not suitable to guarantee an optimal classification result. The central question of the paper is thus which features should be selected and how they should be weighted to optimize a classification problem.
暂无评论