A high-level representation of a video clip comprising information about its physical and semantic structure is necessary for providing appropriate processing, indexing and retrieval capabilities for videodatabases. ...
详细信息
ISBN:
(纸本)0819427527
A high-level representation of a video clip comprising information about its physical and semantic structure is necessary for providing appropriate processing, indexing and retrieval capabilities for videodatabases. We describe a novel technique which reduces a sequence of MPEG encoded video frames to a trail of points in a low dimensional space. In our earlier work,(1) we presented techniques applicable in 3-D, but in this paper, we describe techniques that can be extended to higher dimensions where improved performance is expected. In the low-dimensional space, we can cluster frames, analyze transitions between clusters and compute properties of the resulting trail. Portions of the trail can be classified as either stationary or transitional, leading to high-level descriptions of the video. Tracking the interaction of clusters over time, we lay the groundwork for the complete analysis and representation of the video's physical and semantic structure.
The usefulness of a collection of scanned graphical documents can be measured by the facilities available for their retrieval. We present an approach for indexing a collection of line drawings automatically. The index...
详细信息
ISBN:
(纸本)081941767X
The usefulness of a collection of scanned graphical documents can be measured by the facilities available for their retrieval. We present an approach for indexing a collection of line drawings automatically. The indexing is based on the textual and graphical content of the drawings. This approach has been developed to facilitate `retrieval by example' in heterogeneous collections of graphical documents. No a priori knowledge about the application domain is assumed. Starting with a raster image, candidate character patterns and graphical primitives (i.e., line segments and arcs) are extracted. Candidate character patterns are classified by an OCR method and grouped into word hypotheses. Graphical features of various types are computed from groupings of graphical primitives (e.g., sequences of adjacent lines, pairs of parallel lines). retrieval occurs with a weighted information retrieval system. Each document of the collection and each query are described with a set of indexing features with their corresponding weights. The weight of an indexing feature reflects the descriptive nature of the feature and is computed from the number of occurrences of the indexing feature in the document (feature frequency ff) and the number of documents containing the indexing feature (document frequency df).
There is a growing need for the ability to query imagedatabases based on image content rather than strict keyword search. Most current image database systems that perform query by content require a distance computati...
详细信息
ISBN:
(纸本)0819424331
There is a growing need for the ability to query imagedatabases based on image content rather than strict keyword search. Most current image database systems that perform query by content require a distance computation for each image in the database. Distance computations can be time consuming, limiting the usability of such systems. There is thus a need for indexing systems and algorithms that can eliminate candidate images without performing distance calculations. As user needs may change from session to session, there is also a need for run-time creation of distance measures. In this paper, we introduce FIDS, or ''Flexible image Database System.'' FIDS allows the user to query the database based on user-defined polynomial combinations of predefined distance measures. Using an indexing scheme and algorithms based on the triangle inequality, FIDS can return matches to the query image without directly comparing the query image to much of the database. FIDS is currently being tested on a database of eighteen hundred images.
This paper proposes an imageretrieval system which searches a database for images similar to a target imagined by a user. The system uses image features, rather than keywords, and retrieves images by reducing a multi...
详细信息
This paper proposes an imageretrieval system which searches a database for images similar to a target imagined by a user. The system uses image features, rather than keywords, and retrieves images by reducing a multidimensional feature space generated by the image feature vectors. First, the system presents the user some sample images with a suitable feature vector value and requires the user's interaction to obtain information on which image is similar to the target he/she has in his/her mind. Then, the information is used to appropriately reduce the feature space. This process is continued until the target region is reduced to a suitable volume. Since this method requires neither real target image nor keywords in retrieving, it is quite simple and practical. Experimental results show the advantage and efficiency of the proposed system.
The use of video data in the multimedia environment is increasing rapidly, and so tools to handle large volumes of video data are required. One of the first steps towards creating more versatile video tools is to segm...
详细信息
ISBN:
(纸本)0819427527
The use of video data in the multimedia environment is increasing rapidly, and so tools to handle large volumes of video data are required. One of the first steps towards creating more versatile video tools is to segment the video data, i.e. partition it into its component shots. This paper presents a novel combination of algorithms for video segmentation, utilising histogram comparison, motion vector information (focus of expansion), and edge information to detect transitions between shots. Our method can reliably detect transitions such as camera breaks, fades, dissolves and wipes, in video compressed to the MPEG-I standard.
We have developed a TV-on-demand system that provides playback of a television program after a period ranging from a few seconds to one week after broadcast, and have conducted usage trials in cooperation with a telev...
详细信息
We have developed a TV-on-demand system that provides playback of a television program after a period ranging from a few seconds to one week after broadcast, and have conducted usage trials in cooperation with a television station in Nagano Prefecture of Japan. This system has been achieved through the development of various technologies such as automatic updating of stored television programs and contents retrieval by telop characters. Users in the trials can begin playback of a television program immediately after its broadcast has begun. The purpose of the trials was to evaluate the system's usability in applications such as contents retrieval, selective viewing of commercials, and customer service at the television station. This paper presents applied technologies and some experimental results and also addresses a new direction of information retrieval system based on the evaluation of the usage trials.
A novel similarity measure based on the Choquet integral was introduced for retrieving images from a image database that "mostly" fit the query image. We showed that in certain conditions the measure is a no...
详细信息
ISBN:
(纸本)0819431273
A novel similarity measure based on the Choquet integral was introduced for retrieving images from a image database that "mostly" fit the query image. We showed that in certain conditions the measure is a norm, a fact that can be used to reduce the searching time using the triangle inequality. To test the new measure, a content based imageretrieval system was built. The system was benchmarked against the visual retrieval cartridge, Virage, built into Oracle 8 database system. The results suggested that the new measure is useful for imageretrieval.
This paper describes an API for image searching. The attempt was to isolate the functionality of the GUI from the functionality of the image search engine. The GUI would then make calls to the image search API and cou...
详细信息
This paper describes an API for image searching. The attempt was to isolate the functionality of the GUI from the functionality of the image search engine. The GUI would then make calls to the image search API and could be used with any image search engine implementing that API. Also, different methods of specifying the initial search image are discussed as well as different methods of displaying the results, including the use of 3D using VRML.
In this paper we present a multiresolution approach for video indexing and feature matching of subband coded videodatabases. Subband coding refers to a coding technique where the input images are quantized after bein...
详细信息
ISBN:
(纸本)0819414808
In this paper we present a multiresolution approach for video indexing and feature matching of subband coded videodatabases. Subband coding refers to a coding technique where the input images are quantized after being decomposed into several narrow spatial frequency bands by filtering and decimation. Five different approaches were tested for scene change detection which is applied only on the lowest subband for computational efficiency. Two kinds of scene changes, abrupt and smoothly accumulated scene changes, mark the beginning of new scene segments. An index for each scene segment is the histogram of two representative frames, which we take to be the first and the last frame of the scene for simplicity. Using the approach of query by example, the index matching algorithm takes a multi-resolution approach by hierarchically comparing histograms at different resolutions. The search algorithm for the match between example query and its target scene segment starts from the coarsest resolution, and moves to the next finer resolution until a single match is obtained or the finest resolution is reached. Experimental results are presented, and the proposed indexing technique appears to be promising for its computational efficiency and its inherent hierarchical search procedure.
Although content based retrieval of images is increasingly common, the use of media content as a basis for navigation has received relatively little attention. In this paper we describe our recent development of facil...
详细信息
ISBN:
(纸本)0819424331
Although content based retrieval of images is increasingly common, the use of media content as a basis for navigation has received relatively little attention. In this paper we describe our recent development of facilities in the MAVIS/Microcosm architecture for generic link authoring and following from non-text media and in particular, the use of shape and texture for content based navigation from images. Applications from a product catalogue and an archaeological collection are presented, together with an outline of an image viewer providing rapid delineation of object shapes in images when authoring or following links.
暂无评论