The proceedings contain 49 papers. The topics discussed include: efficient imageretrieval with multiple distance measures;video content characterization and compaction for digital library applications;content-based v...
The proceedings contain 49 papers. The topics discussed include: efficient imageretrieval with multiple distance measures;video content characterization and compaction for digital library applications;content-based videoretrieval by example video clip;video server technology for high-performance retrieval in video on demand;evolving discriminators for querying video sequences;distributed data collection for a database of radiological image interpretations;design and implementation of an experimental video database system for supporting videoretrieval from different perspectives;semantic imageretrieval through human subject segmentation and characterization;and recognition and visual feature matching of text region in video for conceptual indexing.
videodatabases can be searched for visual content by searching over automatically extracted key frames rather than the complete video sequence. Many video materials used in the humanities and social sciences contain ...
详细信息
ISBN:
(纸本)0819424331
videodatabases can be searched for visual content by searching over automatically extracted key frames rather than the complete video sequence. Many video materials used in the humanities and social sciences contain a preponderance of shots of people. In this paper, we describe our work in semantic imageretrieval of person-rich scenes (key frames) for videodatabases and libraries. We use an approach called retrieval through segmentation. A key-frame image is first segmented into human subjects and background. We developed a specialized segmentation technique that utilizes both human flesh-tone detection and contour analysis. Experimental results show that this technique can effectively segment images in a low time complexity. Once the image has been segmented, we can then extract features or pose queries about both the people and the background. We propose a retrieval framework that is based on the segmentation results and the extracted features of people and background.
The temporal and multi-modal nature of video increases the dimensionality of content based retrieval problem. This places new demands on the indexing and retrieval tools required. The virage video Engine (vvE) with th...
详细信息
ISBN:
(纸本)0819424331
The temporal and multi-modal nature of video increases the dimensionality of content based retrieval problem. This places new demands on the indexing and retrieval tools required. The virage video Engine (vvE) with the default set of primitives provide the necessary frame work and basic tools for video content based retrieval. The video engine is a flexible platform independent architecture which provides support for processing multiple synchronized data streams like image sequences, audio and closed captions. The architecture allows for multi-modal indexing and retrieval of video through the use of media specific primitives. This paper presents the use of the vvE framework for content based videoretrieval.
We study the problem of retrieving images using a small template. The goal is to allow a user to search for images containing a pattern similar to the template, adding to the capability of a search engine. We propose ...
详细信息
ISBN:
(纸本)0819424331
We study the problem of retrieving images using a small template. The goal is to allow a user to search for images containing a pattern similar to the template, adding to the capability of a search engine. We propose to employ a segmentation-based approach. As a specific example, we introduce a quadtree segmentation technique for textured images and a distance measure, Sum of Minimum Distance, suitable for template-based imageretrieval applications.
This paper presents a novel approach for videoretrieval from a large archive of MPEG or Motion JPEG(1) compressed video clips. We introduce a retrieval algorithm that takes a video clip as a query and searches the da...
详细信息
ISBN:
(纸本)0819424331
This paper presents a novel approach for videoretrieval from a large archive of MPEG or Motion JPEG(1) compressed video clips. We introduce a retrieval algorithm that takes a video clip as a query and searches the database for clips with similar contents. video clips are characterized by a sequence of representative frame signatures, which are constructed from DC coefficients and motion information (''DC+M'' signatures). The similarity between two video clips is determined by using their respective signatures. This method facilitates retrieval of clips for the purpose of video editing, broadcast news retrieval, or copyright violation detection.
The development of increasingly complex multimedia applications calls for new methodologies for the organization and retrieval of still images and video sequences. Query and retrieval methods based on image content pr...
详细信息
ISBN:
(纸本)0819424331
The development of increasingly complex multimedia applications calls for new methodologies for the organization and retrieval of still images and video sequences. Query and retrieval methods based on image content promise good results, are currently widely investigated and, to some extent, already commercially available. Yet a large number of issues remain unsolved. In this paper we describe some results of a study on similarity evaluation in imageretrieval using color, object orientation and relative position as content features. A simple prototype system is also introduced that computes the feature descriptors and performs queries. Although not trivial, the features extraction process is completely automated and requires no user intervention. The system is admittedly not a general purpose tool, but is oriented to thematic image repositories where the semantics of stored images are limited to a specific domain.
This paper describes the development of a prototype of a video database system, called vLdIO, that takes account of the importance of different perspectives in videoretrieval. Text-based hierarchical structures are u...
详细信息
ISBN:
(纸本)0819424331
This paper describes the development of a prototype of a video database system, called vLdIO, that takes account of the importance of different perspectives in videoretrieval. Text-based hierarchical structures are used for representing the contents of a video. The structures are used for supporting the required functionalities in organizing personalized video materials. In addition to support for indexing original video materials, the system also supports tools for re-indexing and maintaining the results of videoretrieval. In other words it tries to fulfill the requirement of personalized video information management. The paper defines the requirement, outlines the key considerations in providing such support and describes the implemented system.
In this paper me present a framework for content based query and retrieval of information from large videodatabases. This framework enables content based retrieval of video sequences by characterizing the sequences u...
详细信息
ISBN:
(纸本)0819424331
In this paper me present a framework for content based query and retrieval of information from large videodatabases. This framework enables content based retrieval of video sequences by characterizing the sequences using motion, texture and colorimetry cues. This characterization is biologically inspired and results in a compact parameter space where every segment of video is represented by an 8 dimensional vector. Searching and retrieval is done in real-time with accuracy in this parameter space. Using this characterization, we then evolve a set of discriminators using Genetic Programming. Experiments indicate that these discriminators are capable of analyzing and characterizing video. The videoBook is able to search and retrieve video sequences with 92% accuracy in real-time. Experiments thus demonstrate that the characterization is capable of extracting higher level structure from raw pixel values.
In this paper we address imageretrieval by similarity in multimedia databases. We discuss the generation and use of signatures computed from image content. The proposed technique is not based on image annotation, the...
详细信息
ISBN:
(纸本)0819424331
In this paper we address imageretrieval by similarity in multimedia databases. We discuss the generation and use of signatures computed from image content. The proposed technique is not based on image annotation, therefore it does not require human assistance. Signatures abstract the directionality of image objects. They are computed from the image Fourier transform, and the influence of computation parameters on signature effectiveness is discussed. retrieval is based on spectrum comparison between a reference image, assumed as the query, and the images in a collection. We introduce a metric for comparing the spectra and ranking the result, and approach the issue of partial query specification. Sample results on a small test collection are given.
暂无评论