The proceedings contain 49 papers. The topics discussed include: efficient image retrieval with multiple distance measures; video content characterization and compaction for digital library applications; content-based video retrieval by example video clip; video server technology for high-performance retrieval in video on demand; evolving discriminators for querying video sequences; distributed data collection for a database of radiological image interpretations; design and implementation of an experimental video database system for supporting video retrieval from different perspectives; semantic image retrieval through human subject segmentation and characterization; and recognition and visual feature matching of text region in video for conceptual indexing.
A rotation, translation, and scaling (RTS) invariant color image indexing technique for imaging database systems is described. To demonstrate the RTS invariance of the image retrieval, the algorithm is tested on four databases in computer simulation. The proposed technique is found to be unaffected by substantial changes in the database images due to rotation, translation, and scaling. This makes the technique very robust and attractive for a number of applications of image storage and retrieval systems.
In this paper, we present topics related to tracking of video objects in compressed video databases in the context of video retrieval applications. We developed a video retrieval and tracking system (VORTEX) to enable operation directly on compressed video data. The structure of the video compression standards is exploited in order to avoid the costly decompression operation. This is achieved by utilizing motion compensation, a critical prediction filter embedded in video compression standards, to estimate and interpolate the desired method for template matching. Occlusion analysis, filtering, and motion analysis are used to implement fast tracking of objects of interest on the compressed video data. Presented with a query in the form of template images of objects, the system operates on the compressed video to find the images or video sequences in which those objects are present, together with their positions in the image. This enables the retrieval and display of the query-relevant sequences.
ISBN: 0819424331 (print)
Video databases can be searched for visual content by searching over automatically extracted key frames rather than the complete video sequence. Many video materials used in the humanities and social sciences contain a preponderance of shots of people. In this paper, we describe our work in semantic image retrieval of person-rich scenes (key frames) for video databases and libraries. We use an approach called retrieval through segmentation. A key-frame image is first segmented into human subjects and background. We developed a specialized segmentation technique that utilizes both human flesh-tone detection and contour analysis. Experimental results show that this technique can segment images effectively with low time complexity. Once the image has been segmented, we can extract features or pose queries about both the people and the background. We propose a retrieval framework that is based on the segmentation results and the extracted features of people and background.
ISBN: 0819424331 (print)
The temporal and multi-modal nature of video increases the dimensionality of the content-based retrieval problem. This places new demands on the indexing and retrieval tools required. The Virage Video Engine (VVE), with its default set of primitives, provides the necessary framework and basic tools for content-based video retrieval. The video engine is a flexible, platform-independent architecture that supports processing of multiple synchronized data streams such as image sequences, audio, and closed captions. The architecture allows for multi-modal indexing and retrieval of video through the use of media-specific primitives. This paper presents the use of the VVE framework for content-based video retrieval.
We present a fast algorithm for computing the singular value decomposition (SVD) of a matrix consisting of the frames from a video sequence. The computational efficiency of this algorithm derives from the observation that portions of a video sequence will consist of sets of correlated frames. We then show that the information obtained from the SVD can be used to analyze video sequences to obtain information such as scene breaks, scene query, reduced-order shot representation, and key frame determination. We illustrate this approach on several video sequences.
A novel similarity measure based on the Choquet integral was introduced for retrieving images from an image database that 'mostly' fit the query image. We showed that under certain conditions the measure is a norm, a fact that can be used to reduce search time by means of the triangle inequality. To test the new measure, a content-based image retrieval system was built. The system was benchmarked against Virage, the visual retrieval cartridge built into the Oracle 8 database system. The results suggested that the new measure is useful for image retrieval.
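To make the idea of a Choquet-integral dissimilarity concrete, here is a minimal sketch of the standard discrete Choquet integral applied to per-feature absolute differences. The abstract does not give the fuzzy measure used in the paper, so the cardinality-based measure `(|A| / n) ** power` and the names `choquet_integral`, `cardinality_measure`, and `choquet_distance` are assumptions chosen for illustration only.

```python
def choquet_integral(values, measure):
    """Discrete Choquet integral of non-negative values w.r.t. a set function.

    measure takes a frozenset of indices and returns a number in [0, 1];
    it should be monotone, 0 on the empty set and 1 on the full index set.
    """
    order = sorted(range(len(values)), key=lambda i: values[i])
    total, previous = 0.0, 0.0
    for pos, idx in enumerate(order):
        upper_set = frozenset(order[pos:])   # indices with value >= values[idx]
        total += (values[idx] - previous) * measure(upper_set)
        previous = values[idx]
    return total

def cardinality_measure(n, power):
    # Assumed fuzzy measure: depends only on subset size. power=1 gives the mean.
    return lambda subset: (len(subset) / n) ** power

def choquet_distance(x, y, power=3.0):
    diffs = [abs(a - b) for a, b in zip(x, y)]
    return choquet_integral(diffs, cardinality_measure(len(diffs), power))

query = [0.2, 0.4, 0.6, 0.8]
mostly_fits = [0.2, 0.4, 0.6, 0.0]     # three features match, one is far off
uniformly_off = [0.5, 0.7, 0.9, 1.1]   # every feature is off by about 0.3
d_mostly = choquet_distance(query, mostly_fits)
d_uniform = choquet_distance(query, uniformly_off)
```

With this assumed measure and `power` above 1, a single badly mismatched feature contributes little, so `mostly_fits` ranks closer to the query than `uniformly_off`. Whether a particular measure makes the result a norm, and so permits triangle-inequality pruning, depends on conditions the abstract does not spell out.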
ISBN: 0819424331 (print)
We study the problem of retrieving images using a small template. The goal is to allow a user to search for images containing a pattern similar to the template, adding to the capability of a search engine. We propose to employ a segmentation-based approach. As a specific example, we introduce a quadtree segmentation technique for textured images and a distance measure, Sum of Minimum Distance, suitable for template-based image retrieval applications.
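The abstract names the Sum of Minimum Distance measure without defining it. One plausible reading, sketched below purely as an assumption, is that each region of the template is matched to its nearest region in the candidate image and the nearest-neighbor distances are summed. The function name, the use of Euclidean distance, and the toy feature vectors are all hypothetical.

```python
import math

def sum_of_minimum_distance(template_regions, image_regions):
    """For each template feature vector, add the distance to its nearest
    image feature vector (an assumed, one-directional form of the measure)."""
    if not image_regions:
        raise ValueError("image_regions must not be empty")
    return sum(
        min(math.dist(t, r) for r in image_regions)
        for t in template_regions
    )

# Toy region features (assumed), e.g. two texture statistics per segmented region.
template = [(0.0, 0.0), (1.0, 1.0)]
image_a = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]   # contains the template pattern
image_b = [(3.0, 4.0)]                           # does not
score_a = sum_of_minimum_distance(template, image_a)
score_b = sum_of_minimum_distance(template, image_b)
```

A one-directional sum suits a small template, since extra regions in the candidate image (such as `(5.0, 5.0)` above) do not penalize the match.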
ISBN: 0819424331 (print)
This paper presents a novel approach for video retrieval from a large archive of MPEG or Motion JPEG compressed video clips. We introduce a retrieval algorithm that takes a video clip as a query and searches the database for clips with similar contents. Video clips are characterized by a sequence of representative frame signatures, which are constructed from DC coefficients and motion information ("DC+M" signatures). The similarity between two video clips is determined by using their respective signatures. This method facilitates retrieval of clips for the purpose of video editing, broadcast news retrieval, or copyright violation detection.
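The abstract does not say how signature sequences are compared, so the following is only a sketch under assumed conventions: each representative frame is reduced to a short numeric tuple (standing in for a DC+M signature), and a query clip is slid along a stored clip to find the best-aligned offset. The names `frame_distance` and `best_match`, the mean absolute difference, and the sample signatures are hypothetical.

```python
def frame_distance(sig_a, sig_b):
    # Mean absolute difference between two equal-length frame signatures.
    return sum(abs(a - b) for a, b in zip(sig_a, sig_b)) / len(sig_a)

def best_match(query, clip):
    """Slide the query signature sequence over the clip.

    Returns (offset, mean frame distance) for the best alignment,
    or None if the query is longer than the clip.
    """
    if len(query) > len(clip):
        return None
    best = None
    for offset in range(len(clip) - len(query) + 1):
        distance = sum(
            frame_distance(q, clip[offset + i]) for i, q in enumerate(query)
        ) / len(query)
        if best is None or distance < best[1]:
            best = (offset, distance)
    return best

# Assumed toy signatures: two numbers per representative frame.
stored_clip = [(10, 10), (20, 20), (30, 30), (40, 40)]
query_clip = [(30, 30), (40, 40)]
match = best_match(query_clip, stored_clip)
```

Comparing compact per-frame signatures rather than decoded pixels is what would keep such a search cheap on a large archive; a distance near zero at some offset suggests the query is an excerpt of the stored clip.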