The viability of large distributed imagedatabases is strongly dependent on the development of new image representations capable of providing support for Extended functionality, directly in the compressed domain. We h...
详细信息
ISBN:
(纸本)0818681837
The viability of large distributed imagedatabases is strongly dependent on the development of new image representations capable of providing support for Extended functionality, directly in the compressed domain. We have recently introduced one such representation (Library-based coding) which we now augment with statistical pre-indexing schemes, automatically built at the time of encoding, that provide several layers of content description allowing efficient content-based retrieval and summarization.
The usefulness of a collection of scanned graphical documents can be measured by the facilities available for their retrieval. We present an approach for indexing a collection of line drawings automatically. The index...
详细信息
ISBN:
(纸本)081941767X
The usefulness of a collection of scanned graphical documents can be measured by the facilities available for their retrieval. We present an approach for indexing a collection of line drawings automatically. The indexing is based on the textual and graphical content of the drawings. This approach has been developed to facilitate `retrieval by example' in heterogeneous collections of graphical documents. No a priori knowledge about the application domain is assumed. Starting with a raster image, candidate character patterns and graphical primitives (i.e., line segments and arcs) are extracted. Candidate character patterns are classified by an OCR method and grouped into word hypotheses. Graphical features of various types are computed from groupings of graphical primitives (e.g., sequences of adjacent lines, pairs of parallel lines). retrieval occurs with a weighted information retrieval system. Each document of the collection and each query are described with a set of indexing features with their corresponding weights. The weight of an indexing feature reflects the descriptive nature of the feature and is computed from the number of occurrences of the indexing feature in the document (feature frequency ff) and the number of documents containing the indexing feature (document frequency df).
Although content based retrieval of images is increasingly common, the use of media content as a basis for navigation has received relatively little attention. In this paper we describe our recent development of facil...
详细信息
ISBN:
(纸本)0819424331
Although content based retrieval of images is increasingly common, the use of media content as a basis for navigation has received relatively little attention. In this paper we describe our recent development of facilities in the MAVIS/Microcosm architecture for generic link authoring and following from non-text media and in particular, the use of shape and texture for content based navigation from images. Applications from a product catalogue and an archaeological collection are presented, together with an outline of an image viewer providing rapid delineation of object shapes in images when authoring or following links.
A novel similarity measure based on the Choquet integral was introduced for retrieving images from a image database that "mostly" fit the query image. We showed that in certain conditions the measure is a no...
详细信息
ISBN:
(纸本)0819431273
A novel similarity measure based on the Choquet integral was introduced for retrieving images from a image database that "mostly" fit the query image. We showed that in certain conditions the measure is a norm, a fact that can be used to reduce the searching time using the triangle inequality. To test the new measure, a content based imageretrieval system was built. The system was benchmarked against the visual retrieval cartridge, Virage, built into Oracle 8 database system. The results suggested that the new measure is useful for imageretrieval.
In this paper we address the problem of choosing appropriate features to describe the content of still pictures or video sequences including audio. As the computational analysis of these features is often time-consumi...
详细信息
In this paper we address the problem of choosing appropriate features to describe the content of still pictures or video sequences including audio. As the computational analysis of these features is often time-consuming it is useful to identify a minimal set allowing for an automatic classification of some class or genre. Further it can be shown that deleting the coherence of the features characterizing some class is not suitable to guarantee an optimal classification result. The central question of the paper is thus which features should be selected and how they should be weighted to optimize a classification problem.
An experimental video server for middle-scale video-on-demand services that uses a 'redundant double-layered disk array' can read out 100 MPEG-1 1.5-Mbps video streams simultaneously with a response time of un...
详细信息
ISBN:
(纸本)0819420441
An experimental video server for middle-scale video-on-demand services that uses a 'redundant double-layered disk array' can read out 100 MPEG-1 1.5-Mbps video streams simultaneously with a response time of under one second through an FDDI-LAN. An exclusive data method that switches between normal data and fast data and a skip-search method are used to provide fast visual search. The gateway connecting the video server LAN to a 6.312-Mbps constant bit-rate line allows broadcast services to be integrated with on- demand services. The protocol implemented in this gateway controls the visual search rate, corrects errors in downloaded data, and accelerates the playback mode changes.
This paper introduces a model of spatio-temporal database that we are developing to query interesting events in video sequences. The database that we are designing is pushing the state of the art for a number of field...
详细信息
This paper introduces a model of spatio-temporal database that we are developing to query interesting events in video sequences. The database that we are designing is pushing the state of the art for a number of fields, and there are many issues that are still waiting a satisfactory solution. In this paper we present our (albeit still partial) answer to some of these problems, and the future directions of our work. Our design is divided in two layers: a Logbook which operates as a short time repository of unsummarized and unprocessed data, and a long term spatio-temporal database which stores and queries summarized data.
We propose anew simple image coder based on Discrete Wavelet Transform (DWT). The DWT coefficients are coded in bitplanes. We use a variable order Markovian model to code the DWT coefficient bitplanes. Recently, we ha...
详细信息
ISBN:
(纸本)0819424331
We propose anew simple image coder based on Discrete Wavelet Transform (DWT). The DWT coefficients are coded in bitplanes. We use a variable order Markovian model to code the DWT coefficient bitplanes. Recently, we have developed this method that used 65 contexts(7). In this paper, the number of contexts is reduced to 34. We show the experimental results, both in terms of distortion measurement and visual comparison, and compare them to well-known methods.
In order to flexibly and efficiently store, manage, and present video data streams, continuous video data must be chopped into video objects and stored into database. This paper investigates systematic strategies for ...
详细信息
ISBN:
(纸本)0819420441
In order to flexibly and efficiently store, manage, and present video data streams, continuous video data must be chopped into video objects and stored into database. This paper investigates systematic strategies for supporting continuous and synchronized presentation of video data streams in multimedia database systems. Compressed video data streams are segmented and stored as sets of video objects coupled with specified synchronization requirements. Strategies for efficiently scheduling and buffering video objects are presented which guarantee the hiccup-free presentations of video streams. Delay effects are considered in these strategies. We propose to extend the existing object-oriented database system (OODBS) techniques to include the proposed video presentation mechanisms. We are currently designing and implementing a multimedia presentation tool (termed MediaShow) on top of O2, a well-known OODBS, as a basis for our implementation. However, the design strategies can be generally used in any OODBS environments that support C++ interface.
With the advance of multimedia technologies and the explosive expansion of the World Wide Web, the volume of image and video data increases rapidly. An efficient and effective multimedia data retrieval technique is ne...
详细信息
ISBN:
(纸本)0819439932
With the advance of multimedia technologies and the explosive expansion of the World Wide Web, the volume of image and video data increases rapidly. An efficient and effective multimedia data retrieval technique is needed. In this paper, we propose an approach based on feature points for the content-based imageretrieval. The feature points extracted from the multiresolution representation of the query image and database image are first matched to determine the matching pairs. Then, the matching pairs are classified into groups, finally, two similarity measurements based on different similarity requirements are proposed to compute the similarity degree. We perform a series of experiments to study. the characteristics of this approach, and compare with the region-based approach on similar-shot sequence retrieval. The comparison shows the superiority of this approach.
暂无评论