This Volume 4315 of the conference proceedings contains 620 papers. Topics discussed include search and retrieval of image database, indexing, querying and learning, media information systems, multimodel retrieval, fe...
详细信息
This Volume 4315 of the conference proceedings contains 620 papers. Topics discussed include search and retrieval of image database, indexing, querying and learning, media information systems, multimodel retrieval, feature evaluation, video processing, video sequences, videoretrieval systems and MPEG.
Indexing of scientific imagedatabases is a difficult task, due to their extraordinary sizes and to the complex nature of the visual information contained in them. This data volume and complexity require an automatic ...
详细信息
ISBN:
(纸本)0819414808
Indexing of scientific imagedatabases is a difficult task, due to their extraordinary sizes and to the complex nature of the visual information contained in them. This data volume and complexity require an automatic indexing scheme that will categorize this visual content; without it, the data will be essentially useless to scientists and medical doctors. A method for automatic indexing of scientific imagedatabases is presented which involves a wavelet package decomposition of images in the frequency domain, resulting in a quad-tree of subbands. These subbands are regarded as realizations of random fields, and statistical measures are computed on them. One class of newly derived measures determines whether the subbands contain any significant organization of pixels beyond what chance would imply. If this is found to be true for a subband, its node is retained on an index tree, and other identifying measurements may be added. The structure of the resulting pruned subband tree constitutes the first level of index; the node statistics form a second indexing level. Results of a pilot study are reported; they suggest that further investigation of this approach is warranted.
In this paper we propose a novel system of semantic feature extraction and retrieval for interior design and decoration application. The system, V2ID (Virtual Interior Design), uses colored texture and spatial edge la...
详细信息
In this paper we propose a novel system of semantic feature extraction and retrieval for interior design and decoration application. The system, V2ID (Virtual Interior Design), uses colored texture and spatial edge layout to obtain simple information about global room environment. We address the domain specific segmentation problem in our application and present techniques for obtaining semantic features from a room environment. We also discuss heuristics for making use of these features (color, texture, edge layout and shape) to retrieve objects from an existing database. The final resynthesized room environment with original scene and objects from database is created for the purpose of animation and virtual walk-through.
This paper introduces a model of spatio-temporal database that we are developing to query interesting events in video sequences. The database that we are designing is pushing the state of the art for a number of field...
详细信息
ISBN:
(纸本)0819431273
This paper introduces a model of spatio-temporal database that we are developing to query interesting events in video sequences. The database that we are designing is pushing the state of the art for a number of fields, and there are many issues that are still waiting a satisfactory solution. In this paper we present our (albeit still partial) answer to some of these problems, and the future directions of our work. Our design is divided in two layers: a Logbook which operates as a short time repository of unsummarized and unprocessed data, and a long term spatio-temporal database which stores and queries sumamrized data.
The Fractal Transform (FT) was originally introduced as a methodology for compressing digital images and representing them at different scales. The process of calculating an FT generates a great dear of information ab...
详细信息
ISBN:
(纸本)0819431273
The Fractal Transform (FT) was originally introduced as a methodology for compressing digital images and representing them at different scales. The process of calculating an FT generates a great dear of information about the affine similarities and dissimilarities of an image, most of which is discarded in compression applications. In this paper we introduce the concept of Fractal Transform Analysis and use it to derive new image descriptors. We present results of experiments in which description schemes comprised of some of these FT-based descriptors are applied to the problems of finding objects in an image similar to a given object, of indexing images, and of querying an image database consisting of about 17,000 images. Complexity and timing data are also presented.
In this paper, a technique is presented to locate and track the facial areas in image and videodatabases. The extracted facial regions are used to obtain a number of features that are suitable for content-based stora...
详细信息
ISBN:
(纸本)0819427497
In this paper, a technique is presented to locate and track the facial areas in image and videodatabases. The extracted facial regions are used to obtain a number of features that are suitable for content-based storage and retrieval. The proposed face localization method consists of essentially two components: i) a color processing unit, and ii) a shape and color analysis module. The color processing component utilizes the distribution of skin-tones in the HSV color space to obtain an initial set of candidate regions or objects. The latter shape and color analysis module is used to correctly identify the facial regions when falsely detected objects are extracted. A number of features such as hair color, skin-tone, and face location and size are subsequently determined from the extracted facial areas. The hair and skin colors provide useful descriptions related to the human characteristics while the face location and size can reveal information about the activity within the scene (i.e. spatial relationships with other objects), and the type of image (i.e. portrait shot, complete body). These features can be effectively combined with others and employed in user queries to retrieve particular facial images.
The Fractal Transform (FT) was originally introduced as a methodology for compressing digital images and representing them at different scales. The process of calculating an FT generates a great deal of information ab...
详细信息
The Fractal Transform (FT) was originally introduced as a methodology for compressing digital images and representing them at different scales. The process of calculating an FT generates a great deal of information about the affine similarities and dissimilarities of an image, most of which is discarded in compression applications. In this paper we introduce the concept of Fractal Transform Analysis and use it to derive new image descriptors. We present results of experiments in which description schemes comprised of some of these FT-based descriptors are applied to the problems of finding objects in an image similar to a given object, of indexing images, and of querying an image database consisting of about 17,000 images. Complexity and timing data are also presented.
For developing advanced query formulation methods for general multimedia data, we describe the issues related to video data. We distinguish between the requirements for imageretrieval and videoretrieval by identifyi...
详细信息
ISBN:
(纸本)081941767X
For developing advanced query formulation methods for general multimedia data, we describe the issues related to video data. We distinguish between the requirements for imageretrieval and videoretrieval by identifying queryable attributes unique to video data, namely audio, temporal structure, motion, and events. Our approach is based on visual query methods to describe predicates interactively while providing feedback that is as similar as possible to the video data. An initial prototype of our visual query system for video data is presented.
IBM's Ultimedia Manager is a software product for management and retrieval of image data. The product includes both traditional database search and content based search. Traditional database search allows images t...
详细信息
ISBN:
(纸本)081941767X
IBM's Ultimedia Manager is a software product for management and retrieval of image data. The product includes both traditional database search and content based search. Traditional database search allows images to be retrieved by text descriptors or business data such as price, date, and catalog number. Content based search allows retrieval by similarity to a specified color, texture, shape, position or any combination of these. The two can be combined, as in 'retrieve all images with the text `beach' in their description, and sort them in order by how much blue they contain.' Functions are also available for fast browning, and for database navigation. The two main components of Ultimedia Manger are a database population tool to prepare images for query by identifying areas of interest and computing their features, and the query tool for doing retrievals. Application areas include stock photography, electronic libraries, retail, cataloging, and business graphics.
In this paper we introduce our approach to multimedia database interfaces. Although we deal mainly with imagedatabases, most of the ideas we present can be generalized to other types of data. We argue that, when deal...
详细信息
ISBN:
(纸本)0819431273
In this paper we introduce our approach to multimedia database interfaces. Although we deal mainly with imagedatabases, most of the ideas we present can be generalized to other types of data. We argue that, when dealing with complex data, such as images, the problem of access must be redefined along different lines than text databases. In multimedia databases, the semantics of the data is imprecise, and depends in part on user's interpretation. This observation made us consider the development of interfaces in which the user explores the database rather than querying it. In this paper we give a brief justification of our position and present the exploratory interface that we have developed for our image database El nino.
暂无评论