This paper describes the retrieval process from imagedatabases, based on a partial match between the query and the images. The proposed approach allows to measure the similarity between the query and the images in th...
详细信息
ISBN:
(纸本)0819414808
This paper describes the retrieval process from imagedatabases, based on a partial match between the query and the images. The proposed approach allows to measure the similarity between the query and the images in the database and to retrieve those having the highest probability to be relevant. The paper describes the query processing and the access structures, based on the `signature method'. Four levels of signature files are associated to the image database and a signature is associated to the query. The query signature is compared with the image signatures in a four step image processing algorithm. The result of the process is a set of images with an associated recognition degree, measured by using information provided by the user during query formulation (such as importance of the presence of each object) and by using the image structure and the recognition degree associated to each object. The retrieved images are presented to the user in decreasing relevance order. The method described so far is inefficient, since the selection of most relevant images is executed among all relevant images (even those having a low relevance). The paper presents two approaches for improving the efficiency of query processing by (a) reducing the number of accesses to the image Database and (b) by reducing the number of accesses to the signature file. The two approaches are discussed in detail in the paper. The advantages and drawbacks of each method are illustrated.
In this paper we consider the problem of similarity between video sequences. Three basic questions are raised and (partially) answered. Firstly, at what temporal duration can video sequences be compared? The frame, sh...
详细信息
ISBN:
(纸本)0819427527
In this paper we consider the problem of similarity between video sequences. Three basic questions are raised and (partially) answered. Firstly, at what temporal duration can video sequences be compared? The frame, shot, scene and video levels are identified. Secondly, given some image or video feature, what are the requirements on its distance measure and how can it be "easily" transformed into the visual similarity desired by the inquirer? Thirdly how can video sequences be compared at different levels? A general approach based on either a set or sequence representation with variable degrees of aggregation is proposed and applied recursively over the different levels of temporal resolution. It allows the inquirer to fully control the importance of temporal ordering and duration. Promising experimental results are presented.
Content Based imageretrieval has recently become one of the most active research areas, due to the massive increase in the amount and complexity of digitized data being stored, transmitted and accessed. We present he...
详细信息
Content Based imageretrieval has recently become one of the most active research areas, due to the massive increase in the amount and complexity of digitized data being stored, transmitted and accessed. We present here a prototype implementation of DRAWSEARCH, an imageretrieval by content system that uses color and shape (and texture in the near future) features to index and retrieve images. The system, currently being tested and improved, is designed to increase interactivity with users posing queries over the Internet and avails of a Java client for query by sketch. It also implements relevance feedback to allow users dynamically refine queries. Experiments show that the proposed approach can greatly reduce the user's effort to compose a query while capturing his/her information need with greater precision.
The worldwide research efforts in the area of image and videoretrieval have concentrated so far on increasing the efficiency and reliability of extracting the elements of image and video semantics and so on improving...
详细信息
ISBN:
(纸本)0819439932
The worldwide research efforts in the area of image and videoretrieval have concentrated so far on increasing the efficiency and reliability of extracting the elements of image and video semantics and so on improving the search and retrieval performance at the cognitive level of content abstraction. At this abstraction level, the user is searching for "factual" or "objective" content such as image showing a panorama of San Francisco, an outdoor or an indoor image, a broadcast news report on a defined topic, a movie dialog between the actors A and B or the parts of a basketball game showing fast breaks, steals and scores. These efforts, however, do not address the retrieval applications at the so-called affective level of content abstraction where the "ground truth" is not strictly defined. Such applications are, for instance, those where subjectivity of the user plays the major role, e.g. the task of retrieving all images that the user "likes most", and those that are based on "recognizing emotions" in audiovisual data. Typical examples are searching for all images that "radiate happiness", identifying all "sad" movie fragments and looking for the "romantic landscapes", "sentimental" movie segments, "movie highlights" or "most exciting" moments of a sport event. This paper discusses the needs and possibilities for widening the current scope of research in the area of image and video search and retrieval in order to enable applications at the affective level of content abstraction.
We have been developing an imageretrieval system, called MIPS (Multiscalar image Processing and retrieval System), for use in uncontrolled environments. On insertion into the image database, the images are automatica...
详细信息
ISBN:
(纸本)0819427527
We have been developing an imageretrieval system, called MIPS (Multiscalar image Processing and retrieval System), for use in uncontrolled environments. On insertion into the image database, the images are automatically segmented into homogeneous regions. Generic features are computed and stored for each segment. Specifically, we maintain not only geometric and photometric attributes but also simple spatial information for each extracted region. This approach asks the user to construct queries in terms of the given primitives, i.e. regions and their spatial relations. Preliminary results show that the success of the system depends on how well the images can be modelled by homogeneous regions, on how useful the generic features are for the given application, and on the knowledge that the user puts into the formulation of the queries. A fully automatic segmentation algorithm is of paramount importance. We have designed an algorithm called Perceptual Region Growing that combines region growing, edge detection, and perceptual organization principles, without resorting to any kind of high level knowledge or interactive user intervention. Decision thresholds and quality measures are directly derived from the image data, based on image statistics. Search through critical parameter spaces is the key idea to cope with noise in uncontrolled environments. The dynamics of the region growing process is constantly monitored and exploited.
This paper presents a video indexing and representation tool for sequences which contain moving persons, using a model-based dynamic scene analysis. A scenario, describing the sequence in terms of basic events, is pro...
详细信息
ISBN:
(纸本)081941767X
This paper presents a video indexing and representation tool for sequences which contain moving persons, using a model-based dynamic scene analysis. A scenario, describing the sequence in terms of basic events, is proposed. These events constitute a first level of annotation and are used to build a visual representation of the sequence called Object Based video Icon. Experiments are carried out and a prototype system is described.
This volume 3312 of the conference proceedings contains 40 papers. Topics discussed include imageretrieval, video representation, video segmentation, intelligent tools, similarity search, image and video authenticati...
详细信息
This volume 3312 of the conference proceedings contains 40 papers. Topics discussed include imageretrieval, video representation, video segmentation, intelligent tools, similarity search, image and video authentication and videostorage and delivery.
This paper describes an API for image searching. The attempt was to isolate the functionality of the GUI from the functionality of the image search engine. The GUI would then make calls to the image search API and cou...
详细信息
This paper describes an API for image searching. The attempt was to isolate the functionality of the GUI from the functionality of the image search engine. The GUI would then make calls to the image search API and could be used with any image search engine implementing that API. Also, different methods of specifying the initial search image are discussed as well as different methods of displaying the results, including the use of 3D using vRML.
A picture knowledge base management system is described that is used to represent, organize and retrieve pictures from a frame knowledge base. Experiments with human test subjects were conducted to obtain further desc...
详细信息
ISBN:
(纸本)0819424331
A picture knowledge base management system is described that is used to represent, organize and retrieve pictures from a frame knowledge base. Experiments with human test subjects were conducted to obtain further descriptions of pictures from news magazines. These descriptions were used to represent the semantic content of pictures in frame representations. A conceptual clustering algorithm is described which organizes pictures not only on the observable features, but also on implicit properties derived from the frame representations. The algorithm uses inheritance reasoning to take into account background knowledge in the clustering. The algorithm creates clusters of pictures using a group similarity function that is based on the gestalt theory of picture perception. For each cluster created, a frame is generated which describes the semantic content of pictures in the cluster. Clustering and retrieval experiments were conducted with and without background knowledge. The paper shows how the use of background knowledge and semantic similarity heuristics improves the speed, precision, and recall of queries processed. The paper concludes with a discussion of how natural language processing of can be used to assist in the development of knowledge bases and the processing of user queries.
In an architectural database that is to be used by architects, urbanists, sociologists, geometers, etc., querying must be simplified. The aim of this work is to retrieve the images of a building that best fit a specif...
详细信息
ISBN:
(纸本)0819424331
In an architectural database that is to be used by architects, urbanists, sociologists, geometers, etc., querying must be simplified. The aim of this work is to retrieve the images of a building that best fit a specified point of view. Original data are provided in DXF and TIFF formats (maps and images respectively.) A loose linking between these two types of information is obtained through textual attributes. However, the same building is photographed several times and more than a single building can appear on a picture. After determining the point of view by simple ''clicks'' on a map, we take advantage of the geometrical description of the building in order to draw its outline. Then, the images that have been textually associated with the selected building undergo a five-steps image-processing algorithm: conversion from the RGB color-space to intensity component, Nagao filtering, oriented gradient filtering, thresholding, and correlation-based hierarchical full search matching. If the building objects are not completely masked by natural ones, the ''rectangular'' shapes of frontage and side walls correspond well to the sketch and the requested images are returned to the user.
暂无评论