Advanced visual information retrieval systems supporting both video and images need to have flexible system design so that their system configurations can easily be enhanced. It is therefore desirable to separate the ...
详细信息
ISBN:
(纸本)0819411418
Advanced visual information retrieval systems supporting both video and images need to have flexible system design so that their system configurations can easily be enhanced. It is therefore desirable to separate the features of a central system into three parts: storage servers, communication servers, and a back-end network that combines these. In this architecture, unscheduled arrivals of data blocks at a back-end network cause two problems: unacceptable fluctuation of video frames and overly long delays of image transfer. To solve these problems, we have designed a new multimedia integrated switching system (MISS) that uses a fully connected crossbar switch to combine servers. MISS treats a time interval of a few hundred microseconds (called a `time-slot') as the basic unit of data block transfer, and allocates appropriate time-slots to all transfer requests in order to simultaneously meet the requirements for each kind of visual information transfer. According to simulation results and estimates based on queuing theory, MISS greatly reduces video frame fluctuation and halves the average image transfer delay. These effects have been confirmed in an experimental visual communication system built around MISS. This system supports JPEG compressed video and images, and six terminals can simultaneously retrieve visual information through an FDDI network.
This paper discusses a novel data placement scheme which optimizes the storage utilization of a NVOD system. The scheme is most distinctive in the following two aspects: 1. It considers Me file blocks placement of pro...
详细信息
ISBN:
(纸本)0819424331
This paper discusses a novel data placement scheme which optimizes the storage utilization of a NVOD system. The scheme is most distinctive in the following two aspects: 1. It considers Me file blocks placement of programs featured different number NVOD channels. 2. The file blocks grouping scheme optimizes the storage utilization of a NVOD system.
In this paper we introduce our approach to multimedia database interfaces. Although we deal mainly with imagedatabases, most of the ideas we present can be generalized to other types of data. We argue that, when deal...
详细信息
In this paper we introduce our approach to multimedia database interfaces. Although we deal mainly with imagedatabases, most of the ideas we present can be generalized to other types of data. We argue that, when dealing with complex data, such as images, the problem of access must be redefined along different lines than text databases. In multimedia databases, the semantics of the data is imprecise, and depends in part on user's interpretation. This observation made us consider the development of interfaces in which the user explores the database rather than querying it. In this paper we give a brief justification of our position and present the exploratory interface that we have developed for our image database El nino.
In this paper, we present a novel technique that can be used for fast similarity-based indexing and retrieval of both image and videodatabases in distributed environments. We assume that image or videodatabases are ...
详细信息
ISBN:
(纸本)0819429880
In this paper, we present a novel technique that can be used for fast similarity-based indexing and retrieval of both image and videodatabases in distributed environments. We assume that image or videodatabases are stored in the compressed form using standard techniques such as JPEG for images, and M-JPEG or MPEG for videos. The existing techniques, proposed in the literature, use computationally intensive features and cost functions for content-based image and videoretrieval and indexing. The proposed algorithm uses an innovative approach based on histograms of DC coefficients only, and therefore is computationally less expensive than the other approaches. In the case of a JPEG-compressed image database, the query process is the following. The user submits a request for search-by-similarity by presenting the desired image. The algorithm calculates the DC coefficients of this image and creates the histogram of DC coefficients. Then, the algorithm compares the DC histogram of the submitted image with the DC histograms of the images stored in the database using a histogram similarity metric. The image database can be local or at a remote server. In our experiments, we compared several histogram similarity metrics: weighted Euclidean distance, square difference, and absolute difference. The algorithm then selects and presents to the user the images with the smallest values of the metric that best match the submitted image. In the case of a compressed video database, the similarity-based indexing and retrieval is more complex. The manipulation of a video database consists of three main operations: (1) partitioning of the video into clips, (2) key frame extraction, and (3) indexing and retrieval of key frames. The proposed algorithm has been applied in all three steps. First, the DC histograms are implemented for partitioning each video into clips or camera shots. Then, in the next phase the same DC histograms are used to extract key frames and create a database of ke
The main aim of this paper is to describe a method for locating a subimage of a stored image that approximately matches a given query image. This matching can support naive users in accessing an image database accordi...
详细信息
ISBN:
(纸本)0819414808
The main aim of this paper is to describe a method for locating a subimage of a stored image that approximately matches a given query image. This matching can support naive users in accessing an image database according to image contents rather symbolic attributes. The query image can be either composed using painting tools or cuts out of an actual scanned image. Our method is based on the extraction of features from the query image and from the stored images. The following three steps are involved: (1) an ISODATA algorithm is applied to segment (into region) both the query image and the stored images; (2) the normalized moment and geometrical features are computed from the segmented regions, and (3) a matching process is run on the resulting features to find those stored images which should be retrieved. The result is an ordered list of stored images or subimages from the database.
This paper presents a video indexing and representation tool for sequences which contain moving persons, using a model-based dynamic scene analysis. A scenario, describing the sequence in terms of basic events, is pro...
详细信息
ISBN:
(纸本)081941767X
This paper presents a video indexing and representation tool for sequences which contain moving persons, using a model-based dynamic scene analysis. A scenario, describing the sequence in terms of basic events, is proposed. These events constitute a first level of annotation and are used to build a visual representation of the sequence called Object Based video Icon. Experiments are carried out and a prototype system is described.
The proceeding contains 41 papers from the conference on storage and retrieval for Media databases 2002. The topics discussed include: structural segmentation for multimedia content-based information retrieval;seeded ...
详细信息
The proceeding contains 41 papers from the conference on storage and retrieval for Media databases 2002. The topics discussed include: structural segmentation for multimedia content-based information retrieval;seeded image segmentation for content-based imageretrieval application;automatic classification of images on the Web;novel imageretrieval technique using salient edges;extensible feature management engine for imageretrieval;search and retrieval of imagedatabases;video segmentation;video indexing and video processing.
In recent years, learning based hashing becomes an attractive technique in large-scale imageretrieval due to its low storage and computation cost. Hashing methods map each high-dimensional vector onto a low-dimension...
详细信息
In recent years, learning based hashing becomes an attractive technique in large-scale imageretrieval due to its low storage and computation cost. Hashing methods map each high-dimensional vector onto a low-dimensional hamming space by projection operators. However, when processing high dimensional data retrieval, many existing methods including hashing cost a majority of time on projection operators. In this paper, we solve this problem by implementing a sparsity regularizer. On one hand, due to the sparse property of the projection matrix, our method effectively lower both the storage and computation cost. On the other hand, we reduce the effective number of parameters involved in the learned projection matrix according to sparsity regularizer, which helps avoid overfitting problem. Without relaxing binary constraints, an iterative scheme jointly optimizing the objective function directly was given, which helps to obtain effective and efficient binary codes. We evaluate our method on three databases and compare it with some state-of-the-art hashing methods. Experimental results demonstrate that our method outperforms the comparison approaches.
A fuzzy logic system for the detection of shot boundaries in video sequences is presented. It integrates multiple metrics and knowledge of editing procedures to detect shot boundaries. Furthermore, the system is capab...
详细信息
A fuzzy logic system for the detection of shot boundaries in video sequences is presented. It integrates multiple metrics and knowledge of editing procedures to detect shot boundaries. Furthermore, the system is capable of classifying the editing process employed to create the shot boundary into one of the following categories: abrupt cut, fade-in, fade-out, or dissolve.
We describe a visual information system prototype for searching for images and videos on the World-Wide Web. New visual information in the form of images, graphics, animations and videos is being published on the Web ...
详细信息
ISBN:
(纸本)0819424331
We describe a visual information system prototype for searching for images and videos on the World-Wide Web. New visual information in the form of images, graphics, animations and videos is being published on the Web at an incredible rate. However, cataloging this visual data is beyond the capabilities of current text-based Web search engines. In this paper, we describe a complete system by which visual information on the Web is (1) collected by automated agents, (2) processed in both text and visual feature domains, (3) catalogued and (4) indexed for fast search and retrieval. We introduce an image and video search engine which utilizes both text-based navigation and content-based technology for searching visually through the catalogued images and videos. Finally, we provide an initial evaluation based upon the cataloging of over one half million images and videos collected from the Web.
暂无评论