ISBN (print): 0819427527
In this paper, we explore a technique for automatic classification of video sequences (such as TV broadcasts and movies). The technique analyzes incoming video sequences and classifies them into categories; it can be viewed as an on-line parser for video signals. We present two techniques for automatic classification. In the first technique, the incoming video sequence is analyzed to extract motion information, which is optimally projected onto a single dimension. The projected data are then used to train Hidden Markov Models (HMMs) that efficiently and accurately classify the incoming video sequence. Preliminary results with 50 test sequences (25 sports and 25 news sequences) indicate a classification accuracy of 90% for the HMMs. In the second technique, 24 full-length motion picture trailers are classified using HMMs. This classification is compared with the Internet Movie Database, and the two correlate well: only two of the 24 trailers were classified incorrectly.
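The classification step described above can be illustrated with a minimal sketch. It assumes the one-dimensional motion projection has already been quantized into three symbols (0 = low, 1 = medium, 2 = high motion) and that one discrete HMM per category has already been trained. The model parameters in MODELS are hypothetical values chosen for illustration, not figures from the paper. A sequence is assigned to the category whose HMM gives it the highest log-likelihood under the scaled forward algorithm.

```python
import math


def log_likelihood(obs, start, trans, emit):
    """Return log P(obs | model) for a discrete HMM (scaled forward algorithm).

    obs must be a non-empty list of symbol indices.
    """
    n_states = len(start)
    # Initialise with the start distribution and the first observation.
    alpha = [start[s] * emit[s][obs[0]] for s in range(n_states)]
    log_prob = 0.0
    for t in range(len(obs)):
        if t > 0:
            # Propagate one step through the transition matrix, then emit.
            alpha = [
                sum(alpha[p] * trans[p][s] for p in range(n_states)) * emit[s][obs[t]]
                for s in range(n_states)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # the model cannot produce this sequence
        log_prob += math.log(scale)
        alpha = [a / scale for a in alpha]  # rescale to avoid underflow
    return log_prob


def classify(obs, models):
    """Pick the category whose HMM assigns obs the highest log-likelihood."""
    return max(models, key=lambda name: log_likelihood(obs, *models[name]))


# Hypothetical two-state models: (start, transition, emission) per category.
MODELS = {
    "sports": ([0.5, 0.5], [[0.7, 0.3], [0.3, 0.7]], [[0.1, 0.3, 0.6], [0.2, 0.4, 0.4]]),
    "news": ([0.5, 0.5], [[0.9, 0.1], [0.1, 0.9]], [[0.7, 0.2, 0.1], [0.5, 0.4, 0.1]]),
}

print(classify([2, 2, 1, 2, 2], MODELS))  # mostly high motion
print(classify([0, 0, 0, 1, 0], MODELS))  # mostly low motion
```

With these illustrative parameters the first sequence is labeled "sports" and the second "news". In practice the models would be trained (for example with Baum-Welch) rather than written by hand.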
ISBN (print): 0819427527
We have been developing an image retrieval system, called MIPS (Multiscalar Image Processing and Retrieval System), for use in uncontrolled environments. On insertion into the image database, the images are automatically segmented into homogeneous regions. Generic features are computed and stored for each segment. Specifically, we maintain not only geometric and photometric attributes but also simple spatial information for each extracted region. This approach asks the user to construct queries in terms of the given primitives, i.e. regions and their spatial relations. Preliminary results show that the success of the system depends on how well the images can be modelled by homogeneous regions, on how useful the generic features are for the given application, and on the knowledge that the user puts into the formulation of the queries. A fully automatic segmentation algorithm is therefore of paramount importance. We have designed an algorithm called Perceptual Region Growing that combines region growing, edge detection, and perceptual organization principles, without resorting to any kind of high-level knowledge or interactive user intervention. Decision thresholds and quality measures are derived directly from the image data, based on image statistics. Searching through critical parameter spaces is the key idea for coping with noise in uncontrolled environments. The dynamics of the region growing process are constantly monitored and exploited.
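To make the region growing component concrete, here is a minimal sketch of plain region growing on a grayscale image. It is not the Perceptual Region Growing algorithm itself, since the edge detection and perceptual organization parts are omitted. The rule used to derive the threshold from image statistics (a fraction k of the global standard deviation) is an assumption made for illustration.

```python
from collections import deque
from statistics import pstdev


def grow_regions(image, k=0.5):
    """Label 4-connected homogeneous regions of a grayscale image.

    image is a non-empty list of equal-length rows of numbers. A pixel joins
    a region when it lies within `threshold` of the region's running mean.
    The threshold rule (k times the global standard deviation) is hypothetical.
    """
    height, width = len(image), len(image[0])
    threshold = k * pstdev([p for row in image for p in row])
    labels = [[-1] * width for _ in range(height)]
    next_label = 0
    for seed_y in range(height):
        for seed_x in range(width):
            if labels[seed_y][seed_x] != -1:
                continue  # pixel already belongs to a region
            labels[seed_y][seed_x] = next_label
            total, count = image[seed_y][seed_x], 1
            queue = deque([(seed_y, seed_x)])
            while queue:
                y, x = queue.popleft()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if 0 <= ny < height and 0 <= nx < width and labels[ny][nx] == -1:
                        if abs(image[ny][nx] - total / count) <= threshold:
                            labels[ny][nx] = next_label
                            total += image[ny][nx]
                            count += 1
                            queue.append((ny, nx))
            next_label += 1
    return labels


image = [
    [10, 10, 200, 200],
    [10, 12, 198, 200],
]
for row in grow_regions(image):
    print(row)
```

On this toy image the dark left half and the bright right half come out as two separate regions, with no threshold supplied by the user.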
A novel similarity measure based on the Choquet integral was introduced for retrieving images from an image database that 'mostly' fit the query image. We showed that under certain conditions the measure is a norm, a fact that can be used to reduce search time by means of the triangle inequality. To test the new measure, a content-based image retrieval system was built. The system was benchmarked against the Virage visual retrieval cartridge built into the Oracle 8 database system. The results suggested that the new measure is useful for image retrieval.
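The search-time saving from the triangle inequality can be sketched as follows. For any distance d induced by a norm, |d(q, p) - d(p, x)| <= d(q, x), so distances from a fixed pivot p to every database item x, computed once in advance, give a cheap lower bound on d(q, x) for a query q. The Euclidean distance below is only a stand-in for the Choquet-integral measure, and the pivot choice and function names are assumptions made for illustration.

```python
import math


def euclidean(a, b):
    """Stand-in metric; the paper's Choquet-based measure would replace this."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def nearest_with_pruning(query, database, pivot, dist=euclidean):
    """Return (best_index, best_distance, number_of_full_distance_evaluations)."""
    # In a real system these pivot distances are computed once, offline.
    pivot_dists = [dist(pivot, item) for item in database]
    d_qp = dist(query, pivot)
    best_idx, best_d, evaluated = -1, float("inf"), 0
    for i, item in enumerate(database):
        # Triangle inequality lower bound on dist(query, item).
        lower_bound = abs(d_qp - pivot_dists[i])
        if lower_bound >= best_d:
            continue  # this item cannot beat the current best match
        d = dist(query, item)
        evaluated += 1
        if d < best_d:
            best_idx, best_d = i, d
    return best_idx, best_d, evaluated


database = [(0.0, 0.0), (10.0, 0.0), (0.5, 0.0), (20.0, 0.0)]
idx, d, evaluated = nearest_with_pruning((0.4, 0.0), database, pivot=(0.0, 0.0))
print(idx, round(d, 3), evaluated)
```

In this example the nearest item (index 2) is found after only two of the four full distance evaluations, because the other two items are ruled out by the lower bound alone.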
A scheme is introduced that considerably reduces the storage needed by a content-based image retrieval (CBIR) system such as ASSERT without reducing its retrieval effectiveness. The scheme enables ASSERT to retrieve the most similar physician-marked key images from other patients, together with the adjacent images that cohere with those key images. Only the key images and the images that cohere with them are retained for archival purposes. This significantly reduces the amount of storage needed for fast retrieval.
Multimedia data are generally stored in compressed form in order to use the available storage facilities efficiently. Access to multimedia archives thus depends on our ability to browse compressed information. In this paper, a novel approach to multiple object tracking in compressed multimedia databases is presented. This approach is intended to operate in a distributed environment, where users initiate video searches and retrieve relevant video information simultaneously from multiple compressed video archives. The system operates on the compressed video to find and track objects of interest and determine their positions in the image. This enables more complex query formulations in terms of the relative positions of the target objects in the image. Filtering and analysis of motion information (motion vectors) are used to track objects in the video bit stream. Once the search has terminated, the system may decompress and display the query-relevant video sequences upon request. (C) 2000 Academic Press.
Enabling the efficient storage, access and retrieval of large volumes of multi-dimensional data is one of the important emerging problems in databases. We present a framework for adaptively storing, accessing, and retrieving large images. The framework uses a space and frequency graph to generate and select image view elements for storing in the database. By adapting to user access patterns, the system selects and stores those view elements that yield the lowest average cost for accessing the multi-resolution sub-region image views. The system uses a second adaptation strategy to divide computation between server and client in progressive retrieval of image views using view elements. We show that the system speeds up retrieval for access and retrieval modes such as drill-down browsing and remote zooming and panning, and that it minimizes the amount of data transferred over the network.
ISBN (print): 0819414808
Computer-assisted content-based indexing is a critical enabling technology and currently a bottleneck in the productive use of video resources. This paper presents the Video Classification Project, an effort toward automating content-based video indexing and retrieval, at the Institute of Systems Science of the National University of Singapore. We discuss in detail three goals of the project: image processing tools for video parsing, feature extraction and retrieval; a knowledge-based approach to representing video content; and stratified tools which allow greater flexibility in browsing a video resource, either before or after performing specific retrieval operations.
ISBN (print): 0819427527
With the increasing popularity of content-based image manipulation, many color-based image retrieval techniques have been proposed in the literature. This paper first presents a systematic comparative study of 8 representative techniques, using a database of 200 images of flags and trademarks. The techniques were chosen to cover the variations in the color models used, in the characteristic color features employed, and in the distance measures calculated for judging the similarity of color images. The results of the study are presented both as lists of retrieved images for subjective visual inspection and as retrieval ratios computed for objective judgment. Both show that the cumulative-histogram-based techniques using Euclidean distance measures in two perception-related color spaces give the best results among the 8 techniques under consideration. Starting from the best-performing techniques, further work on improving their retrieval capability was carried out, resulting in 2 new techniques that use local cumulative histograms. The new techniques were tested on a database of 400 images of real flowers, which are quite complicated in color content. Satisfactory results, compared with those obtained using the existing cumulative-histogram-based techniques, are presented.
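The cumulative-histogram comparison can be sketched in a few lines. The example below is a simplification: it handles a single color channel with integer values in the range 0 to 255, and the bin count and function names are assumptions made for illustration rather than the settings used in the study. Each bin of a cumulative histogram holds the fraction of pixels at or below that bin, and two images are compared by the Euclidean distance between their cumulative histograms.

```python
import math


def cumulative_histogram(values, bins=8, max_value=256):
    """Normalized cumulative histogram of one channel (values in 0..max_value-1).

    values must be a non-empty list of integers.
    """
    counts = [0] * bins
    for v in values:
        counts[v * bins // max_value] += 1
    cumulative, running = [], 0
    for c in counts:
        running += c
        cumulative.append(running / len(values))
    return cumulative


def cumulative_distance(values_a, values_b, bins=8):
    """Euclidean distance between the cumulative histograms of two channels."""
    hist_a = cumulative_histogram(values_a, bins)
    hist_b = cumulative_histogram(values_b, bins)
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(hist_a, hist_b)))


dark = [10] * 4
slightly_lighter = [40] * 4
bright = [250] * 4
print(cumulative_distance(dark, slightly_lighter))  # small color shift
print(cumulative_distance(dark, bright))            # large color shift
```

A small color shift yields a smaller distance than a large one here (1.0 versus roughly 2.65), whereas a plain histogram would rate both pairs as equally dissimilar because neither pair shares any bin.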
A highly integrated wavelet-based image management system is proposed. Solutions are derived for three key aspects of image management: content-based image retrieval (CBIR), image compression/decompression, and image transmission. By exploiting the properties of wavelets and integrating these key aspects of image management, the system achieves high overall performance.
A nine-direction lower-triangular (9DLT) matrix describes the relative spatial relationships among the objects in a symbolic image. In this paper, the 9DLT matrix is transformed into a linear string, called the 9DLT string. Based on the 9DLT string, two similarity metrics for image matching, simpler but more precise, are provided to solve the subimage and similar image retrieval problems. Moreover, a common component binary tree (CCBT) structure is refined to store a set of 9DLT strings. The revised CCBT structure not only eliminates the redundant information among those 9DLT strings, but also reduces the processing time for determining the image matching distances between query frames and video frames. Experiments indicate that the storage space and the processing time are greatly reduced by the revised CCBT structure. A fast dynamic programming approach is also proposed to handle the problem of sequence matching between a query frame sequence and a video frame sequence. (C) 2001 Academic Press.
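As a rough illustration of the 9DLT idea, the sketch below derives one direction code for every pair of objects in a symbolic image and flattens the lower-triangular matrix row by row into a list. The numbering of the nine codes (0 for the same position, then 1 to 8 counter-clockwise starting from east), the y-axis pointing up, and the row-major flattening are all assumptions made for illustration. The paper's exact code layout and string format are not given in the abstract.

```python
# Assumed code layout: 0 = same position, 1..8 counter-clockwise from east.
DIRECTION_CODES = {
    (0, 0): 0,
    (1, 0): 1, (1, 1): 2, (0, 1): 3, (-1, 1): 4,
    (-1, 0): 5, (-1, -1): 6, (0, -1): 7, (1, -1): 8,
}


def direction_code(a, b):
    """Code for where point b lies relative to point a (y axis pointing up)."""
    dx = (b[0] > a[0]) - (b[0] < a[0])  # sign of the x offset: -1, 0 or 1
    dy = (b[1] > a[1]) - (b[1] < a[1])  # sign of the y offset: -1, 0 or 1
    return DIRECTION_CODES[(dx, dy)]


def ninedlt_string(objects):
    """Flatten the lower-triangular direction matrix of a symbolic image.

    objects maps an object name to its (x, y) centroid. Names are sorted so
    that two images with the same objects produce comparable strings.
    """
    names = sorted(objects)
    codes = []
    for i in range(1, len(names)):
        for j in range(i):
            codes.append(direction_code(objects[names[j]], objects[names[i]]))
    return names, codes


names, codes = ninedlt_string({"A": (0, 0), "B": (1, 0), "C": (1, 1)})
print(names, codes)
```

For n objects the string holds n(n-1)/2 codes, so under these assumptions two symbolic images containing the same objects can be compared position by position.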