We propose anew simple image coder based on Discrete Wavelet Transform (DWT). The DWT coefficients are coded in bitplanes. We use a variable order Markovian model to code the DWT coefficient bitplanes. Recently, we ha...
详细信息
ISBN:
(纸本)0819424331
We propose anew simple image coder based on Discrete Wavelet Transform (DWT). The DWT coefficients are coded in bitplanes. We use a variable order Markovian model to code the DWT coefficient bitplanes. Recently, we have developed this method that used 65 contexts(7). In this paper, the number of contexts is reduced to 34. We show the experimental results, both in terms of distortion measurement and visual comparison, and compare them to well-known methods.
This paper proposes an integrated system for supporting content-based videoretrieval and browsing over networks. An automatic semantic video object extraction technique for providing more compact video representation...
详细信息
ISBN:
(纸本)0819439932
This paper proposes an integrated system for supporting content-based videoretrieval and browsing over networks. An automatic semantic video object extraction technique for providing more compact video representation is developed. The videoimages are first partitioned into a set of homogeneous regions with accurate boundaries by integrating the results of color edge detection and region growing procedures. The object seeds, which are the intuitive and representative part of the semantic objects, are detected from these obtained homogeneous image regions. The semantic objects are then generated by a seeded region aggregation or a human interaction procedure. These obtained semantic objects are tracked along the time axis for exploiting their temporal correspondences among frames. Given the semantic video objects represented by a set of visual features, a seeded semantic video content clustering technique is developed for providing more effective video indexing, retrieval and browsing.
A novel similarity measure based on the Choquet integral was introduced for retrieving images from a image database that "mostly" fit the query image. We showed that in certain conditions the measure is a no...
详细信息
ISBN:
(纸本)0819431273
A novel similarity measure based on the Choquet integral was introduced for retrieving images from a image database that "mostly" fit the query image. We showed that in certain conditions the measure is a norm, a fact that can be used to reduce the searching time using the triangle inequality. To test the new measure, a content based imageretrieval system was built. The system was benchmarked against the visual retrieval cartridge, virage, built into Oracle 8 database system. The results suggested that the new measure is useful for imageretrieval.
This paper describes an API for image searching. The attempt was to isolate the functionality of the GUI from the functionality of the image search engine. The GUI would then make calls to the image search API and cou...
详细信息
This paper describes an API for image searching. The attempt was to isolate the functionality of the GUI from the functionality of the image search engine. The GUI would then make calls to the image search API and could be used with any image search engine implementing that API. Also, different methods of specifying the initial search image are discussed as well as different methods of displaying the results, including the use of 3D using VRML.
Development of various multimedia applications hinges on the availability of fast and efficient storage, brews browsing, indexing, and retrieval techniques. Given that video is typically stored efficiently in a compre...
详细信息
ISBN:
(纸本)0819424331
Development of various multimedia applications hinges on the availability of fast and efficient storage, brews browsing, indexing, and retrieval techniques. Given that video is typically stored efficiently in a compressed format, if we can analyze the compressed representation directly, we can avoid the costly overhead of decompressing and operating at the pixel level. Compressed domain parsing of video has been presented in earlier work where a video clip is divided into shots, subshots, and scenes.(9,11) In this paper, we describe key frame selection, feature extraction, and indexing and retrieval techniques that are directly applicable to MPEG compressed video. We develop a frame-type independent representation of the various types of frames present in an MPEG video in which all frames can be considered equivalent. Features are derived from the available DCT, macroblock, and motion vector information and mapped to a low-dimensional space where they can be accessed with standard database techniques. The spatial information is used as primacy index while the temporal information is used to enhance the robustness of the system during the retrieval process. The techniques presented enable fast archiving, indexing, and retrieval of video. Our operational prototype typically takes a fraction of a second to retrieve similar video scenes from our database, with over 95% success.
image histogram is an image feature widely used in content-based imageretrieval and video segmentation. It is simple to compute yet very effective as a feature in detecting image-to-image similarity, or frame-to-fram...
详细信息
ISBN:
(纸本)0819431273
image histogram is an image feature widely used in content-based imageretrieval and video segmentation. It is simple to compute yet very effective as a feature in detecting image-to-image similarity, or frame-to-frame dissimilarity. While the image histogram captures the global distribution of different intensities or colors well, it does not contain any information about the spatial distribution of pixels. In this paper, we propose to incorporate spatial information into the image histogram by computing features from the spatial distance between pixels belonging to the same intensity or color. In addition to the frequency count of the intensity or color, the mean, variance, and entropy of the distances are computed to form an Augmented image Histogram. Using the new feature, we preformed experiments on a set of color images and a color video sequence. Experimental results demonstrate that the Augmented image Histogram performs significantly better than the conventional color histogram, both in imageretrieval and video shot segmentation.
video content characterization is a challenging problem in videodatabases. The aim of such characterization is to generate indices that can describe a video clip in terms of objects and their actions in the clip. Gen...
详细信息
ISBN:
(纸本)0819424331
video content characterization is a challenging problem in videodatabases. The aim of such characterization is to generate indices that can describe a video clip in terms of objects and their actions in the clip. Generally, such indices are extracted by performing image analysis on video clips. Many such indices can also be generated by analyzing the embedded audio information of video clips. Indices pertaining to context, scene emotion, and actors or characters present in a video dip appear especially suitable for generation via audio analysis techniques of keyword spotting, and speech and speaker recognition. In this paper, we examine the potential of speaker identification techniques for characterizing video clips in terms of actors present in them. We describe a three-stage processing system consisting of a shot boundary detection stage, an audio classification stage, and a speaker identification stage to determine the presence of different actors in isolated shots. Experimental results using the movie Few Good Men are presented to show the efficacy of speaker identification for labeling video clips in terms of persons present in them.
In order to flexibly and efficiently store, manage, and present video data streams, continuous video data must be chopped into video objects and stored into database. This paper investigates systematic strategies for ...
详细信息
ISBN:
(纸本)0819420441
In order to flexibly and efficiently store, manage, and present video data streams, continuous video data must be chopped into video objects and stored into database. This paper investigates systematic strategies for supporting continuous and synchronized presentation of video data streams in multimedia database systems. Compressed video data streams are segmented and stored as sets of video objects coupled with specified synchronization requirements. Strategies for efficiently scheduling and buffering video objects are presented which guarantee the hiccup-free presentations of video streams. Delay effects are considered in these strategies. We propose to extend the existing object-oriented database system (OODBS) techniques to include the proposed video presentation mechanisms. We are currently designing and implementing a multimedia presentation tool (termed MediaShow) on top of O2, a well-known OODBS, as a basis for our implementation. However, the design strategies can be generally used in any OODBS environments that support C++ interface.
Real-world video-based applications require database technology that is capable of storing digital video in the form of videodatabases and providing content-based video search and retrieval. Methods for handling trad...
详细信息
Real-world video-based applications require database technology that is capable of storing digital video in the form of videodatabases and providing content-based video search and retrieval. Methods for handling traditional data storage, query, search, retrieval, and presentation cannot be extended to provide this functionality. The VDBMS research initiative is motivated by the requirements of video-based applications to search and retrieve portions of video data based on content and by the need for testbed facilities to facilitate research in the area of video database management. In this paper we describe the VDBMS video database research platform, a system that supports comprehensive and efficient database management for digital video. Our fundamental concept is to provide a full range of functionality for video as a well-defined abstract database data type, with its own description, parameters, and applicable methods. Research problems that are addressed by VDBMS to support the handling of video data include MPEG7 standard multimedia content representation, algorithms for image-based shot detection, image processing techniques for extracting low-level visual features, a high-dimensional indexing technique to access the high-dimensional feature vectors extracted by image preprocessing, multimedia query processing and optimization, new query operators, real-time stream management, a search-based buffer management policy, and an access control model for selective, content-based access to streaming video. VDBMS also provides an environment for testing the correctness and scope of new video processing techniques, measuring the performance of algorithms in a standardized way, and comparing the performance of different implementations of an algorithm or component. We are currently developing video component wrappers with well-defined interfaces to facilitate the modification or replacement of video processing components. The ultimate goal of the VDBMS project is a fle
imageretrieval systems that compare the query image exhaustively with each individual image in the database are not scalable to large databases. A scalable search system should ensure that the search time does not in...
详细信息
ISBN:
(纸本)0819431273
imageretrieval systems that compare the query image exhaustively with each individual image in the database are not scalable to large databases. A scalable search system should ensure that the search time does not increase linearly with the number of images in the database. We present a clustering based indexing technique, where the images in the database are grouped into clusters of images with similar color content using a hierarchical clustering algorithm. At search time the query image is not compared with all the images in the database, but only with a small subset. Experiments show that this clustering based approach offers a superior response time with a high retrieval accuracy. Experiments with different database sizes indicate that for a given retrieval accuracy the search time does not increase linearly with the database size.
暂无评论