In this paper we describe a framework of analyzing programs belonging to different Tv program genres using Hidden Markov Models and pseudo-semantic features derived from video shots. Clustering using Gaussian mixture ...
详细信息
ISBN:
(纸本)0819439932
In this paper we describe a framework of analyzing programs belonging to different Tv program genres using Hidden Markov Models and pseudo-semantic features derived from video shots. Clustering using Gaussian mixture models is used to determine the order of the models. Results for initial genre classification experiments using two simple features derived from video shots are given.
Searching in imagedatabases using image content has made the transition from the laboratory to consumer software. Storm Software is a pioneer in bringing these techniques to shrink-wrapped software applications, and ...
详细信息
ISBN:
(纸本)081941767X
Searching in imagedatabases using image content has made the transition from the laboratory to consumer software. Storm Software is a pioneer in bringing these techniques to shrink-wrapped software applications, and this presentation describes some of the methods we use in our products and some of the experiences we have had in bringing this new technology to consumers. We describe the scope of the problem we are trying to solve as well as some of the algorithms and interfaces we used. We also describe some of the rationales (based on theory as well as on user testing) we had for the various design decisions we made. Finally, we describe some of the challenges and opportunities we see ahead. Descriptions and screen shots of two software products implementing image searching (EasyPhoto and Apple PhotoFlash) are provided. Both products were developed by Storm Software.
作者:
Luo, MBai, XSXu, GYTsinghua Univ
Dept Comp Sci & Technol State Key Lab Intelligent Technol & Syst Beijing 100084 Peoples R China
An explosion of on-line image and video data in digital form is already well underway. With the exponential rise in interactive information exploration and dissemination through the World-Wide Web, the major inhibitor...
详细信息
ISBN:
(纸本)0819444162
An explosion of on-line image and video data in digital form is already well underway. With the exponential rise in interactive information exploration and dissemination through the World-Wide Web, the major inhibitors of rapid access to on-line video data are the management of capture and storage, and content-based intelligent search and indexing techniques. This paper proposes an approach for content-based analysis and event-based indexing of sports video. It includes a novel method to organize shots-classifying shots as close shots and far shots, an original idea of blur extent-based event detection, and an innovative local mutation-based algorithm for caption detection and retrieval. Results on extensive real Tv programs demonstrate the applicability of our approach.
In this paper, a video data model is proposed to represent the content of video data. In the proposed model, the trajectory and other properties of objects are recorded. From the trajectory, the motion events such as ...
详细信息
ISBN:
(纸本)0819439932
In this paper, a video data model is proposed to represent the content of video data. In the proposed model, the trajectory and other properties of objects are recorded. From the trajectory, the motion events such as "high speed" of an object and "increasing distance" between objects can be automatically derived. A query language named v-SQL based on the video data model is also proposed for the users to describe the content of the desired video clips. A graphical user interface is implemented for an easier query specification.
There has been work on database systems that can retrieve multimedia objects by their content. We are extending this work by using the World Wide Web as source and storage for multimedia objects much like current text...
详细信息
ISBN:
(纸本)0819424331
There has been work on database systems that can retrieve multimedia objects by their content. We are extending this work by using the World Wide Web as source and storage for multimedia objects much like current text search engines do for textual information. A system that can access all types of multimedia objects by their content is a formidable task and improvements are constantly being made to indexing techniques. We have taken an important first step in demonstrating the viability of this technique while laying the groundwork for a larger, more capable system. We have implemented a simple indexing scheme while concentrating on building the infrastructure to support this system. Our system can retrieve references to images on the WWW, index those images, and store those images using spatial access methods. We then use query by example to find a set of images on the WWW that resemble our query image. Due to its design, it is easy to include additional context features, to substitute different indexing schemes, and add other types of multimedia to our system like time sequences, voice and video.
In this paper we present a multiresolution approach for video indexing and feature matching of subband coded videodatabases. Subband coding refers to a coding technique where the input images are quantized after bein...
详细信息
ISBN:
(纸本)0819414808
In this paper we present a multiresolution approach for video indexing and feature matching of subband coded videodatabases. Subband coding refers to a coding technique where the input images are quantized after being decomposed into several narrow spatial frequency bands by filtering and decimation. Five different approaches were tested for scene change detection which is applied only on the lowest subband for computational efficiency. Two kinds of scene changes, abrupt and smoothly accumulated scene changes, mark the beginning of new scene segments. An index for each scene segment is the histogram of two representative frames, which we take to be the first and the last frame of the scene for simplicity. Using the approach of query by example, the index matching algorithm takes a multi-resolution approach by hierarchically comparing histograms at different resolutions. The search algorithm for the match between example query and its target scene segment starts from the coarsest resolution, and moves to the next finer resolution until a single match is obtained or the finest resolution is reached. Experimental results are presented, and the proposed indexing technique appears to be promising for its computational efficiency and its inherent hierarchical search procedure.
image and video indexing techniques play a key role in content-based searching in multimedia databases. A novel efficient video similarity search approach for content-based videoretrieval in a large storage device is...
详细信息
Content-based search of large image database has received significant attention recently. In this paper, we proposed a new framework, Multiple. Abstraction Level Content Based retrieval (MALCBR), for specifying and pr...
详细信息
ISBN:
(纸本)0819426628
Content-based search of large image database has received significant attention recently. In this paper, we proposed a new framework, Multiple. Abstraction Level Content Based retrieval (MALCBR), for specifying and process content-based retrieval queries on databases of images, time series, or video data. This framework allows search targets to be expressed in a object-based fashion, that allows the extensible specification of arbitrarily complex queries. In our approach, the search targets are either simple objects, specified at multiple levels of abstraction (pixel, feature and semantic levels), or composite objects, defined as collections of relation on the elements of a set of simple objects. During the search, simple objects at the semantic level are retrieved from database tables, feature level objects are computed using pre-extracted features;appropriately indexed, and pixel level objects are extracted from the raw data. Composite objects are computed at query execution time. This framework provides a powerful mechanism for specifying complicated search target and enable efficient processing of filtering of the search results.
Large amounts of databases are created due to the developments in data storage and acquisition technologies. There is a need to develop an appropriate system that will manage these entire databases. Also we need to pr...
详细信息
ISBN:
(纸本)9781479953646
Large amounts of databases are created due to the developments in data storage and acquisition technologies. There is a need to develop an appropriate system that will manage these entire databases. Also we need to precisely and effectively retrieve images from these databases for various applications. The Content Based imageretrieval (CBIR) system serves this purpose. In this paper, we introduce a user based system for CBIR in which genetic algorithm is applied. The different features of color image such as mean, standard deviation and the image bitmap are used for retrieval. In addition, the texture features such as the edge histogram of an image and the entropy of the gray level co-occurrence matrix are used. Furthermore, the genetic algorithm is applied to help the user in identifying the images which satisfy his needs for reducing the gap between the users' expectation and the retrieval results.
For the last few years, shot boundary detection has been recognized as an important research issue on videoretrieval. Also as a preliminary step for the task, it is essential to extract salient features from videos. ...
详细信息
For the last few years, shot boundary detection has been recognized as an important research issue on videoretrieval. Also as a preliminary step for the task, it is essential to extract salient features from videos. Recently, it has become common to perform the two tasks in compressed domain to alleviate their computational costs. In this paper, we propose novel shot boundary detection technique, which uses two feature images, or DC and edge images, extracted directly from MPEG compressed video. While a DC image can be easily obtained, edge image extraction usually requires considerable computational burden. For fast edge image extraction, we suggest to utilize only a few AC coefficients of each DCT block in motion compensated P-frames and B-frames as well as I-frames. This drastically reduces the computational burden compared to edge extraction in the spatial domain. In order to further reduce the computational burden, another edge image extraction technique is also suggested on the basis of AC prediction using DC images. By using the edge energy diagram obtained from edge images and histograms from DC images, shot boundaries such as abrupt transitions, fades, and dissolves are detected automatically. Simulation results show that the proposed techniques are fast and effective.
暂无评论