Histograms are the most prevalently used representation for the color content of images and video. An elaborate representation of the histograms requires specifying the color centers of the histogram bins and the coun...
详细信息
ISBN:
(纸本)0819435902
Histograms are the most prevalently used representation for the color content of images and video. An elaborate representation of the histograms requires specifying the color centers of the histogram bins and the count of the number of image pixels with that color. Such an elaborate representation, though expressive, may not be necessary for some tasks in image search, filtering and retrieval. A qualitative representation of the histogram is sufficient for many applications. Such a representation will be compact and greatly simplify the storage and transmission of the image representation. It will also reduce the computational complexity of search and filtering algorithms without adversely affecting the quality. We present such a compact binary descriptor for color representation. This descriptor is the quantized Haar transform coefficients of the color histograms. We show the use of this descriptor for fast retrieval of similar images and search for similar video segments from a large database. We also show the use of this descriptor for browsing large imagedatabases without the need for computationally expensive clustering algorithms. The compact nature of the descriptor and the associated simple similarity measure allows searching over a database of about four hours of video in less than 5-6 seconds without the use of any sophisticated indexing scheme.
A different approach to content-based retrieval and a novel framework for classification of visual information are proposed. The Visual Apprentice which is an implementation of the framework for still images and video...
详细信息
A different approach to content-based retrieval and a novel framework for classification of visual information are proposed. The Visual Apprentice which is an implementation of the framework for still images and video that uses a combination of lazy-learning, decision trees, and evolution programs for classification and grouping is introduced. Examples and results are given to demonstrate the applicability of the proposed approach to perform visual classification and detection.
This paper presents a novel approach for videoretrieval from a large archive of MPEG or Motion JPEG(1) compressed video clips. We introduce a retrieval algorithm that takes a video clip as a query and searches the da...
详细信息
ISBN:
(纸本)0819424331
This paper presents a novel approach for videoretrieval from a large archive of MPEG or Motion JPEG(1) compressed video clips. We introduce a retrieval algorithm that takes a video clip as a query and searches the database for clips with similar contents. video clips are characterized by a sequence of representative frame signatures, which are constructed from DC coefficients and motion information (''DC+M'' signatures). The similarity between two video clips is determined by using their respective signatures. This method facilitates retrieval of clips for the purpose of video editing, broadcast news retrieval, or copyright violation detection.
This book provides an in-depth treatment of the three important topics related to image and videodatabases: restoration, watermarking and retrieval . It is the result of the participation of the Delft University of ...
ISBN:
(数字)9780080508474
ISBN:
(纸本)9780444505026
This book provides an in-depth treatment of the three important topics related to image and videodatabases: restoration, watermarking and retrieval . It is the result of the participation of the Delft University of Technology in the European Union ACTS program, a pre-competitive R&D program on Advanced Communications Technologies and Services (1994-1998). In particular the book has benefited from participation in the AURORA and SMASH projects respectively automated film and video restoration and storage for multimedia systems (watermarking & retrieval).
Content based retrieval on large multimedia database attracts the interests of many researchers, but the database architecture needed for content based retrieval is still an open problem. Traditional relation database...
详细信息
Content based retrieval on large multimedia database attracts the interests of many researchers, but the database architecture needed for content based retrieval is still an open problem. Traditional relation database system does not support the high-dimension feature form content description and indexing, thus is limited in its content based retrieval function. Some systems do support high-dimension feature form content description and indexing, but lacks descriptions and query expressions on media object content and relations. In this paper, we present our study results on query mechanism and proposed CbExpr - a powerful flexible query expression mechanism on media object. Based on CbExpr we proposed GMA (general mediabase architecture) - a general architecture for management and content based retrieval on large media databases, and videoBase - a content based videoretrieval system is present as example of GMA. Basic thoughts, considerations, and definitions are presented in the paper, also with some implementation details.
A prototype of the content-based imageretrieval system is implemented based on the algorithms introduced in this paper. The image contents at the high levels are extracted. The fuzzy C-means classifier is employed to...
详细信息
A prototype of the content-based imageretrieval system is implemented based on the algorithms introduced in this paper. The image contents at the high levels are extracted. The fuzzy C-means classifier is employed to compute the object clusters and provide useful information for overlapped clusters. The automatic image segmentation and categorisation is achieved. To obtain the context for imageretrieval, the subjective context and the objective context are modelled by means of the fuzzy sets theory. The system is able to trace the users' interactions during retrieval. The refinements of the retrieval results can be made while the users are submitting the queries telling the specific requirements.
Color is one of the most widely used features for image similarity retrieval. Most of the existing image similarity retrieval schemes employ either global or local color histogramming. In this paper, we explore the us...
详细信息
ISBN:
(纸本)0819427527
Color is one of the most widely used features for image similarity retrieval. Most of the existing image similarity retrieval schemes employ either global or local color histogramming. In this paper, we explore the use of localized dominant hue and saturation values for color-based image similarity retrieval. This scheme results in a relatively compact representation of color images for similarity retrieval. Experimental results comparing the proposed representation with global and local color histogramming are presented to show the efficacy of the suggested scheme.
In this paper me present a framework for content based query and retrieval of information from large videodatabases. This framework enables content based retrieval of video sequences by characterizing the sequences u...
详细信息
ISBN:
(纸本)0819424331
In this paper me present a framework for content based query and retrieval of information from large videodatabases. This framework enables content based retrieval of video sequences by characterizing the sequences using motion, texture and colorimetry cues. This characterization is biologically inspired and results in a compact parameter space where every segment of video is represented by an 8 dimensional vector. Searching and retrieval is done in real-time with accuracy in this parameter space. Using this characterization, we then evolve a set of discriminators using Genetic Programming. Experiments indicate that these discriminators are capable of analyzing and characterizing video. The videoBook is able to search and retrieve video sequences with 92% accuracy in real-time. Experiments thus demonstrate that the characterization is capable of extracting higher level structure from raw pixel values.
The large amount of available multimedia information (e.g. videos, audio, images) requires efficient and effective annotation and retrieval methods. As videos start playing a more important role in the frame of multim...
详细信息
ISBN:
(纸本)0819424331
The large amount of available multimedia information (e.g. videos, audio, images) requires efficient and effective annotation and retrieval methods. As videos start playing a more important role in the frame of multimedia, we want to make these available for content-based retrieval. The imageMiner-System, which was developed at the University of Bremen in the AI group, is designed for content-based retrieval of single images by a new combination of techniques and methods from computer vision and artificial intelligence. In our approach to make videos available for retrieval in a large database of videos and images there are two necessary steps: First, the detection and extraction of shots from a video, which is done by a histogram based method and second, the construction of the separate frames in a shot to one still single image. This is performed by a mosaicing-technique. The resulting mosaiced image gives a one image visualization of the shot and can be analyzed by the the imageMiner-System. imageMiner has been tested on several domains, (e.g. landscape images, technical drawings), which cover a wide range of applications.
With the abstraction of digital video as the corresponding binary video- a process which upon numerous subjective experimentation seems to preserve (most of the) intelligibility of video content- we can pursue a preci...
详细信息
With the abstraction of digital video as the corresponding binary video- a process which upon numerous subjective experimentation seems to preserve (most of the) intelligibility of video content- we can pursue a precise and analytic approach to (digital videostorage and retrieval) algorithm design that are based upon geometrical (morphological) intuition. The foremost and tangible general benefit of such abstraction, however, is the immediate reductions of both data and computational complexities involved in implementing various algorithms and databases. The general paradigm presented may be utilized to address all issues pertaining to video library construction including visualization, optimum feedback query generation, object recognition, e.t.c., but the primary focus of attention in this paper are the ones pertaining to detection of fast (including presence of flashlights) and gradual scene changes (such as dissolves, fades, and various special effects such as wipes). Upon simulation we observed that we can achieve performances comparable to those of others with drastic reductions in both storage and computational complexities. Furthermore, since the conversion from grayscale to binary videos can be performed directly (with minimal additional computation) in the compressed domain by thresholding on the DCT DC coefficients themselves (or by using the contour information attached to MPEG4 formats), the algorithms presented herein are ideally suited for performing fast (on-the-fly) determinations of scene change, object recognition and/or tracking, and other more intelligent tasks traditionally requiring heavy demand on computational and/or storage complexities. The fast determinations may then be used on their own merits or can be used in conjunction or complementation with other higher-layer information in the future.
暂无评论