Recent advances in computing and communications are leading to the creation of large-scale databases of visual and multimedia information. Such databases are finding ready applications in a wide range of fields such as advertising and marketing, education and training, entertainment, medicine, and remote sensing. Because of the very nature of visual and multimedia data, new and innovative methods are called for in modeling, processing, organizing, and indexing this data for efficient storage, management, access, and delivery of the content. The goal of this special section is to highlight new research areas and explore technology frontiers by soliciting and selecting quality papers that address various aspects of storage, search, retrieval, and processing of digital media data (video/image/audio) across a wide range of research disciplines. For this special section, we solicited papers in relevant areas, ranging from content capture and processing, database management, new results on similarity measures and semantic features, and query methods, to multimedia systems embracing leading-edge technology and special applications of media management, retrieval, and processing across multiple fields, from consumer media systems, digital libraries, and media imaging to remote sensing. We encouraged the authors of some very promising papers in the SPIE/IS&T Electronic Imaging Symposium (San Jose) 2001 Conference on Storage and Retrieval for Media Databases to submit a full journal-quality version of their conference manuscript to this special section. We also received submissions from outside the conference, in response to our general call for papers. All submitted papers underwent multiple peer reviews, and we accepted 12 papers after two rounds of revisions and follow-up reviews. We have classified the 12 papers into four major categories: (A) Content-based image retrieval (the first two papers), (B) Content-based video retrieval (the next
ISBN: 0769513549 (print)
The article looks into a new direction in multimedia content analysis: the extraction and modeling of the affective content of an arbitrary video. The affective content is viewed as the amount of feeling/emotion contained in and mediated by a video toward a viewer. The ability to automatically extract video content of this nature will lead to a high level of personalization in broadcast delivery to private users, and will considerably broaden the possibilities for efficiently handling and presenting large amounts of audio-visual data stored in emerging video databases. The technique we have developed uses the so-called "dimensional approach to affect", a concept underpinned by psychophysiology studies. Our computational method sets out to represent the affective content as feature points in the so-called 2D emotion space. From low-level video characteristics, we obtain one time curve for each of the two affect dimensions (arousal and valence) of a video. Combining the two time curves results in the so-called affect curve, which is regarded as a reliable representation of transitions from one feeling to another along a video, as perceived by a viewer. We illustrate the success of our technique on excerpts taken from an action movie and a typical soccer game.
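The combination of the two time curves described above can be sketched as follows. This is only a minimal illustration, assuming the arousal and valence curves are already available as equal-length lists of per-frame values; the function name `affect_curve` and the axis convention (valence first, arousal second) are hypothetical and are not taken from the abstract.

```python
def affect_curve(arousal, valence):
    """Pair per-frame arousal and valence samples into points of a 2D emotion space.

    Assumption: both curves are sampled at the same time instants.
    """
    if len(arousal) != len(valence):
        raise ValueError("arousal and valence curves must have the same length")
    # Each point is (valence, arousal); this axis order is an assumed convention.
    return list(zip(valence, arousal))


# Hypothetical sample values, for illustration only.
arousal = [0.1, 0.4, 0.9]
valence = [-0.2, 0.0, 0.5]
curve = affect_curve(arousal, valence)
```

Traversing `curve` in order then traces the path through the emotion space over the duration of the video.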
We investigate the image authentication system, SARI, proposed by C.Y. Lin and S.F. Chang (see SPIE Storage and Retrieval for Image and Video Databases, 1998), that distinguishes JPEG compression from malicious manipulations. In particular, we look at the image digest component of this system. We show that if multiple images have been authenticated with the same secret key and the digests of these images are known to an attacker, Oscar, then he can cause arbitrary images to be authenticated with this same but unknown key. We show that the number of such images needed by Oscar to launch a successful attack is quite small, making the attack very practical. We then suggest possible solutions to enhance the security of this authentication system.
Multimedia data are generally stored in compressed form in order to efficiently utilize the available storage facilities. Access to multimedia archives is thus dependent on our ability to browse compressed information. In this paper, a novel approach to multiple object tracking from compressed multimedia databases is presented. This approach is intended to operate in a distributed environment, where users initiate video searches and retrieve relevant video information simultaneously from multiple compressed video archives. The system operates on the compressed video to find and track objects of interest and determine their positions in the image. This enables more complex query formulations in terms of the relative positions of the target objects in the image. Motion information (motion vectors) is filtered and analyzed to track objects in the video bit stream. Once the search has terminated, the system may decompress and display the query-relevant video sequences upon request. © 2000 Academic Press.
ISBN: 9780080508474 (digital)
ISBN: 9780444505026 (print)
This book provides an in-depth treatment of three important topics related to image and video databases: restoration, watermarking, and retrieval. It is the result of the participation of the Delft University of Technology in the European Union ACTS program, a pre-competitive R&D program on Advanced Communications Technologies and Services (1994-1998). In particular, the book has benefited from participation in the AURORA and SMASH projects, on automated film and video restoration and on storage for multimedia systems (watermarking and retrieval), respectively.
ISBN: 0819435902 (print)
Histograms are the most prevalently used representation for the color content of images and video. An elaborate representation of a histogram requires specifying the color center of each histogram bin and the count of image pixels with that color. Such an elaborate representation, though expressive, may not be necessary for some tasks in image search, filtering, and retrieval. A qualitative representation of the histogram is sufficient for many applications. Such a representation is compact and greatly simplifies the storage and transmission of the image representation. It also reduces the computational complexity of search and filtering algorithms without adversely affecting the quality. We present such a compact binary descriptor for color representation. This descriptor consists of the quantized Haar transform coefficients of the color histogram. We show the use of this descriptor for fast retrieval of similar images and for searching for similar video segments in a large database. We also show the use of this descriptor for browsing large image databases without the need for computationally expensive clustering algorithms. The compact nature of the descriptor and the associated simple similarity measure allow searching over a database of about four hours of video in less than 5-6 seconds without the use of any sophisticated indexing scheme.
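The idea of a binary descriptor built from quantized Haar coefficients of a histogram can be sketched as below. This is a minimal illustration under stated assumptions, not the authors' descriptor: the abstract does not give the quantization rule or the similarity measure, so the sketch assumes one-bit sign quantization of the non-DC coefficients and a Hamming distance for comparison. All function names are hypothetical.

```python
def haar_transform(values):
    """Unnormalized 1D Haar decomposition; the length must be a power of two."""
    n = len(values)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    out = list(values)
    length = n
    while length > 1:
        half = length // 2
        # Pairwise averages (coarse part) and pairwise half-differences (detail part).
        averages = [(out[2 * i] + out[2 * i + 1]) / 2 for i in range(half)]
        details = [(out[2 * i] - out[2 * i + 1]) / 2 for i in range(half)]
        out[:length] = averages + details
        length = half
    return out


def binary_descriptor(histogram):
    """Assumed quantization: one bit per non-DC coefficient, 1 if positive, else 0."""
    coeffs = haar_transform(histogram)
    return [1 if c > 0 else 0 for c in coeffs[1:]]


def hamming_distance(a, b):
    """Assumed similarity measure: number of differing bits."""
    return sum(x != y for x, y in zip(a, b))


# Hypothetical 4-bin histograms, for illustration only.
desc_a = binary_descriptor([8, 2, 1, 1])
desc_b = binary_descriptor([1, 1, 2, 8])
distance = hamming_distance(desc_a, desc_b)
```

Because each descriptor is only a short bit string, comparing two images reduces to counting differing bits, which is consistent with the fast, index-free search the abstract reports.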
ISBN: 0780365364 (print)
With the advent of pervasive computing, a growing diversity of client devices is gaining access to audio-visual content. The increased variability in client device processing power, storage, bandwidth, and server loading requires adaptive solutions for image, video, and audio retrieval. Progressive retrieval is one prominent mode of access in which views at different resolutions are incrementally retrieved and refined over time. In this paper, we present a new framework for adaptively partitioning the synthesis operations in progressive retrieval of audio-visual signals. In this framework, the server and client cooperate in synthesizing the views in order to best utilize the available processing power and bandwidth. We provide experimental results that demonstrate a significant reduction in latency in the progressive retrieval of images under different conditions of the client, server, and network.
ISBN: 0780362985 (print)
Huge video databases require an effective video indexing method. While manual indexing is the most effective approach to this goal, it is slow and expensive. Automatic indexing is therefore desirable, and various indexing tools for video databases have recently been developed. For efficient video indexing and retrieval, the similarity measure is an important factor. This paper presents new similarity measures between frames and proposes a new algorithm to detect scene changes using a cross entropy defined between two histograms. Experimental results show that the proposed algorithm is fast and effective compared with several conventional algorithms for detecting abrupt scene changes and gradual transitions, including fade in/out and flashlight scenes.
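Histogram-based scene-change detection with an entropy-style distance can be sketched as follows. The abstract does not define its cross entropy, so this sketch substitutes a symmetric Kullback-Leibler divergence between normalized frame histograms and flags a cut when it exceeds a threshold. The function names, the smoothing constant `eps`, and the threshold value are all assumptions, and the sketch handles only abrupt cuts, not gradual transitions.

```python
import math


def normalize(hist, eps=1e-9):
    """Turn bin counts into probabilities; eps (assumed) avoids log(0) on empty bins."""
    total = sum(hist) + eps * len(hist)
    return [(h + eps) / total for h in hist]


def symmetric_cross_entropy(hist_p, hist_q):
    """Symmetric Kullback-Leibler divergence, KL(p||q) + KL(q||p).

    Stand-in for the paper's cross entropy, whose exact form is not given here.
    """
    p = normalize(hist_p)
    q = normalize(hist_q)
    return sum((pi - qi) * math.log(pi / qi) for pi, qi in zip(p, q))


def detect_cuts(frame_histograms, threshold):
    """Return indices i where frame i differs strongly from frame i - 1."""
    cuts = []
    for i in range(1, len(frame_histograms)):
        distance = symmetric_cross_entropy(frame_histograms[i - 1], frame_histograms[i])
        if distance > threshold:
            cuts.append(i)
    return cuts


# Hypothetical 3-bin histograms for three consecutive frames.
frames = [[8, 1, 1], [7, 2, 1], [1, 1, 8]]
cuts = detect_cuts(frames, threshold=1.0)
```

In this toy input the first two frames have similar color distributions while the third differs sharply, so only the boundary before the third frame is reported.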
ISBN: 0780362985 (print)
Dimensionality reduction methods are of interest in applications such as content-based image and video retrieval. In large multimedia databases, it may not be practical to search through the entire database in order to retrieve the nearest neighbors of a query. Good data structures for similarity search and indexing are needed, and the existing data structures do not scale well for high-dimensional multimedia descriptors. We investigate the use of weighted multi-dimensional scaling (WMDS) for dimensionality reduction. The main objective of WMDS is to preserve the local topology of the high-dimensional space, i.e., to map the nearest neighbors in the high-dimensional space to nearest neighbors in the lower-dimensional space. In addition to the well-known retrieval accuracy as a measure of performance, we propose two additional measures that take into account the ordinal relationships among the nearest neighbors. Experimental results are given.
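The goal of mapping nearest neighbors to nearest neighbors can be checked with a simple overlap score, sketched below. This is not WMDS and not one of the two ordinal measures the abstract proposes, which are not specified here. It is an assumed, order-insensitive measure in the spirit of retrieval accuracy: the fraction of each point's k nearest neighbors in the original space that remain among its k nearest neighbors after reduction. The function names are hypothetical, and the example "reduction" simply drops a coordinate as a stand-in.

```python
import math


def knn(points, query_index, k):
    """Indices of the k points closest to points[query_index] (Euclidean distance)."""
    query = points[query_index]
    others = [i for i in range(len(points)) if i != query_index]
    others.sort(key=lambda i: math.dist(query, points[i]))
    return others[:k]


def neighbor_overlap(high, low, k):
    """Average fraction of k nearest neighbors preserved by the reduction."""
    preserved = 0
    for i in range(len(high)):
        preserved += len(set(knn(high, i, k)) & set(knn(low, i, k)))
    return preserved / (k * len(high))


# Hypothetical 3D descriptors and a stand-in 2D reduction (third coordinate dropped).
high = [(0, 0, 0), (1, 0, 9), (0, 3, 0), (7, 7, 0)]
low = [(x, y) for x, y, _ in high]
score = neighbor_overlap(high, low, k=1)
```

A score of 1.0 would mean every neighbor set survives the reduction. Here the first point's nearest neighbor changes once the third coordinate is discarded, so the score falls below 1.0.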