ISBN: 0819423181 (print)
In multimedia databases, one major class of user queries requires retrieving those database images that are spatially similar to a query image. To rank the database images with respect to the query, existing spatial similarity algorithms compute the similarity of every database image with the query. For large multimedia databases, this task is computationally expensive and makes interactive query processing difficult. In this paper, we propose an indexing scheme that eliminates images not relevant to a query before the actual similarity computation. In other words, the indexing scheme serves as a filter, and spatial similarity computation is done only on those images that pass through the filter. Some non-relevant images may pass through the filter (i.e., false positives), but the proposed indexing scheme guarantees that no relevant images are eliminated (i.e., no false negatives). The indexing scheme is robust in the sense that it recognizes translated, scaled, and rotated variants of the query image as relevant to the query.
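The filter-then-rank idea can be illustrated with a minimal sketch. The abstract does not describe the actual index, so everything below is an assumption made for illustration: images are modelled as dictionaries mapping object labels to centroid coordinates, an image counts as relevant only if it contains every object in the query, the filter is a plain inverted index on labels, and `spatial_similarity` is a toy measure based on normalized pairwise distances. All function names are hypothetical.

```python
import math
from collections import defaultdict
from itertools import combinations

def build_index(images):
    """Inverted index: object label -> ids of images containing that label."""
    index = defaultdict(set)
    for image_id, objects in images.items():
        for label in objects:
            index[label].add(image_id)
    return index

def filter_candidates(index, query):
    """Keep only images that contain every label of the query.

    Labels do not depend on position, so translated, scaled, or rotated
    copies of the query always pass (no false negatives under the
    assumed relevance definition); unrelated layouts may pass too.
    """
    label_sets = [index.get(label, set()) for label in query]
    return set.intersection(*label_sets) if label_sets else set()

def _normalized_distances(objects, labels):
    dists = [math.dist(objects[a], objects[b]) for a, b in combinations(labels, 2)]
    total = sum(dists)
    return [d / total for d in dists] if total else dists

def spatial_similarity(query, image):
    """Toy stand-in for the expensive similarity computation (assumption)."""
    labels = sorted(query)
    q = _normalized_distances(query, labels)
    d = _normalized_distances(image, labels)
    return 1.0 / (1.0 + sum(abs(a - b) for a, b in zip(q, d)))

def ranked_matches(images, index, query):
    """Run the similarity computation only on images that pass the filter."""
    scored = [(spatial_similarity(query, images[i]), i)
              for i in filter_candidates(index, query)]
    return [i for _, i in sorted(scored, key=lambda s: (-s[0], s[1]))]

images = {
    "a": {"tree": (0, 0), "house": (1, 0), "car": (0, 1)},
    "b": {"tree": (5, 5), "house": (5, 7), "car": (3, 5)},   # "a" rotated, scaled, shifted
    "c": {"tree": (0, 0), "house": (1, 0)},                  # no car: filtered out
    "d": {"tree": (0, 0), "house": (10, 0), "car": (0, 1)},  # passes filter, ranks last
}
index = build_index(images)
query = images["a"]
print(ranked_matches(images, index, query))
```

Image "d" is a false positive of the filter in this toy setup: it passes because it contains all the labels, and only the similarity step pushes it to the bottom of the ranking.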
The WWW is evolving into a predominantly visual medium. The demand for access to images and video has been increasing rapidly. Interactive video systems, which provide access to the content in video archives, are starting to emerge on the WWW. Partly due to the two-dimensional nature of the web, and partly due to the fact that the images that comprise the video are two-dimensional, most of these systems provide a VCR-like interface (play, fast-forward, reverse, etc., with additions like object selection, motion specification in the image space, and viewpoint selection). The basis of this paper is the realization that the video streams represent projections of a three-dimensional world, and that the user is interested in this three-dimensional content and not the actual configuration of pixels in the image space. In this paper, we justify this intuition by enumerating the information-bearing entities that the user is interested in, and the information specification mechanisms that allow the user to query these entities. We describe how such an intuitive system could be implemented using WWW technologies (VRML, HTML, and HTTP) and present our current WWW prototype, which is based on extensions to some of these standards. This system is built on top of our multiple perspective interactive video (MPI Video) paradigm, which provides a framework for the management of and interactive access to multiple streams of video data capturing different perspectives of related events.
In this paper, we propose a method to integrate a preexisting conventional database system with a multimedia server in the multidatabase environment. In the multidatabase environment, changes in the preexisting database system are not allowed because such changes are too expensive. For the integration, high-level semantic description of multimedia data is modeled using the enhanced entity-relationship (EER) model to support content-based retrieval of multimedia data. The EER design is translated into a schema of the preexisting database system, and then the translated schema is integrated with the preexisting database schema. The content description can be used to locate pertinent multimedia data, and the identifiers are used to access the multimedia data stored in the multimedia server. However, with only a simple schema representation of the semantic description of multimedia data, high levels of recall and precision of queries may not be obtained because conventional database systems provide only exact matching answers to the query. Thus, we extended the conventional query processing mechanism by providing a modified cooperative query answering mechanism.
In this paper we study an important problem in multimedia databases, namely, the automatic extraction of indexing information from raw data based on video contents. The goal of our research project is to develop a prototype system for automatic indexing of sports videos. The novelty of our work is that we propose to integrate speech understanding and image analysis algorithms for extracting information. The main thrust of this work comes from the observation that in news or sports video indexing, speech analysis is usually more efficient in detecting events than image analysis. Therefore, in our system, the audio processing modules are first applied to locate candidates in the whole data. This information is passed to the video processing modules, which further analyze the video. The final products of video analysis are pointers to the locations of interesting events in a video. Our algorithms have been tested extensively with real TV programs, and results are presented and discussed in the paper.
ISBN: 0819423181 (print)
Content-based indexing of images and videos based on texture features is a powerful mechanism to retrieve images and video scenes. However, the feature extraction process from these images and videos is time-consuming and is not suitable for interactive query processing. A progressive texture extraction and matching algorithm is proposed and evaluated in this paper. This algorithm takes advantage of the multi-resolution representation of an image generated by subband coding or the wavelet transformation. Starting at a resolution lower than the full resolution of an image or video, the proposed algorithm performs the feature extraction and matching hierarchically. Only those regions matched to the target template at a lower resolution level will be further compared at a higher resolution. The computation speed of this algorithm is shown to be significantly improved (up to 300%) over conventional algorithms while maintaining the same accuracy.
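The coarse-to-fine matching described above can be sketched as follows. This is an illustration under stated assumptions, not the algorithm evaluated in the paper: the low-resolution level is produced by averaging 2x2 blocks (which corresponds to the Haar low-pass band up to a scale factor), the texture feature is simply the (mean, variance) of a window, and the function names and tolerances are hypothetical.

```python
def downsample(img):
    """Halve the resolution by averaging non-overlapping 2x2 blocks."""
    h, w = len(img), len(img[0])
    return [[(img[y][x] + img[y][x + 1] + img[y + 1][x] + img[y + 1][x + 1]) / 4.0
             for x in range(0, w - 1, 2)]
            for y in range(0, h - 1, 2)]

def features(img, top, left, size):
    """Toy texture feature (assumption): mean and variance of a square window."""
    vals = [img[y][x] for y in range(top, top + size) for x in range(left, left + size)]
    mean = sum(vals) / len(vals)
    var = sum((v - mean) ** 2 for v in vals) / len(vals)
    return mean, var

def distance(f, g):
    return abs(f[0] - g[0]) + abs(f[1] - g[1])

def progressive_match(image, template, coarse_tol, fine_tol):
    """Match at half resolution first; revisit only surviving regions at full resolution."""
    size = len(template)
    c_size = size // 2
    coarse_img = downsample(image)
    target_coarse = features(downsample(template), 0, 0, c_size)
    target_fine = features(template, 0, 0, size)
    matches, fine_checks = [], 0
    for top in range(0, len(coarse_img) - c_size + 1, c_size):
        for left in range(0, len(coarse_img[0]) - c_size + 1, c_size):
            coarse = features(coarse_img, top, left, c_size)
            if distance(coarse, target_coarse) > coarse_tol:
                continue  # rejected cheaply at low resolution
            fine_checks += 1
            fine = features(image, 2 * top, 2 * left, size)
            if distance(fine, target_fine) <= fine_tol:
                matches.append((2 * top, 2 * left))
    return matches, fine_checks

# 8x8 test image made of four 4x4 regions: checkerboard, flat 0.5, flat 0.0, flat 1.0.
template = [[(x + y) % 2 for x in range(4)] for y in range(4)]
image = [[0.0] * 8 for _ in range(8)]
for y in range(4):
    for x in range(4):
        image[y][x] = float(template[y][x])
        image[y][x + 4] = 0.5
        image[y + 4][x + 4] = 1.0

matches, fine_checks = progressive_match(image, template, coarse_tol=0.01, fine_tol=0.01)
print(matches, fine_checks)
```

In this example two of the four regions are rejected at half resolution, so only two full-resolution comparisons are made. The flat 0.5 region survives the coarse pass because averaging hides the checkerboard pattern, and is rejected only at full resolution.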
ISBN: 9780897918329 (print)
We describe a novel solution to the problem of occlusion in viewing three-dimensional data. A distortion function is used to clear a line of sight to previously obscured interior elements.
A multimedia document consists of different media objects that are to be sequenced and presented according to temporal and spatial specifications. Collaborative authoring helps in simultaneous editing and viewing of a...
ISBN: 0819420417 (print)
The recent advances in multimedia systems, together with the advent of high-speed networks, paved the way to a new generation of applications. In particular, authoring environments have found in multimedia the means of increasing the richness of information contained in electronic documents. With the evolution of new computer systems that can handle multimedia information, time-based data can be integrated in electronic documents taking into account their temporal dimension. In such documents, temporal dependencies between the different media objects define a temporal structure within the document. This structure is the basic support for the representation of dependencies between data such as audio, video and virtual images. Furthermore, it allows the scheduling of presentation actions during the document presentation. The presentation of multimedia documents is dynamic, and the positioning of objects in time together with their duration have to be specified. To achieve this operation efficiently, a high-level temporal representation is needed which allows the author to specify all the temporal dependencies between multimedia objects. In this paper, we propose an interval-based temporal model and constraints which provide a basis for the management of the consistency of multimedia documents. We propose an efficient algorithm allowing the detection of a wide range of inconsistencies. The emphasis in the design of these algorithms is put on the handling of both the flexibility of temporal specifications and the indeterministic behaviour of some media objects. Furthermore, we use the logical organization of the document in nested entities to enhance the performance of the methods used for detecting inconsistencies. The aim of our approach is to fulfill the following requirements. Structured modelling: the document content is defined by a hierarchy of nested components where leaves are basic media objects and nodes composite objects. Incremental manipulation of th
The proceedings contain 55 papers. Topics discussed include satellite communication systems, Internet multimedia applications, satellite communication and broadcasting systems, direct television broadcasting, mobile communication systems, random multiple access, simulation modelling of multisatellite network systems, satellite radio communication, information stream management algorithms, low earth orbit communication networks, remote sensing, digital filters, space navigation systems, and real-time information systems.
ISBN: 0780333373 (print)
The paper describes Ethersim, a simulation tool to model and study the performance of integrated service networks that provide multimedia (audio, video, data) information access to mobile users carrying portable wireless terminals and hosts. The design process of such networks requires a proper understanding of how network and multimedia application performance is affected by (i) the choice of algorithms and protocols at various network layers, (ii) the wireless link characteristics, (iii) the presence of mobile hosts, and (iv) the host mobility patterns. Analytic approaches to study these problems suffer from intractability in such complex systems, while the inflexibility of making measurements on testbed systems makes it difficult to draw generalized conclusions. Simulation is an alternative, but available simulators provide poor support, if any, for mobility and wireless. Ethersim has been built using a discrete-event simulator core and incorporates models of user applications and of transport, network, and MAC layer protocols. It provides the capability to specify network topology and host mobility patterns. The software architecture of Ethersim employs five special entities: an air module, a map, a mover, mobile hosts, and basestations. The air module models the physical air-interface effects (e.g., RF power decay, frequency collisions, etc.). The mover is a central entity that moves the mobile hosts on the map. Ethersim allows for both random and goal-directed movements of mobile hosts, and also allows synchronized goal-directed movements to model conference-room-type mobility patterns. We also present case studies of using Ethersim to model and study the interaction of transport layer, connection rerouting protocol, and radio characteristics in a mobile and wireless ATM network at Bell Laboratories.
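To make the roles of the discrete-event core and the mover concrete, here is a minimal sketch. It is not Ethersim's code or API: the class and function names are hypothetical, the map is assumed to be a plain 2D plane, hosts are assumed to attach to the geometrically nearest basestation, and the air module and protocol models are left out entirely.

```python
import heapq
import math

class Simulator:
    """Minimal discrete-event core: a time-ordered queue of callbacks."""
    def __init__(self):
        self.now = 0.0
        self._queue = []
        self._seq = 0  # tie-breaker so callbacks are never compared

    def schedule(self, delay, action):
        heapq.heappush(self._queue, (self.now + delay, self._seq, action))
        self._seq += 1

    def run(self, until):
        while self._queue and self._queue[0][0] <= until:
            self.now, _, action = heapq.heappop(self._queue)
            action()
        self.now = until

def nearest_basestation(position, basestations):
    """Assumed association rule: attach to the closest basestation."""
    return min(basestations, key=lambda name: math.dist(position, basestations[name]))

def start_mover(sim, hosts, goals, basestations, speed, step, log):
    """Central mover: every `step` time units, advance each host toward its goal."""
    def tick():
        moving = False
        for name, pos in list(hosts.items()):
            goal = goals[name]
            remaining = math.dist(pos, goal)
            if remaining > 0:
                travel = speed * step
                if travel >= remaining:
                    pos = goal
                else:
                    pos = (pos[0] + (goal[0] - pos[0]) * travel / remaining,
                           pos[1] + (goal[1] - pos[1]) * travel / remaining)
                    moving = True
                hosts[name] = pos
            log.append((sim.now, name, nearest_basestation(pos, basestations)))
        if moving:
            sim.schedule(step, tick)
    sim.schedule(0.0, tick)

sim = Simulator()
basestations = {"bs_west": (0, 0), "bs_east": (10, 0)}
hosts = {"h1": (0, 0)}
goals = {"h1": (10, 0)}
log = []
start_mover(sim, hosts, goals, basestations, speed=1.0, step=1.0, log=log)
sim.run(until=100.0)
print(log[0], log[-1])
```

The log records which basestation each host is attached to at every mover tick, which is the kind of trace a rerouting or handover model could consume. Random or synchronized movement would replace the goal-directed update inside `tick`.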