Multimedia information is now routinely available in the forms of text, pictures, animation and sound. Although text objects are relatively easy to deal with (in terms of information search and retrieval), other infor...
详细信息
ISBN:
(纸本)0819420441
Multimedia information is now routinely available in the forms of text, pictures, animation and sound. Although text objects are relatively easy to deal with (in terms of information search and retrieval), other information bearing objects (such as sound, images, animation) are more difficult to index. Our research is aimed at developing better ways of representing multimedia objects by using a conceptual representation based on Schank's conceptual dependencies. Moreover, the representation allows for users' individual interpretations to be embedded in the system. This will alleviate the problems associated with traditional semantic networks by allowing for coexistence of multiple views of the same information. The viability of the approach is tested, and the preliminary results reported.
Dissimilarity measures, the basis of similarity-based retrieval, can be viewed as a distance and a similarity-based search as a nearest neighbor search. Though there has been extensive research on data structures and ...
详细信息
ISBN:
(纸本)0819420441
Dissimilarity measures, the basis of similarity-based retrieval, can be viewed as a distance and a similarity-based search as a nearest neighbor search. Though there has been extensive research on data structures and search methods to support nearest-neighbor searching, these indexing and dimension-reduction methods are generally not applicable to non-coordinate data and non-Euclidean distance measures. In this paper we reexamine and extend previous work of other researchers on best match searching based on the triangle inequality. These methods can be used to organize both non-coordinate data and non-Euclidean metric similarity measures. The effectiveness of the indexes depends on the actual dimensionality of the feature set, data, and similarity metric used. We show that these methods provide significant performance improvements and may be of practical value in real-world databases.
The large scale proliferation of multimedia data necessitates the use of sophisticated techniques for accessing the information based on the content. videoRoadMap is a new content-based video indexing system for retri...
详细信息
ISBN:
(纸本)0819429880
The large scale proliferation of multimedia data necessitates the use of sophisticated techniques for accessing the information based on the content. videoRoadMap is a new content-based video indexing system for retrieving video clips and images from multimedia databases. The system indexes the audio-visual information using spatio-temporal features and information modeling methods. The proposed system employs adaptive similarity measurements based on the contents of media objects, resulting in more accurate retrievals. Principal component analysis and second order statistical analysis are employed to determine the appropriate combination of weight values in similarity search. In addition, videoRoadMap includes a powerful multi-faceted querying mechanism which allows queries to be formulated and presented in a variety of modes, including query by example (image and/or video), query by sketch, and query by object motion trajectory.
To the end-user of a video database, content consists of objects and events occurring in the video. A video database system must be designed to extract, represent and organize this information in a fashion that suppor...
详细信息
ISBN:
(纸本)0819420441
To the end-user of a video database, content consists of objects and events occurring in the video. A video database system must be designed to extract, represent and organize this information in a fashion that supports querying, manipulation and data visualization by a user. As a data modeling exercise, objects and events are defined in terms of semantic attributes such that an end-user's queries are expressible through the modeling language. On the other hand, as a feature extraction exercise, objects are defined as solutions to equations, often in terms of low-level visual primitives like voxels or contours. These two formalisms constitute entirely different languages. However, integration of these two approaches can provide a powerful mechanism for description and manipulation of complex visual data. This paper explores issues involved with this integration. We introduce the notion of a visual data modeling language (VDML), which supports data definition and data manipulation operations over complex visual data characteristic of video database systems. We discuss this data- modeling effort in the context of our multiple perspective interactive video system which generates three-dimensional data sets using input from multiple video cameras.
This paper examines the issue of direct extraction of low level features from compressed images. Specifically, we consider the detection of areas of interest and edges in images compressed using the discrete cosine tr...
详细信息
ISBN:
(纸本)0819420441
This paper examines the issue of direct extraction of low level features from compressed images. Specifically, we consider the detection of areas of interest and edges in images compressed using the discrete cosine transform (DCT). For interest areas, we show how a measure based on certain DCT coefficients of a block can provide an indication of underlying activity. For edges, we show using an ideal edge model how the relative values of different DCT coefficients of a block can be used to estimate the strength and orientation of an edge. Our experimental results indicate that coarse edge information from compressed images can be extracted up to 20 times faster than conventional edge detectors.
Multimedia Information Systems are experiencing a tremendous growth as a direct consequence of the popularity and pervasive use of world wide web. As a consequence, it is becoming increasingly important to provide eff...
详细信息
Multimedia Information Systems are experiencing a tremendous growth as a direct consequence of the popularity and pervasive use of world wide web. As a consequence, it is becoming increasingly important to provide efficient and flexible solutions for accessing and retrieving multimedia data. images and video are emerging as significant data types in multimedia systems. And yet, most commercial systems are still text and key-word based and do not fully exploit the image content of these systems. We believe that there is an opportunity to build a novel interactive multimedia system for some specific applications in electronic commerce. In this paper we present an overview of our approach, the rationale behind it and the problems that are inherent in building such a system. We address some of the technical issues in representing and analysing image primitive features. These are the building blocks of any such systems. They can be generalized into a much broader range of applications as well.
Illumination invariance is of paramount importance to annotate video sequences stored in large videodatabases consistently. Yet, popular texture analysis methods such as multichannel filtering techniques do not yield ...
详细信息
Illumination invariance is of paramount importance to annotate video sequences stored in large videodatabases consistently. Yet, popular texture analysis methods such as multichannel filtering techniques do not yield illumination-invariant texture representations. In this paper, we assess the effectiveness of three illumination normalisation schemes for texture representations derived from Gabor filter outputs. The schemes aim at overcoming intensity scaling effects due to changes in ilumination conditions. A theoretical analysis and experimental results enable us to select one scheme as the most promising one. In this scheme, a normalising factor is derived at each pixel by combining the energy responses of different filters at that pixel. The scheme overcomes illumination variations well, while still preserving discriminatory textural information. Further statistical analysis may shed light on other interesting properties or limitations of the scheme.
A multimedia database is a controlled collection of multimedia data items such as text, images, graphic objects, video and audio. A multimedia database management system (DBMS) provides support for the creation, stora...
详细信息
A multimedia database is a controlled collection of multimedia data items such as text, images, graphic objects, video and audio. A multimedia database management system (DBMS) provides support for the creation, storage, access, querying and control of a multimedia database. The requirements of a multimedia DBMS are: multimedia data modeling; multimedia object storage; multimedia indexing, retrieval and browsing; and multimedia query support. This paper discusses a general framework for multimedia database systems and describes the requirements and architecture for these systems.
Important issues in multimedia information systems are the development of efficient storage layout models and effective retrieval system to manage multimedia databases. In multimedia information systems, the capabilit...
详细信息
ISBN:
(纸本)0818688211
Important issues in multimedia information systems are the development of efficient storage layout models and effective retrieval system to manage multimedia databases. In multimedia information systems, the capability for interaction with video media type is still extremely limited due to a huge amount of videodatabases. We present the Fast Polynomial Regression Transform (FPRT) based metadata scheme for manipulating the large video database in the paper. This new metadata scheme provides the simplification of the query mechanism and the improvement of communication in multimedia systems. The main advantage of the proposed approach is the reduction of the data storage space while preserving much video content information. Other advantages are computational simplicity for video information coding and accuracy for video indexing. Visual tools for video representation and browsing are also presented.
Augmented Album is an application developed to demonstrate how user situations can be used to provide an easy-to-use and easy-to-remember interface for the management and retrieval of digital pictures that consist of ...
详细信息
Augmented Album is an application developed to demonstrate how user situations can be used to provide an easy-to-use and easy-to-remember interface for the management and retrieval of digital pictures that consist of both digital video clips and stillimages. In this system, contextual information such as the location, time, and user events, are captured when a picture is taken. It represents the meaning of the picture as well as its content information to some extent, and thus benefits us to retrieve image/video clips. At the same time, the contextual information could be used to achieve a more realistic organization of those pictures on a computer system.
暂无评论