This paper presents an overview of the aims and objectives of MPEG-4. This standard scheduled for November 1998 addresses content-based interactivity, universal access, robustness in error-prone environments, manipula...
详细信息
ISBN:
(纸本)0780344340
This paper presents an overview of the aims and objectives of MPEG-4. This standard scheduled for November 1998 addresses content-based interactivity, universal access, robustness in error-prone environments, manipulation of objects, both synthetic and natural, etc., at various bit rates, quality levels and spatial/temporal resolutions. The standard is designed to accommodate different communication networks, processors and platforms through tools, algorithms and profiles.
With the increasing popularity of WWW, the main challenge in computer science has become content-based retrieval of multimedia objects. Until now access of multimedia objects in databases was done by means of keywords...
With the increasing popularity of WWW, the main challenge in computer science has become content-based retrieval of multimedia objects. Until now access of multimedia objects in databases was done by means of keywords. Now, with the integration of feature-detection algorithms in database systems software, content-based retrieval can be fully integrated with query processing. In this paper, we describe our experimentation platform under development that fully integrates traditional query processing and content-based retrieval and that is based on feature databases, making database technology available to multimedia.
Motion is one of the most prominent features of video. For content-based video retrieval, motion trajectory is the intuitive specification of motion features. In this paper, approaches for video retrieval via single m...
详细信息
Motion is one of the most prominent features of video. For content-based video retrieval, motion trajectory is the intuitive specification of motion features. In this paper, approaches for video retrieval via single motion trajectory and multiple motion trajectories are addressed. For the retrieval via single motion trajectory, the trajectory is modeled as a sequence of segments and each segment is represented as the slope. Two quantitative similarity measures and corresponding algorithms based on the sequence similarity are presented. For the retrieval via multiple motion trajectories, the trajectories of the video are modeled as a sequence of symbolic pictures. Four quantitative similarity measures and algorithms, which are also based on the sequence similarity, are proposed. All the proposed algorithms are developed based on the dynamic programming approach.
Based on the blobworld method, we propose a blob-centric image querying scheme that is comprised of several new techniques in content-based image retrieval. We report our research results in the database structure and...
详细信息
Based on the blobworld method, we propose a blob-centric image querying scheme that is comprised of several new techniques in content-based image retrieval. We report our research results in the database structure and maintenance algorithms for image indexing. We further conduct a performance comparison of image retrieval efficiency for three possible image retrieval methods, the naive method, the representative-blobs method, and the indexing method. Our quantitative analysis shows over 90% reduction in query response time by using the representative-blobs method and the indexing method.
This paper presents a new approach for the classification and retrieval of three-dimensional images and models from databases. A set of retrieval algorithms is introduced. These algorithms are content-based, meaning t...
详细信息
The current trend for integrating media of various forms has treated video sequences as object components. Video processing algorithms which automatically extract objects from video sequences need to be fine-tuned to ...
详细信息
The current trend for integrating media of various forms has treated video sequences as object components. Video processing algorithms which automatically extract objects from video sequences need to be fine-tuned to suit the different video contents. The algorithms themselves, however, tend to be too complicated for multimedia-content editors without skills in video processing. We present a framework for use in a multimedia authoring system, that assists unskilled users in labeling objects. Image selection trains pixel- and region-unit feature extraction approaches which were devised to assist editor understanding. Furthermore, a graphical user interface is presented for the tracking stage which enables convenient editor training and label correction. The main part of the framework was implemented on a real-time parallel image processing system. The system was used to label jockeys in a horse racing sequence, achieving a factor of ten time reduction over a comparable manual labeling process.
We consider a content-based adaptive clustering algorithm (CBAC) to extract a fixed pre-determined number of representative frames to summarize a given digital video. In our algorithm, shot boundary detection is not n...
详细信息
We consider a content-based adaptive clustering algorithm (CBAC) to extract a fixed pre-determined number of representative frames to summarize a given digital video. In our algorithm, shot boundary detection is not needed. The inter-frame changes are compared globally for the extraction of representative frames. Small units of the video are dynamically clustered into two clusters based on the amount of change in content within the unit. One cluster, which has units of low content change, is designated for deletion and the other cluster, which has units of high content change, is for retention. The algorithm adaptively converges to the desired number of frames by deleting some redundant frames from each unit of the deletion cluster during every iteration.
The access of high bandwidth multimedia data is generally limited to the network due to the limited network resources, inefficient indexing mechanism and less of semantic interpretations. Currently, most content-based...
详细信息
The access of high bandwidth multimedia data is generally limited to the network due to the limited network resources, inefficient indexing mechanism and less of semantic interpretations. Currently, most content-based video content representation involves the segmentation and indexing of video based on scene change and camera/object motion, and such research generally performs off-line video processing. Little research has been done on on-line video processing, which is crucial in video communication applications such as video conferencing, video multicasting and on-line video browsing and retrieval. This research investigates real-time content-based processing of multicast video over the Internet. New on-line video feature extraction schemes, such as scene change detection, on-line key frame classification, are considered to meet the requirement of real-time video multicasting filtering based on the user's profile over the Internet. The annotation and features extracted from a multicast videoconference bitstream by the on-line video content analysis proxies, which are using the proposed video processing algorithms, are output to a separate metadata channel for the further assistance of semantic multicasting of video content. The performance of the proposed algorithms is also demonstrated.
This paper presents an overview of the aims and objectives of MPEG-4. This standard scheduled for November 1998 addresses content-based interactivity, universal access, robustness in error-prone environments, manipula...
详细信息
This paper presents an overview of the aims and objectives of MPEG-4. This standard scheduled for November 1998 addresses content-based interactivity, universal access, robustness in error-prone environments, manipulation of objects, both synthetic and natural, etc., at various bit rates, quality levels and spatial/temporal resolutions. The standard is designed to accommodate different communication networks, processors and platforms through tools, algorithms and profiles.
Increasing amounts of text, audio, and video content has fueled efforts to provide direct, content based access to these materials. Summaries are often necessary to enable timely relevancy assessments, information ext...
详细信息
ISBN:
(纸本)0818682183
Increasing amounts of text, audio, and video content has fueled efforts to provide direct, content based access to these materials. Summaries are often necessary to enable timely relevancy assessments, information extraction, or information analysis from source material. Whereas text summarization research is receiving increasing attention, comparatively few investigators have examined video summarization. This paper reports on the extension of a broadcast news access system to provide multimedia summaries. We briefly overview our system for video analysis, focusing on our novel integration of image, speech and language processing techniques to support automated video summarization. We outline algorithms for proper name and keyphrase extraction, story segmentation, and key frame extraction which together underpin our current ability to automatically summarize video. We describe the systems ability to generate multimedia video summaries tailored to a user query. We discuss evaluation metrics for measuring the (quality) value of these summary artifacts.
暂无评论