检索结果-内蒙古大学图书馆

Proceedings of SPIE - The International Society for Optical Engineering 1999年 3656卷 643-652页

作者： Chen, Keshi Demko, Stephen Xie, Ruifeng Iterated Systems Inc Atlanta United States

The color hologram of an image has been widely used as a feature descriptor for the image in content-based retrieval applications. In this paper, some results from our investigation efforts into to usage are reported. We outline three typical color space quantization schemes used in our experiments and introduce the soft-decision histogramming method to eliminate the discontinuity problem in traditional color histogram population process. Then, to improve the effectiveness of color histogram based retrieval algorithms, several similarity metrics are proposed for comparing color histograms, including three special forms of the Kantorovich metric.

关键词： Feature extraction

来源：评论

学校读者我要写书评

暂无评论

A hierarchical human detection system in (un)compressed domains

引用

IEEE TRANSACTIONS ON MULTIMEDIA 2002年第2期4卷 283-300页

作者： Ozer, IB Wolf, WH Princeton Univ Dept Elect Engn Princeton NJ 08544 USA

With the rapid growth of multimedia information in forms of digital image and video libraries, there is an increasing need for intelligent database management tools with an efficient information retrieval system. For this purpose, we propose a hierarchical retrieval system where shape, color and motion characteristics of human body are captured in compressed and uncompressed domains. The proposed retrieval method provides human detection and activity recognition at different resolution levels from low complexity to low false rates and connects low level features to high level semantics by developing relational object and activity presentations. The available information of standard video compression algorithms are used in order to reduce the amount of time and storage needed for the information retrieval. The principal component analysis is used for activity recognition using MPEG motion vectors and results are presented for walking, kicking, and running to demonstrate that the classification among activities is clearly visible. For low resolution and monochrome images it is demonstrated that the structural information of human silhouettes can be captured from AC-DCT coefficients. The system performance is tested on 40 images that contain a total of 126 nonoccluded frontal poses and the algorithm can detect 101 of them correctly. The finest details in the images and video sequences are obtained from the uncompressed domain via model based segmentation and graph matching for an in depth analysis of human bodies. The detection rate for human body parts is 70.27% for images and sequences including human body regions at different resolutions and with different postures.

关键词： activity recognition eigenspace representation human detection image and video databases JPEG model-based segmentation MPEG relational graph matching

来源：评论

学校读者我要写书评

暂无评论

Using content models to build audio-video summaries

引用

Proceedings of SPIE - The International Society for Optical Engineering 1999年 3656卷 338-347页

作者： Saarela, Janne Merialdo, Bernard Inst Eurecom Sophia-Antipolis France

The amount of digitized video in video archives is becoming so huge that easier access and content browsing tools are desperately needed. Also, video is no longer one big piece of data but a collection of useful smaller building blocks that can be accessed and used independently from the original context of presentation. In this paper, we demonstrate a content model for audio-video sequences with the purpose of enabling the automatic generation of video summaries. The model is based on descriptors which indicate various properties and relations of audio and video segments. In practice, these descriptors could either be generated automatically by analysis methods, or produced manually (or computer-assisted) by the content provider himself. We analyze the requirements and characteristics of the different data segments with respect to the problem of summarization, and we define our model as a set of constraints which allow to produce good quality summaries.

关键词： Digital image storage

来源：评论

学校读者我要写书评

暂无评论

Eigendecomposition-based analysis of video images

引用

Proceedings of SPIE - The International Society for Optical Engineering 1999年 3656卷 186-195页

作者： Chang, C-Y. Maciejewski, A.A. Balakrishnan, V. Purdue Univ West Lafayette United States

We present a fast algorithm for computing the singular value decomposition (SVD) of a matrix consisting of the frames from a video sequence. The computational efficiency of this algorithm derives from the observation that portions of a video sequence will consist of sets of correlated frames. We then show that the information obtained from the SVD can be used to analyze video sequences to obtain information such as scene breaks, scene query, reduced-order shot representation and key frame determination. We illustrate this approach on several video sequences.

关键词： image analysis

来源：评论

学校读者我要写书评

暂无评论

Special effect edit detection using videoTrails: a comparison with existing techniques

引用

Proceedings of SPIE - The International Society for Optical Engineering 1999年 3656卷 302-313页

作者： Kobla, Vikrant DeMenthon, Daniel Doermann, David Univ of Maryland College Park United States

video Segmentation plays an integral role in many multimedia applications such as digital libraries, content management systems, and various other video browsing, indexing, and retrieval systems. Many algorithms for segmentation of video have appeared in the past few years. Most of these algorithms perform well on cuts, but yield poor performance on gradual transitions or special effect edits. A complete video segmentation system must achieve good performance on special effect edit detection also. In this paper, we discuss the performance of our videoTrails based algorithms with other existing special effect edit detection algorithms in literature. Results from experiments testing for the ability to detect edits from TV programs ranging from commercials to news magazine programs, and also diverse special effect edits introduced by us have been shown.

关键词： image segmentation

来源：评论

学校读者我要写书评

暂无评论

MULTIMEDIA INTEGRATED SWITCHING ARCHITECTURE FOR VISUAL INFORMATION-retrieval SYSTEMS

MULTIMEDIA INTEGRATED SWITCHING ARCHITECTURE FOR VISUAL INFO...

引用

CONF ON storage AND retrieval FOR image AND video databases

作者： SAKAMOTO, H NISHIMURA, K ISHIBASHI, Y NAKANO, H NTT Human Interface Labs. (Japan)

ISBN: (纸本)0819411418

Advanced visual information retrieval systems supporting both video and images need to have flexible system design so that their system configurations can easily be enhanced. It is therefore desirable to separate the features of a central system into three parts: storage servers, communication servers, and a back-end network that combines these. In this architecture, unscheduled arrivals of data blocks at a back-end network cause two problems: unacceptable fluctuation of video frames and overly long delays of image transfer. To solve these problems, we have designed a new multimedia integrated switching system (MISS) that uses a fully connected crossbar switch to combine servers. MISS treats a time interval of a few hundred microseconds (called a `time-slot') as the basic unit of data block transfer, and allocates appropriate time-slots to all transfer requests in order to simultaneously meet the requirements for each kind of visual information transfer. According to simulation results and estimates based on queuing theory, MISS greatly reduces video frame fluctuation and halves the average image transfer delay. These effects have been confirmed in an experimental visual communication system built around MISS. This system supports JPEG compressed video and images, and six terminals can simultaneously retrieve visual information through an FDDI network.

关键词： Information visualization video Visualization Telecommunications Data storage Switches image compression Multimedia Local area networks Switching

来源：评论

学校读者我要写书评

暂无评论

Adaptive synthesis in progressive retrieval of audio-visual data

Adaptive synthesis in progressive retrieval of audio-visual ...

引用

1st IEEE International Conference on Multimedia and Expo (ICME2000)

作者： Smith, JR Li, CS IBM Corp TJ Watson Res Ctr Hawthorne NY 10532 USA

ISBN: (纸本)0780365364

With the advent of pervasive computing, a growing diversity of client devices is gaining access to audio-visual content. The increased variability in client device processing power, storage, bandwidth, and server loading require adaptive solutions for image, video and audio retrieval. Progressive retrieval is one prominent mode of access in which views at different resolutions are incrementally retrieved and refined over time. In this paper, we present a new framework for adaptively partitioning the synthesis operations in progressive retrieval of audio-visual signals. The framework considers that the server and client cooperate in synthesizing the views in order to best utilize the available processing power and bandwidth. We provide experimental results that demonstrate a significant reduction in latency in the progressive retrieval of images under different conditions of the client, server and network.

关键词： image and video databases progressive retrieval pervasive computing wavelets

来源：评论

学校读者我要写书评

暂无评论

On the disk layout of near video on demand system 5

On the disk layout of near video on demand system

引用

Conference on storage and retrieval for image and video databases V

作者： Lee, MH Chen, MC Ho, JM Ko, MT Institute of Information Science Academia Sinica Taipei Taiwan Department of Management Information System Shih Chien College Taipei Taiwan

ISBN: (纸本)0819424331

This paper discusses a novel data placement scheme which optimizes the storage utilization of a NVOD system. The scheme is most distinctive in the following two aspects: 1. It considers Me file blocks placement of programs featured different number NVOD channels. 2. The file blocks grouping scheme optimizes the storage utilization of a NVOD system.

关键词： near video on demand NVOD VOD disk layout

来源：评论

学校读者我要写书评

暂无评论

On video retrieval:: Content analysis by imageMiner™

On video retrieval:: Content analysis by ImageMiner™

引用

storage and retrieval for image and video databases VI Conference

作者： Alshuth, P Hermes, T Voigt, L Herzog, O Univ Bremen Ctr Comp Technol Image Proc Dept D-28334 Bremen Germany

ISBN: (纸本)0819427527

In this paper videos are analyzed to get a content-based decription of the video. The structure of a given video is useful to index long videos efficiently and automatically. A comparison between shots gives an overview about cut frequency, cut pattern, and scene bounds. After a shot detection the shots are grouped into clusters based on their visual similarity. A time-constraint clustering procedure is used to compare only those shots that are positioned inside a time range. Shots from different areas of the video (e.g., begin/end) are not compared. With this cluster information that contains a list about shots and their clusters it is possible to calculate scene bounds. A labeling of all clusters gives a declaration about the cut pattern. It is easy now to distinguish a dialogue from an action scene. The final content analysis is done by the imageMiner* system. The imageMiner system developed at the University of Bremen of the image Processing Department of the Center for Computing Technology realizes content-based image retrieval for still images through a novel combination of methods and techniques of computer vision and artifical intelligence. The imageMiner system consists of three analysis modules for computer vision, namely for color, texture, and contour analysis. Additionally exists a module for object recognition. The output of the object recognition module can be indexed by a text retrieval system. Thus, concepts like forestscene may;be searched for. We combine the still image analysis with the results of the video analysis in order to retrieve shots or scenes.

关键词： content-based image retrieval image analysis video analysis image mosaicing graph grammar

来源：评论

学校读者我要写书评

暂无评论

Interfaces for emergent semantics in multimedia databases

引用

Proceedings of SPIE - The International Society for Optical Engineering 1999年 3656卷 167-175页

作者： Santini, Simone Jain, Ramesh Univ of California La Jolla United States

In this paper we introduce our approach to multimedia database interfaces. Although we deal mainly with image databases, most of the ideas we present can be generalized to other types of data. We argue that, when dealing with complex data, such as images, the problem of access must be redefined along different lines than text databases. In multimedia databases, the semantics of the data is imprecise, and depends in part on user's interpretation. This observation made us consider the development of interfaces in which the user explores the database rather than querying it. In this paper we give a brief justification of our position and present the exploratory interface that we have developed for our image database El nino.

关键词： User interfaces

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：