检索结果-内蒙古大学图书馆

Location hashing: an efficient indexing method for locating object queries in image databases

Proceedings of SPIE - the International Society for Optical Engineering 1999年 3656卷 366-378页

作者： Syeda-Mahmood, Tanveer IBM Almaden Research Cent San Jose United States

Queries referring to content embedded within images are an essential component of content-based search, browse, or summarize operations in image databases. Localization of such queries under changes in appearance, occlusions and background clutter, is a difficult problem, for which current spatial access structures in databases are not suitable. In this paper we present a new method of indexing image databases called location hashing that uses a special data structure called the location hash tree (LHT) for organizing feature information from images of a database. Location hashing is based on the principle of geometric hashing and determines simultaneously, the relevant images in the database and the regions within them that are most likely to contain a 2d pattern query without incurring detailed search of either. the location hash tree being a red-black tree, allows for efficient search for candidate locations using pose-invariant feature information derived from the query.

关键词： Database systems

来源：评论

学校读者我要写书评

暂无评论

Morphological approach to scene change detection and digital video storage and retrieval

引用

Proceedings of SPIE - the International Society for Optical Engineering 1999年 3656卷 733-743页

作者： Kim, Woonkyung M. Song, S.Moon-Ho Kim, Hyeokman Song, Cheeyang Kwon, Byung Woong Kim, Sun Geun Korea Univ Seoul Korea Republic of

With the abstraction of digital video as the corresponding binary video- a process which upon numerous subjective experimentation seems to preserve (most of the) intelligibility of video content- we can pursue a precise and analytic approach to (digital video storage and retrieval) algorithm design that are based upon geometrical (morphological) intuition. the foremost and tangible general benefit of such abstraction, however, is the immediate reductions of both data and computational complexities involved in implementing various algorithms and databases. the general paradigm presented may be utilized to address all issues pertaining to video library construction including visualization, optimum feedback query generation, object recognition, e.t.c., but the primary focus of attention in this paper are the ones pertaining to detection of fast (including presence of flashlights) and gradual scene changes (such as dissolves, fades, and various special effects such as wipes). Upon simulation we observed that we can achieve performances comparable to those of others with drastic reductions in both storage and computational complexities. Furthermore, since the conversion from grayscale to binary videos can be performed directly (with minimal additional computation) in the compressed domain by thresholding on the DCT DC coefficients themselves (or by using the contour information attached to MPEG4 formats), the algorithms presented herein are ideally suited for performing fast (on-the-fly) determinations of scene change, object recognition and/or tracking, and other more intelligent tasks traditionally requiring heavy demand on computational and/or storage complexities. the fast determinations may then be used on their own merits or can be used in conjunction or complementation with other higher-layer information in the future.

关键词： video signal processing

来源：评论

学校读者我要写书评

暂无评论

image descriptors based on fractal transform analysis

引用

Proceedings of SPIE - the International Society for Optical Engineering 1999年 3656卷 379-389页

作者： Demko, Stephen Khosravi, Mehdi Chen, Keshi Iterated Systems Inc Atlanta United States

the Fractal Transform (FT) was originally introduced as a methodology for compressing digital images and representing them at different scales. the process of calculating an FT generates a great deal of information about the affine similarities and dissimilarities of an image, most of which is discarded in compression applications. In this paper we introduce the concept of Fractal Transform Analysis and use it to derive new image descriptors. We present results of experiments in which description schemes comprised of some of these FT-based descriptors are applied to the problems of finding objects in an image similar to a given object, of indexing images, and of querying an image database consisting of about 17,000 images. Complexity and timing data are also presented.

关键词： image analysis

来源：评论

学校读者我要写书评

暂无评论

Audio-guided audiovisual data segmentation, indexing, and retrieval

引用

Proceedings of SPIE - the International Society for Optical Engineering 1999年 3656卷 316-327页

作者： Zhang, Tong Kuo, C.-C.Jay Univ of Southern California Los Angeles United States

While current approaches for video segmentation and indexing are mostly focused on visual information, audio signals may actually play a primary role in video content parsing. In this paper, we present an approach for automatic segmentation, indexing, and retrieval of audiovisual data based on audio content analysis. the accompanying audio signal of audiovisual data is first segmented and classified into basic types, i.e. speech, music, environmental sound, and silence. this coarse-level segmentation and indexing step is based on morphological and statistical analysis of several short-term features of the audio signals. then, environmental sounds are classified into finer classes such as applause, explosion, bird's sound, etc. this fine-level classification and indexing step is based on time-frequency analysis of audio signals and the use of hidden Markov model (HMM) as the classifier. On top of this archiving scheme, an audiovisual data retrieval system is proposed. Experimental results show that the proposed approach has an accuracy rate higher than 90% for the coarse-level classification, and higher than 85% for the fine-level classification. Examples of audiovisual data segmentation and retrieval are also provided.

关键词： image segmentation

来源：评论

学校读者我要写书评

暂无评论

Delaunay triangulation for image object indexing: a novel method for shape representation

引用

Proceedings of SPIE - the International Society for Optical Engineering 1999年 3656卷 631-642页

作者： Tao, Yi Grosky, William I. Wayne State Univ Detroit United States

Recent research on image databases has been aimed at the development of content-based retrieval techniques for the management of visual information. Compared with such visual information as color, texture, and spatial constraints, shape is so important a feature associated with those image objects of interest that shape alone may be sufficient to identify and classify an object completely and accurately. this paper presents a novel method based on feature point histogram indexing for object shape representation in image databases. In this scheme, the feature point histogram is obtained by discretizing the angles produced by the Delaunay triangulation of a set of unique feature points which characterize object shape in the context, and then counting the number of times each discrete angle occurs in the resulted triangulation. the proposed shape representation technique is translation, scale, and rotation independent. Our various experiments concluded that the Euclidean distance performs very well as the similarity measure function in combination with the feature point histogram computed by counting the two largest angles of each individual Delaunay triangle. through the further experiment, we also found evidence that an image object representation using a feature point histogram provides an effective cue for image object discrimination.

关键词： Feature extraction

来源：评论

学校读者我要写书评

暂无评论

Gesture for video content navigation

引用

Proceedings of SPIE - the International Society for Optical Engineering 1999年 3656卷 230-242页

作者： Bradski, Gary Yeo, Boon-Lock Yeung, Minerva M. Intel Corp Santa Clara United States

this article describes the use of gesture recognition techniques in computer vision as a natural interface for video content navigation, and the design of a navigation and browsing system that caters to these natural means of computer-human interaction. For consumer applications, video content navigation presents two challenges: (1) how to parse and summarize multiple video streams in an intuitive and efficient manner, and (2) what type of interface will enhance the ease of use for video browsing and navigation in a living room setting or an interactive environment. In this paper, we address the issues and propose the techniques that combine video content navigation with gestures, seamlessly and intuitively, in an integrated system. the current framework can incorporate speech recognition technology. We present a new type of browser for browsing and navigating video content, as well as a gesture recognition interface for this browser.

关键词： Pattern recognition

来源：评论

学校读者我要写书评

暂无评论

Semi-automatic dynamic video object marker creation

引用

Proceedings of SPIE - the International Society for Optical Engineering 1999年 3656卷 178-185页

作者： Toklu, Candemir Liou, Shih-Ping Siemens Corporate Research Inc Princeton United States

In this paper we propose a method for tracking a video object in an ordered sequence of two-dimensional images, where the outcome is the trajectory of the video object throughout the time sequence of images. this method is designed to run in real-time in a synchronous video collaboration environment, and used for producing dynamic object annotations for enhanced video content understanding. A dynamic object is an object whose location or size in the video frame constantly changes due to the camera motion, the motion of its own, or both. We suggest a novel method for finding the trajectory of the object in the intermediate frames given the locations and shapes of the object in two end frames. In addition to the shape and location information of the object, its texture information in the end frames is used to predict the location and search space of it in the intermediate frames.

关键词： Object recognition

来源：评论

学校读者我要写书评

暂无评论

Fast edge map extraction from MPEG compressed video data for video parsing

引用

Proceedings of SPIE - the International Society for Optical Engineering 1999年 3656卷 710-721页

作者： Song, Byung Cheol Ra, Jong Beom Korea Advanced Inst of Science and Technology Taejon Korea Republic of

For the last few years, shot boundary detection has been recognized as an important research issue on video retrieval. Also as a preliminary step for the task, it is essential to extract salient features from videos. Recently, it has become common to perform the two tasks in compressed domain to alleviate their computational costs. In this paper, we propose novel shot boundary detection technique, which uses two feature images, or DC and edge images, extracted directly from MPEG compressed video. While a DC image can be easily obtained, edge image extraction usually requires considerable computational burden. For fast edge image extraction, we suggest to utilize only a few AC coefficients of each DCT block in motion compensated P-frames and B-frames as well as I-frames. this drastically reduces the computational burden compared to edge extraction in the spatial domain. In order to further reduce the computational burden, another edge image extraction technique is also suggested on the basis of AC prediction using DC images. By using the edge energy diagram obtained from edge images and histograms from DC images, shot boundaries such as abrupt transitions, fades, and dissolves are detected automatically. Simulation results show that the proposed techniques are fast and effective.

关键词： Feature extraction

来源：评论

学校读者我要写书评

暂无评论

Processing of partial video data for detection of wipes

引用

Proceedings of SPIE - the International Society for Optical Engineering 1999年 3656卷 280-289页

作者： Kim, Hyeokman Park, Sung-Jun Lee, Jinho Kim, Woonkyung M. Song, S.Moon-Ho Korea Telecom Seoul Korea Republic of

With the currently existing shot change detection algorithms, abrupt changes are detected fairly well. It is thus more challenging to detect gradual changes including fades, dissolves, and wipes as these are often missed or falsely detected. In this paper, we focus on the detection of wipes. the proposed algorithm begins by processing the visual rhythm, a portion of the DC image sequence. It is a single image, a sub-sampled version of a full video in which the sampling is performed in a pre-determined and in a systematic fashion. the visual rhythm contains distinctive patterns or visual features for many different types of video effects. the different video effects manifest themselves differently on the visual rhythm. In particular, wipes appear as curves that run from the top to the bottom of the visual rhythm. thus, using the visual rhythm, it becomes possible to automatically detect wipes simply by determining various lines and curves on the visual rhythm.

关键词： video signal processing

来源：评论

学校读者我要写书评

暂无评论

image content retrieval from image databases using feature integration by Choquet integral

Image content retrieval from image databases using feature i...

引用

7th storage and retrieval for image and video databases conference

作者： Popescu, M Gader, P Univ Missouri Med Informat Grp Columbia MO 65201 USA

ISBN: (纸本)0819431273

A novel similarity measure based on the Choquet integral was introduced for retrieving images from a image database that "mostly" fit the query image. We showed that in certain conditions the measure is a norm, a fact that can be used to reduce the searching time using the triangle inequality. To test the new measure, a content based image retrieval system was built. the system was benchmarked against the visual retrieval cartridge, Virage, built into Oracle 8 database system. the results suggested that the new measure is useful for image retrieval.

关键词： image database Choquet integral similarity measure OWA Oracle 8

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：