A model-based approach to video retrieval requires ground-truth data for training the models. This motivates the development of video annotation tools that allow users to annotate each shot in a video sequence as well as to identify and label scenes, events, and objects by applying labels at the shot level. The annotation tool considered here also allows the user to associate object labels with an individual region in a key-frame image. However, the abundance of video data and the diversity of labels make annotation a difficult and overly expensive task. To combat this problem, we formulate annotation in the framework of supervised training with partially labeled data, viewing it as an exercise in active learning. In this scenario, one first trains a classifier with a small set of labeled data, and subsequently updates the classifier by selecting the most informative, or most uncertain, subset of the available data set. Consequently, propagation of labels to as-yet-unlabeled data is achieved automatically as well. The purpose of this paper is twofold. The first is to describe a video annotation tool that has been developed for annotating generic video sequences in the context of a recent video-TREC benchmarking exercise. The tool is semi-automatic in that it propagates labels to "similar" shots, requiring the user only to confirm or reject the propagated labels. The second is to show how an active learning strategy can be implemented in this context to further improve the performance of the annotation tool. While many variants of active learning are conceivable, we specifically report results of experiments with support vector machine classifiers with polynomial kernels.
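The selection step described above can be sketched in a few lines. This is a minimal illustration of uncertainty sampling, not the paper's system: it uses a nearest-centroid classifier as a lightweight stand-in for the polynomial-kernel SVM, and all function names are made up for this sketch.

```python
import numpy as np

def train_centroids(X, y):
    # Fit a nearest-centroid classifier; a simple stand-in for the
    # polynomial-kernel SVM used in the paper.
    return {c: X[y == c].mean(axis=0) for c in np.unique(y)}

def uncertainty_margin(X, centroids):
    # Margin between the two closest class centroids per sample;
    # a small margin marks the sample as "most uncertain".
    d = np.stack([np.linalg.norm(X - m, axis=1) for m in centroids.values()])
    d.sort(axis=0)
    return d[1] - d[0]

def active_learning_round(X, y_known, labeled, batch=1):
    # One round: train on the labeled pool, then move the most
    # uncertain unlabeled samples into the labeled pool.
    centroids = train_centroids(X[labeled], y_known[labeled])
    unlabeled = np.setdiff1d(np.arange(len(X)), labeled)
    margins = uncertainty_margin(X[unlabeled], centroids)
    picked = unlabeled[np.argsort(margins)[:batch]]
    return np.concatenate([labeled, picked])
```

With a real SVM, the same loop would rank unlabeled samples by their distance to the decision boundary instead of the centroid margin.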
How to quickly and effectively convey video information to the user is a major task for a video search engine's user interface. In this paper, we propose using a Moving Edge Overlaid Frame (MEOF) image to summarize both the local object motion and the global camera motion of a video clip in a single image. MEOF supplements the motion information that is generally dropped by key-frame representations, and it enables faster perception for the user than viewing the actual video. The key technology of our MEOF generation algorithm is global motion estimation (GME). To extract a precise global motion model from general video, our GME module operates in two stages: match-based initial GME and gradient-based GME refinement. The GME module also maintains a sprite image that is aligned with each new input frame after the global motion compensation transform. The difference between the aligned sprite and the new frame is used to extract masks that help pick out the moving objects' edges. The sprite is updated with each input frame, and the moving edges are extracted at a constant interval. After all frames are processed, the extracted moving edges are overlaid onto the sprite according to their global-motion displacement with respect to the sprite and their temporal distance from the last frame, thus creating our MEOF image. Experiments show that the MEOF representation of a video clip helps the user acquire its motion content much faster, and it is also compact enough to serve the needs of online applications.
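The match-based initial GME stage can be illustrated with a translation-only model. This is a deliberately reduced sketch, not the paper's full parametric model with gradient refinement: it exhaustively searches for the dominant (dx, dy) shift minimizing the mean absolute frame difference.

```python
import numpy as np

def estimate_global_translation(prev, cur, search=3):
    # Match-based initial GME, translation-only: find the shift
    # (dx, dy) such that cur[y, x] best matches prev[y + dy, x + dx],
    # by exhaustive search over a small window.
    best, best_err = (0, 0), np.inf
    h, w = prev.shape
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            a = prev[max(0, dy):h + min(0, dy), max(0, dx):w + min(0, dx)]
            b = cur[max(0, -dy):h - max(0, dy), max(0, -dx):w - max(0, dx)]
            err = np.abs(a.astype(float) - b.astype(float)).mean()
            if err < best_err:
                best, best_err = (dx, dy), err
    return best
```

A full implementation would fit an affine or perspective model and refine it with gradient descent, as the two-stage GME in the abstract describes.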
Video copy detection is a complementary approach to watermarking. As opposed to watermarking, which relies on inserting a distinct pattern into the video stream, video copy detection techniques match content-based signatures to detect copies of a video. Typical existing content-based copy detection schemes have relied on image matching. This paper proposes two new sequence-matching techniques for copy detection and compares their performance with one of the existing techniques. Motion-, intensity-, and color-based signatures are compared in the context of copy detection. Results are reported on detecting copies of movie clips.
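The sequence-matching idea can be sketched with the simplest possible signature. This is an assumption-laden illustration, not the paper's method: one mean-intensity value per frame stands in for the motion/intensity/colour signatures, and the query signature is slid over the target to find the best-matching offset.

```python
import numpy as np

def intensity_signature(frames):
    # One coarse feature per frame: the mean intensity. A stand-in for
    # the richer motion/intensity/colour signatures in the paper.
    return np.array([f.mean() for f in frames])

def find_copy(query_sig, target_sig):
    # Slide the query signature over the target sequence and return
    # the offset with the smallest mean absolute distance.
    q, t = len(query_sig), len(target_sig)
    dists = [np.abs(target_sig[i:i + q] - query_sig).mean()
             for i in range(t - q + 1)]
    return int(np.argmin(dists)), float(min(dists))
```

A small best-match distance signals a likely copy; thresholding that distance turns the matcher into a detector.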
Compact representations of video data can enable efficient video browsing. Such representations provide the user with information about the content of the particular sequence being examined while preserving its essential message. We propose a method to automatically generate summaries for long videos. Our video summarization approach involves two main tasks: first, segmenting the video into small, coherent segments and, second, ranking the resulting segments. Our proposed algorithm scores segments based on a word-frequency analysis of the speech transcripts. A summary is then generated by selecting the segments with the highest score-to-duration ratios and concatenating them. We have designed and performed a user study to evaluate the quality of the generated summaries. Based on a statistical analysis of the user-study results, we compare our proposed algorithm with a random segment-selection scheme. Finally, we discuss various issues that arise in evaluating summaries with user studies.
This research addresses the problem of automatically extracting semantic video scenes from movies using multimodal information. A three-stage scene detection scheme is proposed. In the first stage, we use purely visual information to extract a coarse-level scene structure based on generated shot sinks. In the second stage, the audio cue is integrated to further refine the scene detection results by considering various kinds of audio scenarios. Finally, in the third stage, we allow users to interact directly with the system to fine-tune the detection results to their own satisfaction. The generated scene structure provides a compact yet meaningful abstraction of the video data, which clearly facilitates content access. Preliminary experiments on integrating multiple media cues for movie scene extraction have yielded encouraging results.
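The first, purely visual stage can be illustrated with a toy grouping rule. This sketch is not the paper's shot-sink construction: it simply merges consecutive shots into one scene whenever their (assumed normalized) colour-histogram features are similar enough, using histogram intersection as the similarity.

```python
import numpy as np

def coarse_scenes(shot_feats, thresh=0.5):
    # Group consecutive shots into scenes when adjacent shot features
    # are similar; a stand-in for the coarse visual stage. shot_feats
    # is a list of normalized colour histograms, one per shot.
    scenes, cur = [], [0]
    for i in range(1, len(shot_feats)):
        a = np.asarray(shot_feats[i - 1], float)
        b = np.asarray(shot_feats[i], float)
        sim = np.minimum(a, b).sum()  # histogram intersection
        if sim >= thresh:
            cur.append(i)
        else:
            scenes.append(cur)
            cur = [i]
    scenes.append(cur)
    return scenes
```

The later audio and interactive stages would then merge or split these coarse groups.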
A method for searching a face image database (FID) is proposed to support police officers searching criminal records in a central registration database system (CRDS). The proposed method assumes that each FID consists of a fixable object and object correlations. It employs a database search in which all images are retrieved by a similarity-based measure. Consequently, the proposed method is much faster than sequential searching, especially when an additional set of attributes, such as scars, is defined. Moreover, it requires less storage space.
This paper describes an ultrasound image compression algorithm, related to mosaic image compression, in which each compressed object is represented as a set of indexes into a database of mosaic elements. The proposed approach is an alternative method for compressing biomedical ultrasound images. A memory-efficient implementation based on the tree tessellation algorithm v.10 (TTA10) indexing/retrieval solution manages the mosaic elements. The principal advantage of the approach is that, for this very specific kind of ultrasound image (cardiology), it first classifies the particular type of image (compression stage) and then uses a specially constructed schema to decompose an object into parts. Each part of the image is matched to the most similar mosaic image in the database using TTA10, and only the index is stored in the compressed structure of the image. Since the storage size of ultrasound images is similar to that of video, while the specific biomedical content is highly correlated, mosaic image compression is an effective storage solution.
ISBN:
(print) 9628576623
We introduce a simple image coding method, the block truncation coding (BTC) technique, as a novel approach to the construction of colour image databases. It is shown that BTC can not only be used to compress images, thus achieving storage efficiency, but that the BTC codes can also be used directly to construct image features for effective image retrieval. From the BTC code we have developed an image feature, termed the BTC colour co-occurrence matrix (BCCM), as an effective measure of image content. Experimental results are presented to show that BCCM is comparable to state-of-the-art techniques, such as the color correlogram, in image retrieval.
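The BTC codes that the BCCM feature is built from can be sketched for a single greyscale block. This shows the absolute-moment BTC variant (bitmap plus two group means), not the paper's colour pipeline or the BCCM construction itself.

```python
import numpy as np

def btc_encode_block(block):
    # Absolute-moment BTC: threshold the block at its mean to get a
    # binary bitmap, and keep the mean of each group as the two
    # reconstruction levels (hi, lo).
    m = block.mean()
    bitmap = block >= m
    hi = block[bitmap].mean() if bitmap.any() else m
    lo = block[~bitmap].mean() if (~bitmap).any() else m
    return bitmap, hi, lo

def btc_decode_block(bitmap, hi, lo):
    # Reconstruct the block from its BTC code.
    return np.where(bitmap, hi, lo)
```

Applying this per channel to small blocks yields the per-block codes from which a co-occurrence matrix over quantized (hi, lo) colours could be accumulated.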
A nine-direction lower-triangular (9DLT) matrix describes the relative spatial relationships among the objects in a symbolic image. In this paper, the 9DLT matrix is transformed into a linear string, called the 9DLT string. Based on the 9DLT string, two image-matching similarity measures, simpler but more precise, are provided to solve the subimage and similar-image retrieval problems. Moreover, a common component binary tree (CCBT) structure is refined to store a set of 9DLT strings. The revised CCBT structure not only eliminates the redundant information among the 9DLT strings, but also reduces the processing time for computing the image-matching distances between query frames and video frames. Experiments indicate that both storage space and processing time are greatly reduced by the revised CCBT structure. A fast dynamic programming approach is also proposed to handle the problem of matching a query frame sequence against a video frame sequence. (C) 2001 Academic Press.
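The 9DLT-string idea can be sketched as follows. The direction numbering here is illustrative, not the paper's exact code table: each pair of objects gets one of nine codes for their relative position, and the lower-triangular codes are flattened into a linear string.

```python
def sign(v):
    # -1, 0, or 1
    return (v > 0) - (v < 0)

def direction_code(p, q):
    # One of nine codes for q's position relative to p. The numbering
    # (1 = east, counter-clockwise) is an assumption for this sketch.
    dx, dy = q[0] - p[0], q[1] - p[1]
    if dx == 0 and dy == 0:
        return 0
    codes = {(1, 0): 1, (1, 1): 2, (0, 1): 3, (-1, 1): 4,
             (-1, 0): 5, (-1, -1): 6, (0, -1): 7, (1, -1): 8}
    return codes[(sign(dx), sign(dy))]

def ninedlt_string(objects):
    # objects: dict label -> (x, y). Emit the lower-triangular codes
    # in label order as a linear string (here, a list of ints).
    labels = sorted(objects)
    return [direction_code(objects[labels[j]], objects[labels[i]])
            for i in range(len(labels)) for j in range(i)]
```

Two such strings can then be compared element-wise, which is what makes the string form convenient for the distance computations the abstract describes.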