this paper describes an API for image searching. the attempt was to isolate the functionality of the GUI from the functionality of the image search engine. the GUI would then make calls to the image search API and cou...
详细信息
this paper describes an API for image searching. the attempt was to isolate the functionality of the GUI from the functionality of the image search engine. the GUI would then make calls to the image search API and could be used with any image search engine implementing that API. Also, different methods of specifying the initial search image are discussed as well as different methods of displaying the results, including the use of 3D using VRML.
video parsing is an important step in content-based indexing techniques where the input video is decomposed into segments with uniform content. In video parsing detection of scene changes is one of the approaches wide...
详细信息
video parsing is an important step in content-based indexing techniques where the input video is decomposed into segments with uniform content. In video parsing detection of scene changes is one of the approaches widely used for extracting key frames from the video sequence. In this paper, an algorithm based on motion vectors is proposed to detect sudden scene changes and gradual scene changes (camera movements such as panning, tilting and zooming). Unlike some of the existing schemes, the proposed scheme is capable of detecting both sudden and gradual changes in uncompressed as well as compressed domain video. It is shown that the resultant motion vector can be used to identify and classify gradual changes due to camera movements. Results show that algorithm performed as well as the histogram-based schemes with uncompressed video. the performance of the algorithm was also investigated with H.263 compressed video. the detection and classification of both sudden and gradual scene changes was successfully demonstrated.
Content based retrieval on large multimedia database attracts the interests of many researchers, but the database architecture needed for content based retrieval is still an open problem. Traditional relation database...
详细信息
Content based retrieval on large multimedia database attracts the interests of many researchers, but the database architecture needed for content based retrieval is still an open problem. Traditional relation database system does not support the high-dimension feature form content description and indexing, thus is limited in its content based retrieval function. Some systems do support high-dimension feature form content description and indexing, but lacks descriptions and query expressions on media object content and relations. In this paper, we present our study results on query mechanism and proposed CbExpr - a powerful flexible query expression mechanism on media object. Based on CbExpr we proposed GMA (general mediabase architecture) - a general architecture for management and content based retrieval on large media databases, and videoBase - a content based videoretrieval system is present as example of GMA. Basic thoughts, considerations, and definitions are presented in the paper, also with some implementation details.
In this paper we introduce our approach to multimedia database interfaces. Although we deal mainly withimagedatabases, most of the ideas we present can be generalized to other types of data. We argue that, when deal...
详细信息
In this paper we introduce our approach to multimedia database interfaces. Although we deal mainly withimagedatabases, most of the ideas we present can be generalized to other types of data. We argue that, when dealing with complex data, such as images, the problem of access must be redefined along different lines than text databases. In multimedia databases, the semantics of the data is imprecise, and depends in part on user's interpretation. this observation made us consider the development of interfaces in which the user explores the database rather than querying it. In this paper we give a brief justification of our position and present the exploratory interface that we have developed for our image database El nino.
the amount of digitized video in video archives is becoming so huge that easier access and content browsing tools are desperately needed. Also, video is no longer one big piece of data but a collection of useful small...
详细信息
the amount of digitized video in video archives is becoming so huge that easier access and content browsing tools are desperately needed. Also, video is no longer one big piece of data but a collection of useful smaller building blocks that can be accessed and used independently from the original context of presentation. In this paper, we demonstrate a content model for audio-video sequences withthe purpose of enabling the automatic generation of video summaries. the model is based on descriptors which indicate various properties and relations of audio and video segments. In practice, these descriptors could either be generated automatically by analysis methods, or produced manually (or computer-assisted) by the content provider himself. We analyze the requirements and characteristics of the different data segments with respect to the problem of summarization, and we define our model as a set of constraints which allow to produce good quality summaries.
video Segmentation plays an integral role in many multimedia applications such as digital libraries, content management systems, and various other video browsing, indexing, and retrieval systems. Many algorithms for s...
详细信息
video Segmentation plays an integral role in many multimedia applications such as digital libraries, content management systems, and various other video browsing, indexing, and retrieval systems. Many algorithms for segmentation of video have appeared in the past few years. Most of these algorithms perform well on cuts, but yield poor performance on gradual transitions or special effect edits. A complete video segmentation system must achieve good performance on special effect edit detection also. In this paper, we discuss the performance of our videoTrails based algorithms with other existing special effect edit detection algorithms in literature. Results from experiments testing for the ability to detect edits from TV programs ranging from commercials to news magazine programs, and also diverse special effect edits introduced by us have been shown.
We have developed a TV-on-demand system that provides playback of a television program after a period ranging from a few seconds to one week after broadcast, and have conducted usage trials in cooperation with a telev...
详细信息
We have developed a TV-on-demand system that provides playback of a television program after a period ranging from a few seconds to one week after broadcast, and have conducted usage trials in cooperation with a television station in Nagano Prefecture of Japan. this system has been achieved through the development of various technologies such as automatic updating of stored television programs and contents retrieval by telop characters. Users in the trials can begin playback of a television program immediately after its broadcast has begun. the purpose of the trials was to evaluate the system's usability in applications such as contents retrieval, selective viewing of commercials, and customer service at the television station. this paper presents applied technologies and some experimental results and also addresses a new direction of information retrieval system based on the evaluation of the usage trials.
In this paper we address the problem of choosing appropriate features to describe the content of still pictures or video sequences including audio. As the computational analysis of these features is often time-consumi...
详细信息
In this paper we address the problem of choosing appropriate features to describe the content of still pictures or video sequences including audio. As the computational analysis of these features is often time-consuming it is useful to identify a minimal set allowing for an automatic classification of some class or genre. Further it can be shown that deleting the coherence of the features characterizing some class is not suitable to guarantee an optimal classification result. the central question of the paper is thus which features should be selected and how they should be weighted to optimize a classification problem.
Content Based imageretrieval has recently become one of the most active research areas, due to the massive increase in the amount and complexity of digitized data being stored, transmitted and accessed. We present he...
详细信息
Content Based imageretrieval has recently become one of the most active research areas, due to the massive increase in the amount and complexity of digitized data being stored, transmitted and accessed. We present here a prototype implementation of DRAWSEARCH, an imageretrieval by content system that uses color and shape (and texture in the near future) features to index and retrieve images. the system, currently being tested and improved, is designed to increase interactivity with users posing queries over the Internet and avails of a Java client for query by sketch. It also implements relevance feedback to allow users dynamically refine queries. Experiments show that the proposed approach can greatly reduce the user's effort to compose a query while capturing his/her information need with greater precision.
Queries referring to content embedded within images are an essential component of content-based search, browse, or summarize operations in imagedatabases. Localization of such queries under changes in appearance, occ...
详细信息
Queries referring to content embedded within images are an essential component of content-based search, browse, or summarize operations in imagedatabases. Localization of such queries under changes in appearance, occlusions and background clutter, is a difficult problem, for which current spatial access structures in databases are not suitable. In this paper we present a new method of indexing imagedatabases called location hashing that uses a special data structure called the location hash tree (LHT) for organizing feature information from images of a database. Location hashing is based on the principle of geometric hashing and determines simultaneously, the relevant images in the database and the regions within them that are most likely to contain a 2d pattern query without incurring detailed search of either. the location hash tree being a red-black tree, allows for efficient search for candidate locations using pose-invariant feature information derived from the query.
暂无评论