A new system, the so-called MUVIS, is introduced for content-based indexing and retrieval for image database management systems. In addition to traditional indexing by key words, MUVIS allows indexing of objects and i...
详细信息
A new system, the so-called MUVIS, is introduced for content-based indexing and retrieval for image database management systems. In addition to traditional indexing by key words, MUVIS allows indexing of objects and images based on color, texture, shape and objects layout inside them. Due to the use of large vector features, the pyramid trees are employed to create the index structure.
Motion and structure analysis in video sequences can lead to efficient descriptions of objects and their motions. Interesting events in videos can be detected using such an analysis--for instance independent object mo...
详细信息
ISBN:
(纸本)0819414808
Motion and structure analysis in video sequences can lead to efficient descriptions of objects and their motions. Interesting events in videos can be detected using such an analysis--for instance independent object motion when the camera itself is moving, figure-ground segregation based on the saliency of a structure compared to its surroundings. In this paper we present a method for 3D motion and structure analysis that uses a planar surface in the environment as a reference coordinate system to describe a video sequence. The motion in the video sequence is described as the motion of the reference plane, and the parallax motion of all the non-planar components of the scene. It is shown how this method simplifies the otherwise hard general 3D motion analysis problem. In addition, a natural coordinate system in the environment is used to describe the scene which can simplify motion based segmentation. This work is a part of an ongoing effort in our group towards video annotation and analysis for indexing and retrieval. Results from a demonstration system being developed are presented.
videodatabases can be searched for visual content by searching over automatically extracted key frames rather than the complete video sequence. Many video materials used in the humanities and social sciences contain ...
详细信息
ISBN:
(纸本)0819424331
videodatabases can be searched for visual content by searching over automatically extracted key frames rather than the complete video sequence. Many video materials used in the humanities and social sciences contain a preponderance of shots of people. In this paper, we describe our work in semantic imageretrieval of person-rich scenes (key frames) for videodatabases and libraries. We use an approach called retrieval through segmentation. A key-frame image is first segmented into human subjects and background. We developed a specialized segmentation technique that utilizes both human flesh-tone detection and contour analysis. Experimental results show that this technique can effectively segment images in a low time complexity. Once the image has been segmented, we can then extract features or pose queries about both the people and the background. We propose a retrieval framework that is based on the segmentation results and the extracted features of people and background.
A prototype of the content-based imageretrieval system is implemented based on the algorithms introduced in this paper. The image contents at the high levels are extracted. The fuzzy C-means classifier is employed to...
详细信息
A prototype of the content-based imageretrieval system is implemented based on the algorithms introduced in this paper. The image contents at the high levels are extracted. The fuzzy C-means classifier is employed to compute the object clusters and provide useful information for overlapped clusters. The automatic image segmentation and categorisation is achieved. To obtain the context for imageretrieval, the subjective context and the objective context are modelled by means of the fuzzy sets theory. The system is able to trace the users' interactions during retrieval. The refinements of the retrieval results can be made while the users are submitting the queries telling the specific requirements.
We present three strategies for placement of video data on parallel disk arrays. Using a low- level disk model and video data from a scalable subband coding technique, we derive constraints with which to compare the t...
详细信息
ISBN:
(纸本)0819414808
We present three strategies for placement of video data on parallel disk arrays. Using a low- level disk model and video data from a scalable subband coding technique, we derive constraints with which to compare the three strategies. One strategy, constant frame grouping, is shown to be superior. Two methods for interleaving multiple videos under the constant frame grouping strategy are presented: nonperiodic and periodic. Periodic interleaving is shown to have the advantages of a lower access time and limited scan and pause functions. The constant frame grouping strategy is tested on an actual array of 8 disks and shown to have performance that is close to the theoretical prediction. The scalable nature of the compressed data is used to relieve the disk system overload for an overly high request rate.
Indexing of scientific imagedatabases is a difficult task, due to their extraordinary sizes and to the complex nature of the visual information contained in them. This data volume and complexity require an automatic ...
详细信息
ISBN:
(纸本)0819414808
Indexing of scientific imagedatabases is a difficult task, due to their extraordinary sizes and to the complex nature of the visual information contained in them. This data volume and complexity require an automatic indexing scheme that will categorize this visual content; without it, the data will be essentially useless to scientists and medical doctors. A method for automatic indexing of scientific imagedatabases is presented which involves a wavelet package decomposition of images in the frequency domain, resulting in a quad-tree of subbands. These subbands are regarded as realizations of random fields, and statistical measures are computed on them. One class of newly derived measures determines whether the subbands contain any significant organization of pixels beyond what chance would imply. If this is found to be true for a subband, its node is retained on an index tree, and other identifying measurements may be added. The structure of the resulting pruned subband tree constitutes the first level of index; the node statistics form a second indexing level. Results of a pilot study are reported; they suggest that further investigation of this approach is warranted.
The design of an electronic archive of digitized images of thousands of xrays collected as part of nationwide health surveys has raised several issues related to user interface design, image presentation and image com...
详细信息
ISBN:
(纸本)0819414808
The design of an electronic archive of digitized images of thousands of xrays collected as part of nationwide health surveys has raised several issues related to user interface design, image presentation and image compression. The project involves developing an image archive implemented with an optical disk jukebox, and user workstations that allow Internet access to the images. This paper describes: the physical layout design of the workstation screens; desirable image processing functions contributing to better viewing and minimizing artifacts; interface design factors contributing to ease-of-use and speed of task completion; and work toward the selection of a suitable image compression technique.
The temporal and multi-modal nature of video increases the dimensionality of content based retrieval problem. This places new demands on the indexing and retrieval tools required. The Virage video Engine (VVE) with th...
详细信息
ISBN:
(纸本)0819424331
The temporal and multi-modal nature of video increases the dimensionality of content based retrieval problem. This places new demands on the indexing and retrieval tools required. The Virage video Engine (VVE) with the default set of primitives provide the necessary frame work and basic tools for video content based retrieval. The video engine is a flexible platform independent architecture which provides support for processing multiple synchronized data streams like image sequences, audio and closed captions. The architecture allows for multi-modal indexing and retrieval of video through the use of media specific primitives. This paper presents the use of the VVE framework for content based videoretrieval.
Content based retrieval on large multimedia database attracts the interests of many researchers, but the database architecture needed for content based retrieval is still an open problem. Traditional relation database...
详细信息
Content based retrieval on large multimedia database attracts the interests of many researchers, but the database architecture needed for content based retrieval is still an open problem. Traditional relation database system does not support the high-dimension feature form content description and indexing, thus is limited in its content based retrieval function. Some systems do support high-dimension feature form content description and indexing, but lacks descriptions and query expressions on media object content and relations. In this paper, we present our study results on query mechanism and proposed CbExpr - a powerful flexible query expression mechanism on media object. Based on CbExpr we proposed GMA (general mediabase architecture) - a general architecture for management and content based retrieval on large media databases, and videoBase - a content based videoretrieval system is present as example of GMA. Basic thoughts, considerations, and definitions are presented in the paper, also with some implementation details.
暂无评论