The realization of image/video database requires a specific design for both database structures and mass storage management. This issue has addressed the project of the digital image/video database system that has bee...
详细信息
ISBN:
(纸本)0819411418
The realization of image/video database requires a specific design for both database structures and mass storage management. This issue has addressed the project of the digital image/video database system that has been designed at IBM SEMEA Scientific & Technical Solution Center. Proper database structures have been defined to catalog image/video coding technique with the related parameters, and the description of image/video contents. User workstations and servers are distributed along a local area network. image/video files are not managed directly by the DBMS server. Because of their wide size, they are stored outside the database on network devices. The database contains the pointers to the image/video files and the description of the storage devices. The system can use different kinds of storage media, organized in a hierarchical structure. Three levels of functions are available to manage the storage resources. The functions of the lower level provide media management. They allow it to catalog devices and to modify device status and device network location. The medium level manages image/video files on a physical basis. It manages file migration between high capacity media and low access time media. The functions of the upper level work on image/video file on a logical basis, as they archive, move and copy image/video data selected by user defined queries. These functions are used to support the implementation of a storage management strategy. The database information about characteristics of both storage devices and coding techniques are used by the third level functions to fit delivery/visualization requirements and to reduce archiving costs.
We study the problem of retrieving images using a small template. The goal is to allow a user to search for images containing a pattern similar to the template, adding to the capability of a search engine. We propose ...
详细信息
ISBN:
(纸本)0819424331
We study the problem of retrieving images using a small template. The goal is to allow a user to search for images containing a pattern similar to the template, adding to the capability of a search engine. We propose to employ a segmentation-based approach. As a specific example, we introduce a quadtree segmentation technique for textured images and a distance measure, Sum of Minimum Distance, suitable for template-based imageretrieval applications.
In this paper we show how agent-like processes may be used to support content based retrieval and navigation in MAVIS, a Multimedia architecture for video, image and Sound. The processes provide automatic classificati...
详细信息
ISBN:
(纸本)0819427527
In this paper we show how agent-like processes may be used to support content based retrieval and navigation in MAVIS, a Multimedia architecture for video, image and Sound. The processes provide automatic classification of feature vectors from users' selections when a multimedia thesaurus(MMT) is present and feature vector clustering in the absence of an MMT.
This paper describes the development of a prototype of a video database system, called VLdIO, that takes account of the importance of different perspectives in videoretrieval. Text-based hierarchical structures are u...
详细信息
ISBN:
(纸本)0819424331
This paper describes the development of a prototype of a video database system, called VLdIO, that takes account of the importance of different perspectives in videoretrieval. Text-based hierarchical structures are used for representing the contents of a video. The structures are used for supporting the required functionalities in organizing personalized video materials. In addition to support for indexing original video materials, the system also supports tools for re-indexing and maintaining the results of videoretrieval. In other words it tries to fulfill the requirement of personalized video information management. The paper defines the requirement, outlines the key considerations in providing such support and describes the implemented system.
The scale-invariant feature transform (SIFT) feature plays a very important role in multimedia content analysis, such as near-duplicate image and videoretrieval. However, the storage and query costs of SIFT become un...
详细信息
The scale-invariant feature transform (SIFT) feature plays a very important role in multimedia content analysis, such as near-duplicate image and videoretrieval. However, the storage and query costs of SIFT become unbearable for large-scale databases. In this paper, SIFT features are robustly encoded with temporal information by tracking the SIFT to generate temporal-concentration SIFT (TCSIFT), which highly compresses the quantity of local features to reduce visual redundancy, and keeps the advantages of SIFT as much as possible at the same time. On the basis of TCSIFT, a novel framework for large-scale video copy retrieval is proposed in which the processes of retrieval and validation are implemented at the feature and frame level. Experimental results for two different datasets, i.e., CC_WEB_video and TRECVID, demonstrate that our method can yield comparable accuracy, compact storage size, and more efficient execution time, as well as adapt to various video transformations. (C) 2015 Elsevier B.V. All rights reserved.
In this paper we describe a novel interactive image viewer incorporating a range of image processing techniques that allows inexperienced users to quickly and easily delineate objects or shapes from a wide range of re...
详细信息
ISBN:
(纸本)0819427527
In this paper we describe a novel interactive image viewer incorporating a range of image processing techniques that allows inexperienced users to quickly and easily delineate objects or shapes from a wide range of real world images. The viewer is specifically designed to be easily extensible, and this extensibility is demonstrated with the implementation of an iterative user guided segmentation tool. Using this tool objects can be efficiently extracted from images and used as the basis for navigation and retrieval within MAVIS, the Multimedia Architecture for video, image, and Sound.
Autosophy, an emerging new science, explains `self-assembling structures,' such as crystals or living trees, in mathematical terms. This research provides a new mathematical theory of `learning' and a new `inf...
详细信息
ISBN:
(纸本)0819411418
Autosophy, an emerging new science, explains `self-assembling structures,' such as crystals or living trees, in mathematical terms. This research provides a new mathematical theory of `learning' and a new `information theory' which permits the growing of self-assembling data network in a computer memory similar to the growing of `data crystals' or `data trees' without data processing or programming. Autosophy databases are educated very much like a human child to organize their own internal data storage. Input patterns, such as written questions or images, are converted to points in a mathematical omni dimensional hyperspace. The input patterns are then associated with output patterns, such as written answers or images. Omni dimensional information storage will result in enormous data compression because each pattern fragment is only stored once. Pattern recognition in the text or image files is greatly simplified by the peculiar omni dimensional storage method. videodatabases will absorb input images from a TV camera and associate them with textual information. The `black box' operations are totally self-aligning where the input data will determine their own hyperspace storage locations. Self-aligning autosophy databases may lead to a new generation of brain-like devices.
In this paper, we present topics related to tracking of video objects in compressed videodatabases in the context of videoretrieval applications. We developed a videoretrieval and tracking system (VORTEX) to enable...
详细信息
ISBN:
(纸本)0819431273
In this paper, we present topics related to tracking of video objects in compressed videodatabases in the context of videoretrieval applications. We developed a videoretrieval and tracking system (VORTEX) to enable operation directly on compressed video data. The structure of the video compression standards is exploited in order to avoid the costly decompression operation. This is achieved by utilizing motion compensation-a critical prediction filter embedded in video compression standards-to estimate and interpolate the desired method for template matching. Occlusion analysis, filtering and motion analysis are used to implement fast tracking of objects of interest on the compressed video data. Being presented with a query in the form of template images of objects, the system operates on the compressed video in order to find the images or video sequences where those objects are present and their positions in the image. This enables the retrieval and display of the query-relevant sequences.
video parsing is an important step in content-based indexing techniques where the input video is decomposed into segments with uniform content. In video parsing detection of scene changes is one of the approaches wide...
详细信息
video parsing is an important step in content-based indexing techniques where the input video is decomposed into segments with uniform content. In video parsing detection of scene changes is one of the approaches widely used for extracting key frames from the video sequence. In this paper, an algorithm based on motion vectors is proposed to detect sudden scene changes and gradual scene changes (camera movements such as panning, tilting and zooming). Unlike some of the existing schemes, the proposed scheme is capable of detecting both sudden and gradual changes in uncompressed as well as compressed domain video. It is shown that the resultant motion vector can be used to identify and classify gradual changes due to camera movements. Results show that algorithm performed as well as the histogram-based schemes with uncompressed video. The performance of the algorithm was also investigated with H.263 compressed video. The detection and classification of both sudden and gradual scene changes was successfully demonstrated.
The technique of symbolic projection has been widely studied in the area of image database systems as a first step towards content-based indexing and retrieval of images. In this paper we have extended the idea of sym...
详细信息
ISBN:
(纸本)0819424331
The technique of symbolic projection has been widely studied in the area of image database systems as a first step towards content-based indexing and retrieval of images. In this paper we have extended the idea of symbolic projections to video and audio data as well as to multimedia documents containing combinations of these data types. Formal definitions of symbolic video sequence, symbolic audio sequence and symbolic multimedia documents are given as are definitions of their symbolic projections. An indexing methodology based on these symbolic projections is presented. Operators which allow multimedia documents to be constructed from the basic multimedia data types are also presented. The main contribution of this paper is to provide a basis for the development of content-based retrieval of multimedia documents via extended symbolic projections.
暂无评论