Enabling the efficient storage, access, and retrieval of large volumes of multi-dimensional data is one of the important emerging problems in databases. We present a framework for adaptively storing, accessing, and retrieving large images. The framework uses a space and frequency graph to generate and select image view elements for storage in the database. By adapting to user access patterns, the system selects and stores those view elements that yield the lowest average cost for accessing the multi-resolution sub-region image views. The system uses a second adaptation strategy to divide computation between server and client during progressive retrieval of image views from view elements. We show that the system speeds up retrieval for access modes such as drill-down browsing and remote zooming and panning, and minimizes the amount of data transferred over the network.
ISBN: 0819432997 (print)
With the advancement of multimedia technology and the internet, numerous applications have arisen that require the storage and retrieval of large image and video databases. A novel method (Eigenwavelet) was developed to retrieve images from a large heterogeneous image database in response to a user-specified query. A query consists of one or more images for which the user seeks similar matches in the database. An efficient algorithm was developed that decomposes each image in the database using wavelet packet analysis. At each node of the packet tree, Principal Component Analysis (PCA) is applied to the database images after wavelet packet decomposition, generating a set of eigenvectors (eigenwavelets) for that node. To search the image database, the query images are projected onto these eigenvectors to obtain Eigenwavelet coefficients. A distance metric is computed between the projections of the queries and the projections of the database images onto the eigenwavelets. The images with minimal L1 distance are retrieved in response to a unique query set. Simulations with a heterogeneous image database suggest that the Eigenwavelet method of image retrieval is robust and computationally tractable, retrieving images with a probability of detection > 0.8.
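The retrieval step described in this abstract (project onto eigenvectors, then rank by L1 distance) can be sketched roughly as follows. This is a minimal illustration for a single packet-tree node, not the authors' implementation: the function names, the precomputed `mean` and `basis`, and the rule of scoring a database image by its smallest distance to any query in the set are all assumptions made for the example.

```python
def project(vector, mean, basis):
    """Project a feature vector onto each basis axis after subtracting the mean.

    `basis` is assumed to hold orthonormal eigenvectors obtained elsewhere
    (for example, by PCA on the wavelet packet coefficients at one node).
    """
    centered = [x - m for x, m in zip(vector, mean)]
    return [sum(c * b for c, b in zip(centered, axis)) for axis in basis]


def l1_distance(a, b):
    """Sum of absolute coordinate differences."""
    return sum(abs(x - y) for x, y in zip(a, b))


def retrieve(query_vectors, database, mean, basis, top_k=3):
    """Return the names of the top_k database entries closest to the query set.

    Assumption: an entry's score is its smallest L1 distance to any query.
    """
    query_coeffs = [project(q, mean, basis) for q in query_vectors]
    scored = []
    for name, vector in database.items():
        coeffs = project(vector, mean, basis)
        score = min(l1_distance(coeffs, q) for q in query_coeffs)
        scored.append((score, name))
    scored.sort()
    return [name for _, name in scored[:top_k]]
```

A caller would supply one such `mean` and `basis` per node of the packet tree and combine the per-node distances in whatever way the application requires; the abstract does not say how this combination is done.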
A new system, MUVIS, is introduced for content-based indexing and retrieval in image database management systems. In addition to traditional indexing by keywords, MUVIS allows objects and images to be indexed by color, texture, shape, and the layout of objects within them. Because the feature vectors are large, pyramid trees are used to build the index structure.
This paper is concerned with estimating a probability density function of human skin color using a finite Gaussian mixture model whose parameters are estimated through the EM algorithm. Hawkins' statistical test for normality and homoscedasticity (common covariance matrix) is applied to the estimated Gaussian mixture models, and McLachlan's bootstrap method is used to test the number of components in a mixture. Experimental results show that the estimated Gaussian mixture model fits skin images from a large database. Applications of the estimated density function in image and video databases are presented.
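To make the EM estimation step concrete, here is a minimal sketch of EM for a one-dimensional Gaussian mixture. It is a simplification under stated assumptions: skin color would normally be a multivariate feature with full covariance matrices, and the function names, the fixed iteration count, and the variance floor are illustrative choices rather than details from the paper.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a univariate normal distribution."""
    return math.exp(-(x - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def fit_gmm_1d(samples, means, variances, weights, iterations=50):
    """Run EM from the given initial parameters and return the fitted ones."""
    means, variances, weights = list(means), list(variances), list(weights)
    k = len(means)
    for _ in range(iterations):
        # E-step: posterior probability of each component for each sample.
        resp = []
        for x in samples:
            joint = [weights[j] * gaussian_pdf(x, means[j], variances[j]) for j in range(k)]
            total = sum(joint)
            resp.append([p / total for p in joint])
        # M-step: re-estimate each component from its weighted samples.
        for j in range(k):
            nj = sum(r[j] for r in resp)
            means[j] = sum(r[j] * x for r, x in zip(resp, samples)) / nj
            var = sum(r[j] * (x - means[j]) ** 2 for r, x in zip(resp, samples)) / nj
            variances[j] = max(var, 1e-6)  # floor (an assumption) to avoid collapse
            weights[j] = nj / len(samples)
    return means, variances, weights
```

The sketch takes the number of components as given; choosing that number is the separate question the abstract addresses with the bootstrap test.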
The detection of shot boundaries in video sequences is an important task for generating indexed video databases. This paper provides a comprehensive quantitative comparison of the metrics that have been applied to shot boundary detection. In addition, several standardized statistical tests that have not previously been applied to this problem, and three new metrics, are considered. A mathematical framework for quantitatively comparing metrics is supplied. Experimental results based on a video database containing 39,000 frames are included.
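As an illustration of what a shot boundary metric looks like, the sketch below uses a gray-level histogram difference between consecutive frames. This is a commonly used kind of metric chosen here for the example; the abstract does not list which metrics the paper compares, and the bin count, threshold, and frame representation (a flat list of 0 to 255 gray values) are assumptions.

```python
def gray_histogram(frame, bins=8):
    """Normalized histogram of a frame given as a flat list of 0..255 gray values."""
    hist = [0] * bins
    for value in frame:
        hist[min(value * bins // 256, bins - 1)] += 1
    total = len(frame)
    return [h / total for h in hist]


def histogram_difference(h1, h2):
    """Half the L1 distance between two normalized histograms (range 0 to 1)."""
    return 0.5 * sum(abs(a - b) for a, b in zip(h1, h2))


def detect_cuts(frames, threshold=0.5):
    """Indices i where frame i differs strongly from frame i - 1."""
    hists = [gray_histogram(f) for f in frames]
    return [
        i
        for i in range(1, len(hists))
        if histogram_difference(hists[i - 1], hists[i]) > threshold
    ]
```

A fixed threshold like this only flags abrupt cuts; comparing metrics quantitatively, as the paper does, would mean sweeping the threshold and measuring missed and false detections against ground truth.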
In this paper, we present an approach to clustering video sequences and images for efficient retrieval using relative entropy as our cost criterion. In addition, our experiments indicate that relative entropy is a good similarity measure for content-based retrieval. In our clustering work, we treat images and video as probability density functions over the extracted features. This leads us to formulate a general algorithm for clustering densities. In this context, it can be seen that a Euclidean distance between features and the Kullback-Leibler (KL) divergence give equivalent clusterings. In addition, the asymmetry of the KL divergence leads to another clustering. Our experiments indicate that this clustering is more robust to noise and distortions than the one resulting from the Euclidean norm.
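The role of relative entropy as a cost criterion can be illustrated with a short sketch that computes the KL divergence between discrete densities and assigns each density to its closest centroid. The function names, the small `eps` smoothing term, and the choice of argument order in the assignment step are assumptions for illustration; the abstract does not specify the paper's algorithm at this level of detail.

```python
import math


def kl_divergence(p, q, eps=1e-12):
    """Relative entropy D(p || q) for discrete densities given as lists.

    `eps` is an assumed smoothing term that keeps the logarithm finite
    when q has zero-probability bins.
    """
    return sum(
        pi * math.log((pi + eps) / (qi + eps))
        for pi, qi in zip(p, q)
        if pi > 0.0
    )


def assign_to_centroids(densities, centroids):
    """Index of the centroid minimizing D(density || centroid) for each density."""
    return [
        min(range(len(centroids)), key=lambda j: kl_divergence(d, centroids[j]))
        for d in densities
    ]
```

Because the divergence is asymmetric, swapping the arguments in `assign_to_centroids` generally yields a different assignment rule, which is the source of the second clustering the abstract mentions.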
A prototype content-based image retrieval system is implemented based on the algorithms introduced in this paper. High-level image contents are extracted. A fuzzy C-means classifier is employed to compute the object clusters and to provide useful information about overlapping clusters. Automatic image segmentation and categorisation are achieved. To obtain the context for image retrieval, the subjective context and the objective context are modelled by means of fuzzy set theory. The system is able to trace users' interactions during retrieval. Retrieval results can be refined as users submit queries stating their specific requirements.
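For readers unfamiliar with fuzzy C-means, the following is a minimal sketch of the standard algorithm, in which every point receives a graded membership in every cluster. It is not the system's implementation: the function name, the fuzzifier `m = 2`, the fixed iteration count, and the distance floor are assumptions.

```python
import math


def fuzzy_c_means(points, centers, m=2.0, iterations=50):
    """Standard fuzzy C-means from the given initial centers.

    Returns the final centers and the memberships computed in the last
    iteration (one row per point, one column per cluster).
    """
    centers = [list(c) for c in centers]
    power = 2.0 / (m - 1.0)
    dim = len(points[0])
    memberships = []
    for _ in range(iterations):
        # Membership update: closer centers receive higher membership.
        memberships = []
        for p in points:
            dists = [max(math.dist(p, c), 1e-12) for c in centers]
            memberships.append(
                [1.0 / sum((di / dk) ** power for dk in dists) for di in dists]
            )
        # Center update: mean of the points weighted by membership ** m.
        for j in range(len(centers)):
            w = [row[j] ** m for row in memberships]
            total = sum(w)
            centers[j] = [
                sum(wi * p[d] for wi, p in zip(w, points)) / total
                for d in range(dim)
            ]
    return centers, memberships
```

Each membership row sums to one, so a point lying between two clusters shows up as a split membership rather than a hard label, which is the kind of information about overlapping clusters the abstract refers to.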
A different approach to content-based retrieval and a novel framework for the classification of visual information are proposed. The Visual Apprentice, an implementation of the framework for still images and video, is introduced; it uses a combination of lazy learning, decision trees, and evolution programs for classification and grouping. Examples and results are given to demonstrate the applicability of the proposed approach to visual classification and detection.
The Web-based Medical Information Retrieval System (WebMIRS) allows Internet access to databases containing 17,000 digitized x-ray spine images and associated text data from the National Health and Nutrition Examination Surveys (NHANES). WebMIRS allows SQL queries of the text, and viewing of the returned text records and images using a standard browser. We are now working (1) to determine the utility of data derived directly from the images in our databases and (2) to investigate the feasibility of computer-assisted or automated indexing of the images to support retrieval of images of interest to biomedical researchers in the field of osteoarthritis. To build an initial database based on image data, we are manually segmenting a subset of the vertebrae, using techniques from vertebral morphometry. From this, we will derive vertebral features and add them to the database. This image-derived data will enhance the user's data access capability by enabling the creation of combined SQL/image-content queries.
By abstracting digital video as the corresponding binary video, a process that numerous subjective experiments suggest preserves most of the intelligibility of the video content, we can pursue a precise and analytic approach to the design of digital video storage and retrieval algorithms based on geometrical (morphological) intuition. The foremost tangible benefit of this abstraction is the immediate reduction of both the data and the computational complexity involved in implementing various algorithms and databases. The general paradigm may be used to address all issues pertaining to video library construction, including visualization, optimum feedback query generation, and object recognition, but the primary focus of this paper is the detection of fast scene changes (including in the presence of flashlights) and gradual scene changes (such as dissolves, fades, and special effects such as wipes). In simulation, we observed performance comparable to that of other methods with drastic reductions in both storage and computational complexity. Furthermore, since the conversion from grayscale to binary video can be performed directly in the compressed domain, with minimal additional computation, by thresholding the DCT DC coefficients themselves (or by using the contour information attached to MPEG-4 formats), the algorithms presented are ideally suited to fast (on-the-fly) determination of scene changes, object recognition and/or tracking, and other more intelligent tasks that traditionally place heavy demands on computation and/or storage. These fast determinations may be used on their own or in conjunction with other higher-layer information in the future.
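The binary abstraction can be illustrated with a toy sketch that thresholds each frame and flags a scene change when a large fraction of binary pixels flips between consecutive frames. This is only an assumed stand-in for intuition: the paper's morphological algorithms are not described in the abstract, the sketch works on raw gray values rather than DCT DC coefficients, and the function names and both thresholds are hypothetical.

```python
def binarize(frame, threshold=128):
    """Map a flat list of gray values to 0/1 by thresholding."""
    return [1 if value >= threshold else 0 for value in frame]


def changed_fraction(b1, b2):
    """Fraction of positions where two binary frames differ (XOR count)."""
    return sum(x ^ y for x, y in zip(b1, b2)) / len(b1)


def scene_changes(frames, pixel_threshold=128, change_threshold=0.4):
    """Indices i where binary frame i differs strongly from binary frame i - 1."""
    binary = [binarize(f, pixel_threshold) for f in frames]
    return [
        i
        for i in range(1, len(binary))
        if changed_fraction(binary[i - 1], binary[i]) > change_threshold
    ]
```

Small brightness fluctuations that stay on one side of the pixel threshold leave the binary frame unchanged, which hints at why the binary representation cuts both storage and computation. A sketch this simple would not handle flashlights or gradual transitions, which the paper treats explicitly.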