In this paper we propose a novel system of semantic feature extraction and retrieval for interior design and decoration application. The system, v2ID (virtual Interior Design), uses colored texture and spatial edge la...
详细信息
In this paper we propose a novel system of semantic feature extraction and retrieval for interior design and decoration application. The system, v2ID (virtual Interior Design), uses colored texture and spatial edge layout to obtain simple information about global room environment. We address the domain specific segmentation problem in our application and present techniques for obtaining semantic features from a room environment. We also discuss heuristics for making use of these features (color, texture, edge layout and shape) to retrieve objects from an existing database. The final resynthesized room environment with original scene and objects from database is created for the purpose of animation and virtual walk-through.
The scale-invariant feature transform (SIFT) feature plays a very important role in multimedia content analysis, such as near-duplicate image and videoretrieval. However, the storage and query costs of SIFT become un...
详细信息
The scale-invariant feature transform (SIFT) feature plays a very important role in multimedia content analysis, such as near-duplicate image and videoretrieval. However, the storage and query costs of SIFT become unbearable for large-scale databases. In this paper, SIFT features are robustly encoded with temporal information by tracking the SIFT to generate temporal-concentration SIFT (TCSIFT), which highly compresses the quantity of local features to reduce visual redundancy, and keeps the advantages of SIFT as much as possible at the same time. On the basis of TCSIFT, a novel framework for large-scale video copy retrieval is proposed in which the processes of retrieval and validation are implemented at the feature and frame level. Experimental results for two different datasets, i.e., CC_WEB_video and TRECvID, demonstrate that our method can yield comparable accuracy, compact storage size, and more efficient execution time, as well as adapt to various video transformations. (C) 2015 Elsevier B.v. All rights reserved.
The color hologram of an image has been widely used as a feature descriptor for the image in content-based retrieval applications. In this paper, some results from our investigation efforts into to usage are reported....
详细信息
The color hologram of an image has been widely used as a feature descriptor for the image in content-based retrieval applications. In this paper, some results from our investigation efforts into to usage are reported. We outline three typical color space quantization schemes used in our experiments and introduce the soft-decision histogramming method to eliminate the discontinuity problem in traditional color histogram population process. Then, to improve the effectiveness of color histogram based retrieval algorithms, several similarity metrics are proposed for comparing color histograms, including three special forms of the Kantorovich metric.
Organizing video shots into hierarchy structures is very important for efficient browsing and retrieval on large videodatabases, and many shot organizing methods have been proposed. Most algorithms are based on autom...
详细信息
Organizing video shots into hierarchy structures is very important for efficient browsing and retrieval on large videodatabases, and many shot organizing methods have been proposed. Most algorithms are based on automatic clustering schemes, which usually fail to give satisfactory results in real applications. In this paper, we proposed a preprocessing technology for interactive shot organizing - similarity sequence. It differs from traditional shot organizing methods in that it does not classify shots, instead it only reorders the shot sequence so that similar shots appear near each other, thus provides an effective interactive shot organizing interface and leaves the classification work to the user. A measure called similarity length was introduced to evaluate the similarity between adjacent shots in shot sequence, and an improved genetic algorithm was developed to calculate the similarity sequence. Basic thoughts and implementation details are provided, also with experiment results on real videos and analysis.
This paper describes the videoSTAR experimental database system that is being designed to support video applications in sharing and reusing video data and meta-data. videoSTAR provides four different repositories: for...
详细信息
ISBN:
(纸本)081941767X
This paper describes the videoSTAR experimental database system that is being designed to support video applications in sharing and reusing video data and meta-data. videoSTAR provides four different repositories: for media files, virtual documents, video structures, and video annotations/user indexes. It also provides a generic video data model relating data in the different repositories to each other, and it offers a powerful application interface. videoSTAR concepts have been evaluated by developing a number of experimental video tools, such as a video player, a video annotator, a video authoring tool, a video structure and contents browser, and a video query tool.
In general, video shots need to be clustered to form more semantically significant units, such as scenes, sequences, programs, etc. This is the so-called story-based video structuring. Automatic video structuring is o...
详细信息
ISBN:
(纸本)0819424331
In general, video shots need to be clustered to form more semantically significant units, such as scenes, sequences, programs, etc. This is the so-called story-based video structuring. Automatic video structuring is of great importance for video browsing and retrieval. The shots or scenes are usually described by one or several representative frames, called key frames. viewed from a higher level, key frames of some shots might be redundant in terms of semantics. In this paper, we propose automatic solutions to the problems of key frame computing and key frame pruning. We develop an original image similarity criterion, which considers both spatial layout and detail content in an image. Coefficients of wavelet decomposition are used to derive parameter vectors accounting for the above two aspects. The parameters exhibit (quasi-) invariant properties. The novel ''Seek and Spread (SS)'' strategy used in key frame computing allows us to obtain a Targe representative range for the key frames. Inter-shot redundancy of the key frames is suppressed using the same image similarity measure. Experimental results demonstrate the effectiveness and efficiency of our techniques.
In this paper, we describe the SS+-tree, a tree structure for supporting similarity searches in a high-dimensional Euclidean space. Compared to the SS-tree, the tree uses a tighter bounding sphere for each node which ...
详细信息
ISBN:
(纸本)0819424331
In this paper, we describe the SS+-tree, a tree structure for supporting similarity searches in a high-dimensional Euclidean space. Compared to the SS-tree, the tree uses a tighter bounding sphere for each node which is an approximation to the smallest enclosing sphere and it also makes a better use of the clustering property of the available data by using a variant of the k-means clustering algorithm as the split heuristic for its nodes. A local reorganization rule is also introduced during the tree building to reduce the overlapping between the nodes' bounding spheres.
We present a fast algorithm for computing the singular value decomposition (SvD) of a matrix consisting of the frames from a video sequence. The computational efficiency of this algorithm derives from the observation ...
详细信息
ISBN:
(纸本)0819431273
We present a fast algorithm for computing the singular value decomposition (SvD) of a matrix consisting of the frames from a video sequence. The computational efficiency of this algorithm derives from the observation that portions of a video sequence will consist of sets of correlated frames. We then show that the information obtained from the SvD can be used to analyze video sequences to obtain information such as scene breaks, scene query, reduced-order shot representation and key frame determination. We illustrate this approach on several video sequences.
In this paper we introduce our approach to multimedia database interfaces. Although we deal mainly with imagedatabases, most of the ideas we present can be generalized to other types of data. We argue that, when deal...
详细信息
In this paper we introduce our approach to multimedia database interfaces. Although we deal mainly with imagedatabases, most of the ideas we present can be generalized to other types of data. We argue that, when dealing with complex data, such as images, the problem of access must be redefined along different lines than text databases. In multimedia databases, the semantics of the data is imprecise, and depends in part on user's interpretation. This observation made us consider the development of interfaces in which the user explores the database rather than querying it. In this paper we give a brief justification of our position and present the exploratory interface that we have developed for our image database El nino.
Histogram comparison is a popular technique for image indexing. Given a query image, histogram-based techniques can retrieve similar images from a database, which were acquired under similar illumination levels. Howev...
详细信息
ISBN:
(纸本)0819424331
Histogram comparison is a popular technique for image indexing. Given a query image, histogram-based techniques can retrieve similar images from a database, which were acquired under similar illumination levels. However, these techniques fail when images are acquired under different illumination conditions. In this paper, we propose two novel histogram-based techniques which are robust to the changes in illumination. First, we propose to employ moments of the image histogram which are invariant to translation and scaling of image gray levels. Secondly, we propose to compare the parameters of histograms of the wavelet subbands for indexing. These parameters are modified appropriately to counter the effect of changes in illumination. The proposed techniques are computationally inexpensive and can be easily integrated within a wavelet-based image coder.
暂无评论