The virtual digital library, a concept that is quickly becoming a reality, offers rapid and geography-independent access to stores of text, images, graphics, motion video and other datatypes. Furthermore, a user may m...
详细信息
ISBN:
(纸本)0819420441
The virtual digital library, a concept that is quickly becoming a reality, offers rapid and geography-independent access to stores of text, images, graphics, motion video and other datatypes. Furthermore, a user may move from one information source to another through hypertext linkages. The projects described here further the notion of such an information paradigm from an end user viewpoint.
This paper presents the software architecture for DocBrowse: a system for mixed text/graphics document image analysis and retrieval. DocBrowse is an open and extensible environment that permits the user to visually ma...
详细信息
ISBN:
(纸本)0819420441
This paper presents the software architecture for DocBrowse: a system for mixed text/graphics document image analysis and retrieval. DocBrowse is an open and extensible environment that permits the user to visually manage and perform queries on highly degraded document imagedatabases. DocBrowse also serves as a research environment for developing document image analysis and query by image example (QBIE) algorithms. The system consists of a user interface, an object-relational document database and a variety of document image analysis engines. Using DocBrowse, it is possible to perform queries that retrieve documents based on both graphical and textual content. We describe the graphical user interface and visual image browser that is used to perform such queries. We also describe our approach to QBIE, the database structure, and the analysis engines incorporated in DocBrowse.
The growth of digital image and video archives is increasing the need for tools that effectively filter and efficiently search through large amounts of visual data. Towards this goal we propose a technique by which th...
详细信息
ISBN:
(纸本)0819420441
The growth of digital image and video archives is increasing the need for tools that effectively filter and efficiently search through large amounts of visual data. Towards this goal we propose a technique by which the color content of images and videos is automatically extracted to form a class of meta-data that is easily indexed. The color indexing algorithm uses the back- projection of binary color sets to extract color regions from images. This technique provides for both the automated extraction of regions and representation of their color content. It overcomes some of the problems with color histogram techniques such as high-dimensional feature vectors, spatial localization, indexing and distance computation. We present the binary color set back-projection technique and discuss its implementation in the VisualSEEk content- based image/videoretrieval system for the World Wide Web. We also evaluate the retrieval effectiveness of the color set back-projection method and compare its performance to other color imageretrieval methods.
Color space selection and quantization are critical to content-based imageretrieval based on color histograms. In this work, we first examine the color distribution in different color spaces including RGB, HSV, YUV a...
详细信息
ISBN:
(纸本)0819420441
Color space selection and quantization are critical to content-based imageretrieval based on color histograms. In this work, we first examine the color distribution in different color spaces including RGB, HSV, YUV and Munsell spaces and discuss the appropriate quantization strategies in these color spaces based on the distribution of colors. Then, we propose a color quantization scheme which applies the Lloyd-Max quantizer along each axis independently. The proposed scheme is simple yet efficient. retrieval using the proposed quantization scheme on different color spaces are compared through experiments.
HNC Software Inc. has developed a technology for automatic indexing and retrieval of free text and images. This technique is based on the concept of 'context vectors' which encode a succinct representation of ...
详细信息
ISBN:
(纸本)0819420441
HNC Software Inc. has developed a technology for automatic indexing and retrieval of free text and images. This technique is based on the concept of 'context vectors' which encode a succinct representation of the associated text and features of images. In this paper, we describe some technical issues in the image content addressable retrieval system (ICARS) including image context vector representation, clustering algorithm, retrieval and indexing techniques. ICARS has the capability to retrieve images based on similarity of content by texture and color without performing segmentation or object recognition.
Automated analysis and annotation of video sequences are important for digital video libraries, content-based video browsing and data mining projects. A successful video annotation system should provide users with use...
详细信息
ISBN:
(纸本)0819424331
Automated analysis and annotation of video sequences are important for digital video libraries, content-based video browsing and data mining projects. A successful video annotation system should provide users with useful video content summary in a reasonable processing time. Given the wide variety of video genres available today, automatically extracting meaningful video content for annotation still remains hard by using current available techniques. However, a wide range video has inherent structure such that some prior knowledge about the video content can be exploited to improve our understanding of the high-level video semantic content. In this paper, we develop tools and techniques for analyzing structured video by using the low-level information available directly from MPEG compressed video. Being able to work directly in the video compressed domain can greatly reduce the processing time and enhance storage efficiency. As a testbed, we have developed a basketball annotation system which combines the low-level information extracted from MPEG stream with the prior knowledge of basketball video structure to provide high level content analysis, annotation and browsing for events such as wide-angle and close-up views, fast breaks, steals, potential shots, number of possessions and possession times. We expect our approach can also be extended to structured video in other domains.
The paper addresses a fundamental problem for imageretrieval systems: how is the content information to be used in answering user queries? Our answer to this question is a retrieval model based on logic that offers: ...
详细信息
ISBN:
(纸本)0819420441
The paper addresses a fundamental problem for imageretrieval systems: how is the content information to be used in answering user queries? Our answer to this question is a retrieval model based on logic that offers: (a) an abstract representation of the visual appearance of an image allowing to incorporate in a principled way any imageretrieval technique based on the similarity of physical features such as region, color, and shape: (b) a semantic data modeling styled representation of the image content, independent from how the content information is obtained;(c) a functional representation of the association between portions of the image form and content objects. These three-leveled image representations are queried via a logical language spanning along four dimensions: the visual dimension, in which queries are images themselves, and the content, mapping, and spatial dimensions, in which queries are symbolic expressions. An image is retrieved in response to a query if it satisfies, in a logical sense, the query.
In this paper we propose a media-independent knowledge indexing and retrieval system as a basis for an information retrieval system. The representation allows for sharing of low level information bearing objects and a...
详细信息
ISBN:
(纸本)0819420441
In this paper we propose a media-independent knowledge indexing and retrieval system as a basis for an information retrieval system. The representation allows for sharing of low level information bearing objects and at the same time allows for maintaining of user-dependent views. The tools for maintenance and manipulation of concepts focus on the user and user's intentions. The aim of the system is to provide a set of flexible tools and let the user structure the knowledge in his or her own way, instead of attempting to build an all-encompassing common sense, or general knowledge representation.
This paper proposes a design procedure that exploits storage devices with different cost/bandwidth and cost/capacity ratios to build a hierarchical storage system for video-on- demand with minimum cost. The storage sy...
详细信息
ISBN:
(纸本)0819420441
This paper proposes a design procedure that exploits storage devices with different cost/bandwidth and cost/capacity ratios to build a hierarchical storage system for video-on- demand with minimum cost. The storage system is assumed to have two levels of hierarchy. The level-1 storage devices feature a low cost/bandwidth ratio and a high cost/capacity ratio. On the other hand, the level-2 storage devices feature a high cost/bandwidth ratio and a low cost/capacity ratio. The proposed procedure determines, with respect to overall system cost, which level of the hierarchy each program should be placed into. Based on the decisions, the designer then can figure out the appropriate configuration of the hierarchical storage system. The optimization target is to minimize the overall cost of the storage system.
Content based imageretrieval techniques are being developed for automatic indexing and retrieval of images in many applications. One of the main features of an image is its dominant colors, hence the development of c...
详细信息
ISBN:
(纸本)0819420441
Content based imageretrieval techniques are being developed for automatic indexing and retrieval of images in many applications. One of the main features of an image is its dominant colors, hence the development of color based imageretrieval techniques. In these techniques, images are indexed using their dominant colors and images with perceptually similar dominant colors to the query are retrieved. In a large image database, images may come from many different sources, may be captured using different devices and represented using different color spaces. These differences may be subtle, but result in different meanings of image data. If these differences are not accounted for, imageretrieval performance may suffer. In existing systems, this factor is normally not considered. In this paper, we present various different image representations. We then discuss effects on retrieval performance using color histograms when images in a database are represented differently. It is shown that different image representations have serious effects on imageretrieval performance. We discuss the conversion between different image representations, and information required to carry out these conversions. This information is normally not available in most current image formats, which indicates a need of a common color image interchange format.
暂无评论