The general problem of object recognition is difficult and often requires a large amount of computing resources, even for locating an object within a single image. How, then, can it be possible to build a tool for ind...
详细信息
ISBN:
(纸本)0819411418
The general problem of object recognition is difficult and often requires a large amount of computing resources, even for locating an object within a single image. How, then, can it be possible to build a tool for indexing into a large database of, say, thousands of images, which works effectively in `interactive time' on affordable hardware? One important optimization is to take advantage of interaction with the user to find out what types of variation are expected in the database, and to rely on the user to discriminate between similar-looking objects. Another is to create appropriate data structures off-line to speed on-line searches. We are building a tool, called FINDIT, for locating the image of an object from within a large number of images of scenes which may contain the object. The user outlines an object in an image that he wants to find in the database, and specifies the constraints on the transformations of the object that are expected to occur. The program acts as a filter to quickly reduce the possible number of candidates to a number small enough to be perused by the user. FINDIT chooses an appropriate search algorithm depending on the selection of constraints by the user.
Professionals in various fields such as medical imaging, biology, and civil engineering require rapid access to huge amounts of uncompressed pixmap image data. In order to fulfill these requirements, a parallel image ...
详细信息
ISBN:
(纸本)0819411418
Professionals in various fields such as medical imaging, biology, and civil engineering require rapid access to huge amounts of uncompressed pixmap image data. In order to fulfill these requirements, a parallel image server architecture is proposed, based on arrays of intelligent disk nodes, each disk node being composed of one processor and one disk. Pixmap image data is partitioned into rectangular extents, whose size and distribution among disk nodes minimize overall image access times. Disk node processors are responsible for maintaining both the data structure associated with their image file extents and an extent cache offering fast access to recently used data. Disk node processors may also be used for applying image processing operations to locally retrieved image parts. This contribution introduces the concept of an image oriented file system, where the file system is aware of image size, extent size, and extent distribution. Such an image oriented file system provides a natural way of combining parallel disk accesses and processing operations. The performance of the proposed multiprocessor-multidisk architecture is bounded either by communication throughput or by disk access speed. However, when disk accesses are combined with low-level local processing operations such as image size reduction (zooming), close to linear speedup factors can be obtained by increasing the number of intelligent disk nodes.
In this paper, a top-down data placement methodology for a large interactive multimedia information system (MMIS) on a single spindle multi-disk environment such as a Jukebox is presented. The objective of this work i...
详细信息
ISBN:
(纸本)0819411418
In this paper, a top-down data placement methodology for a large interactive multimedia information system (MMIS) on a single spindle multi-disk environment such as a Jukebox is presented. The objective of this work is to minimize average disk seek time as well as the number of platter switches for Jukebox. A large data placement problem can be divided into a number of small data placement problems by weighted graph decomposition. The Kernighan-Lin partitioning algorithm is recursively applied for this purpose. Once the graph is fully partitioned, the objects in the same subgraph are assigned to the same disk. The data placement within a disk is divided into two stages, global data placement and detailed data placement. The expected access patterns of global data placement are modeled as a time-homogeneous ergodic Markov Chain, from which the stationary probability for each node of the browsing graph can be found. Based on these probabilities, we define an expected access cost. Then, the problem of global data placement is posed as an optimization problem, and various clustering and storage layout algorithms are proposed.
We report a database language for visual retrieval which allows queries on image feature information which has been computed and stored along with images. The language is novel in that it provides facilities for deali...
详细信息
ISBN:
(纸本)0819411418
We report a database language for visual retrieval which allows queries on image feature information which has been computed and stored along with images. The language is novel in that it provides facilities for dealing with feature data which has actually been obtained from image analysis. Each line in the Manchester Visual Query Language (MVQL) takes a set of objects as input and produces another, usually smaller, set as output. The MVQL constructs are mainly based on proven operators from the field of digital image analysis. An example is the Hough-group operator which takes as input a specification for the objects to be grouped, a specification for the relevant Hough space, and a definition of the voting rule. The output is a ranked list of high scoring bins. The query could be directed towards one particular image or an entire image database, in the latter case the bins in the output list would in general be associated with different images. We have implemented MVQL in two layers. The command interpreter is a Lisp program which maps each MVQL line to a sequence of commands which are used to control a specialized database engine. The latter is a hybrid graph/relational system which provides low-level support for inheritance and schema evolution. In the paper we outline the language and provide examples of useful queries. We also describe our solution to the engineering problems associated with the implementation of MVQL.
The multimedia interactive conferencing application (MICA) is a personal-workstation application for multipoint visual teleconferencing. It allows people at two or more locations to share visual material such as docum...
详细信息
The multimedia interactive conferencing application (MICA) is a personal-workstation application for multipoint visual teleconferencing. It allows people at two or more locations to share visual material such as documents, photographs, and computer screens, in a highly interactive way. It supports the distribution, storage, retrieval and high-quality display of visuals, real-time interaction by pointing and annotation, and meeting services facilities. In this paper we establish the context of multimedia teleconferencing and computer supported cooperative work, relating earlier research to our design of MICA. We outline the services MICA offers, and then focus on two of the major technical challenges: the handling, compression and display of multiple media, and the design of a suitable user interface. A third major area, the multipoint service that supports the application, is detailed in a companion paper.
The article discusses the incorporation of textile images into the University of Maryland Historic Textile Database by a computer user. The author chose commercially available software and hardware that did not requir...
详细信息
The article discusses the incorporation of textile images into the University of Maryland Historic Textile Database by a computer user. The author chose commercially available software and hardware that did not require a dedicated workstation and could be upgraded without changing the software. Although DBASE III PLUS was initially selected for the database, the author chose PICTUREPOWER(TM) as the image capture system when it became apparent that the database would be a more useful research tool with images. The author discusses the problems that arose with the software and video camera and how she modified the system.
The conference proceedings incorporates 29 papers that are subdivided into six sessions. These deal with optical media technologies;recording system technologies;mass storage systems and applications;and small-systems...
详细信息
ISBN:
(纸本)0819402958
The conference proceedings incorporates 29 papers that are subdivided into six sessions. These deal with optical media technologies;recording system technologies;mass storage systems and applications;and small-systems applications and peripherals. Topics considered include: computer graphics, databases, the publishing industry, imagestorage, optical disks, medical imaging, medical records, image processing, law enforcement, criminal identification, digital memory, imagestorage and retrieval, video recording, telescopes, recording heads, and magnetooptical recording.
Multimedia information technologies, which provide comprehensive and intuitive information for a broad range of applications, have a strong impact on modem life, and have changed our way of learning and thinking. Over...
详细信息
ISBN:
(数字)9783662053003
ISBN:
(纸本)9783540002444;9783642055331
Multimedia information technologies, which provide comprehensive and intuitive information for a broad range of applications, have a strong impact on modem life, and have changed our way of learning and thinking. Over the past two decades, there has been an explosive growth in the use of digital multimedia (including audio, video, images and graphics) over the Internet and wireless communication. As the use of digital multimedia increases, effective data storage and management become increasingly important. In fields which use large quantities of data (e. g. audio, video, image and digital libraries; geographical and medical imagedatabases; etc), we need to minimize the volume of data stored while meeting the often conflicting demand for accurate data representation. In addition, the data need to be managed such that it facilitates efficient searching, browsing and cooperative work. This area has been a very active research area in recent years. This book will provide readers with an up-to-date and comprehensive picture of cutting edge technologies in multimedia information retrieval and management, which directly affect our industry, economy and social life The book is divided into two major parts: Technological Fundamentals which covers the core theories of the area; and Applications which describes the broad range of practical uses for this technology.
images and video play a crucial role in visual information systems and multimedia. There is an extraordinary number of applications of such systems in entertainment, business, art, engineering, and science. Such ap...
详细信息
ISBN:
(数字)9789401596640
ISBN:
(纸本)9781402001093;9789048158638
images and video play a crucial role in visual information systems and multimedia. There is an extraordinary number of applications of such systems in entertainment, business, art, engineering, and science. Such applications often involved large image and video collections, and therefore, searching for images and video in large collections is becoming an important operation. Because of the size of such databases, efficiency is crucial. We strongly believe that image and videoretrieval need an integrated approach from fields such as image processing, shape processing, perception, database indexing, visualization, and querying, etc.;This book contains a selection of results that was presented at the Dagstuhl Seminar on Content-Based image and videoretrieval, in December 1999. The purpose of this seminar was to bring together people from the various fields, in order to promote information exchange and interaction among researchers who are interested in various aspects of accessing the content of image and video data. The book provides an overview of the state of the art in content-based image and videoretrieval. The topics covered by the chapters are integrated system aspects, as well as techniques from image processing, computer vision, multimedia, databases, graphics, signal processing, and information theory.;The book will be of interest to researchers and professionals in the fields of multimedia, visual information (database) systems, computer vision, and information retrieval.
暂无评论