ISBN: (print) 9780130471178
From the Book: PREFACE The explosion of on-line web information has given rise to many query-based text search engines (such as Alta Vista) and manually constructed topic hierarchies (such as Yahoo!). With the current growth rate of web information, especially broadband multimedia data, query data are growing incomprehensibly large and manual classification in topic hierarchies is creating a major bottleneck. Consequently, the huge amount of multimedia data is imposing on people a heavy burden of manipulating, searching, interpreting, skimming, and integrating information. Thus, efficient multimedia content analysis tools are needed to address these users' needs. This book presents a solution to problems arising from the demand for fast information access and for sharing in real-time multimedia transmission over the Internet. The solution exploits software agents that are placed throughout the network environment. These hierarchical video analysis agents process multimedia streams in real time, and automatically decompose and understand the multimedia content so as to facilitate information access and sharing. Multimedia content contains both the perceptual content, such as color, motion, or acoustic features, and the conceptual content, which is specified based on concepts or semantics that can be expressed by text descriptions. Both types of content are embedded simultaneously in multimedia streams, and usually are complementary to each other. This book adaptively analyzes both kinds of video content by combining mixed media cues from audio, video and text. First, a high-performance module for on-line video segmentation based on scene-change detection is described. The module serves as the first step of any video stream construction and analysis. To meet the high computational demand, our proposed video scene change detection algorithms are very efficient while maintaining high accuracy and recall rates for fast on-line video analysis. Second, the percep
Authors: Lee, S.
Chung Ang Univ, Dept Image Engn, Grad Sch Adv Imaging Sci Multimedia & Film, Seoul 156756, South Korea
A compressed image reproduction scheme is proposed that decomposes and manipulates the coefficients of the discrete cosine transform (DCT) directly in the compressed domain. The basic idea of the proposed approach is to decompose each DCT block into several sub-blocks and to adjust the brightness and detail components of a given image, compressing its dynamic range and enhancing its contrast. Image reproduction based on the sub-block decomposition can be done more precisely than with approaches that operate on whole blocks. First, the DCT coefficients of each block are decomposed into several sub-blocks. Next, each sub-block's coefficients are separated into brightness and detail components, which are treated differently according to content analysis. Then, the enhanced coefficients are projected onto the constraint sets to avoid some artefacts, and are composed back into the original order. The main advantages of the proposed algorithm are that (i) it can enhance the dynamic range and details without affecting the compressibility of the given image, since it operates directly in the compressed domain, and (ii) it avoids boosting blocking artefacts around strong edges, with no further processing required. To evaluate the proposed scheme, several baseline approaches are described and compared using enhancement quality measures.
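The split into brightness and detail components can be illustrated with a minimal sketch. It is not the scheme from the abstract: it assumes, for illustration only, that the brightness of a block is its DC coefficient and the detail is everything else (the AC coefficients), and it skips the sub-block decomposition and the projection onto constraint sets. The function names and the parameters `dc_ref`, `dc_gain`, and `ac_gain` are hypothetical.

```python
import numpy as np

def dct_matrix(n: int = 8) -> np.ndarray:
    """Orthonormal DCT-II basis matrix of size n x n."""
    k = np.arange(n).reshape(-1, 1)  # frequency index (rows)
    x = np.arange(n).reshape(1, -1)  # sample index (columns)
    c = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * x + 1) * k / (2 * n))
    c[0, :] = np.sqrt(1.0 / n)       # constant (DC) basis vector
    return c

def split_block(coeffs: np.ndarray):
    """Assumption: brightness = DC coefficient, detail = AC coefficients."""
    brightness = np.zeros_like(coeffs)
    brightness[0, 0] = coeffs[0, 0]
    detail = coeffs - brightness
    return brightness, detail

def enhance_block(coeffs, dc_ref, dc_gain=0.8, ac_gain=1.5):
    """Pull the DC value toward dc_ref (range compression) and
    amplify the AC coefficients (contrast enhancement)."""
    brightness, detail = split_block(coeffs)
    brightness[0, 0] = dc_ref + dc_gain * (brightness[0, 0] - dc_ref)
    return brightness + ac_gain * detail

# Usage: transform one 8x8 block, enhance it, and transform back.
C = dct_matrix(8)
block = np.tile(np.linspace(50.0, 200.0, 8), (8, 1))  # horizontal ramp
coeffs = C @ block @ C.T                               # forward 2-D DCT
enhanced = enhance_block(coeffs, dc_ref=8 * 128.0)     # DC of a mid-gray block
restored = C.T @ enhanced @ C                          # inverse 2-D DCT
```

With an orthonormal 8x8 DCT the DC coefficient equals eight times the block mean, which is why `dc_ref` is given as `8 * 128.0` for a mid-gray reference. Because the operation touches only coefficients, it could in principle be applied to dequantized coefficients without a full decode to pixels, which is the property the abstract emphasizes.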
Due to the growing number of wireless communication devices and emerging bandwidth-intensive applications, the demand for data usage is increasing rapidly. Utilizing various radio access technologies and multiple frequency bands in wireless networks can provide efficient solutions to meet the growing demand for data. These techniques are promising for the fifth generation (5G) wireless communication systems. However, to fully exploit their benefits, spectrum and spatial reuse, power saving, throughput and utility enhancement are crucial issues. In this thesis, we propose different resource allocation algorithms to address the aforementioned issues in wireless communication networks. First, we study the resource allocation problem for a hybrid overlay/underlay cognitive cellular network. We propose a hybrid overlay/underlay spectrum access mechanism to improve the spectrum and spatial reuse. We formulate the resource allocation problem as a coalition formation game among femtocell users, and analyze the stability of the coalition structure. We propose an efficient algorithm based on the solution concept of recursive core. The proposed algorithm achieves a stable and efficient spectrum allocation. Next, we study the resource allocation problem for multimedia content delivery in millimeter wave (mmWave) based home networks. We characterize different usage scenarios of multimedia content delivery. We formulate a joint power and channel allocation problem, which captures the spectrum and spatial reuse of mmWave communications, based on a network utility maximization framework. The problem is a non-convex mixed integer programming (MIP) problem. We reformulate the non-convex MIP problem into a convex MIP problem and propose a resource allocation algorithm based on the outer approximation method. We also develop an efficient heuristic algorithm which has a substantially lower complexity than the outer approximation based algorithm. Finally, we study full-duplex relay-assiste
With the increasing popularity of the WWW, the main challenge in computer science has become content-based retrieval of multimedia objects. Access to multimedia objects in databases has long been limited to the information provided in manually assigned keywords. Now, with the integration of feature-detection algorithms in database systems software, content-based retrieval can be fully integrated with query processing. We describe our experimentation platform under development, making database technology available to multimedia. Our approach is based on the new notion of feature databases. Its architecture fully integrates traditional query processing and content-based retrieval techniques.
Advances in digital storage and processing speed have made feasible the creation of large image databases with rapid access to individual items stored therein. The huge data sizes of images and the enormous number of images in a typical image database, coupled with inexact nature and subjective interpretations, have called for content-based retrieval systems. Fast and accurate retrievals are crucial for such systems to be used efficiently. This project provides an overview of the Content-Based Image Retrieval (CBIR) techniques developed recently. Research directions and currently available CBIR systems are presented. Important issues such as image segmentation algorithms, image logic structure and spatial relationships, spatial access methods and Query by Visual Example (QVE) techniques are discussed in detail. A prototype image retrieval system called IMAGESEEK is implemented using the JAVA programming language. The system enables the search of natural colour images and demonstrates the various ideas of Query by Visual Example techniques. A framework for CBIR systems is proposed. Experimental results of different QVE algorithms are discussed and compared with each other. The system has been successful in retrieving images from our sample data sets by their global and local colours. The user-friendly interface of IMAGESEEK allows the user to tailor and refine the query interactively by changing the retrieval algorithm, the threshold value, the weights, and the selected region of the query image. IMAGESEEK provides us with a way to understand the key issues of CBIR techniques. It is a small but valuable component in the collection of multimedia retrieval systems.
Motion is one of the most prominent features of video. For content-based video retrieval, the motion trajectory is the intuitive specification of motion features. In this paper, approaches for video retrieval via a single motion trajectory and via multiple motion trajectories are addressed. For retrieval via a single motion trajectory, the trajectory is modeled as a sequence of segments and each segment is represented by its slope. Two quantitative similarity measures and corresponding algorithms based on sequence similarity are presented. For retrieval via multiple motion trajectories, the trajectories of the video are modeled as a sequence of symbolic pictures. Four quantitative similarity measures and algorithms, which are also based on sequence similarity, are proposed. All the proposed algorithms are developed using the dynamic programming approach.
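The single-trajectory case can be sketched as follows. The abstract does not define its similarity measures, so this is only one plausible instance of the general idea: each segment is reduced to its direction angle (used here instead of a raw slope so that vertical segments stay finite), and two angle sequences are compared with an edit-distance style dynamic program. The function names and the `gap_cost` parameter are assumptions, not taken from the paper.

```python
import math
from typing import List, Tuple

Point = Tuple[float, float]

def segment_angles(trajectory: List[Point]) -> List[float]:
    """Direction angle (radians) of each segment between consecutive points."""
    return [
        math.atan2(y1 - y0, x1 - x0)
        for (x0, y0), (x1, y1) in zip(trajectory, trajectory[1:])
    ]

def angle_diff(a: float, b: float) -> float:
    """Smallest absolute difference between two angles, in [0, pi]."""
    d = abs(a - b) % (2 * math.pi)
    return min(d, 2 * math.pi - d)

def trajectory_distance(a: List[float], b: List[float],
                        gap_cost: float = math.pi / 2) -> float:
    """Edit-distance style dynamic program over two angle sequences.

    Matching two segments costs their angular difference; skipping a
    segment in either sequence costs gap_cost (a hypothetical choice).
    """
    n, m = len(a), len(b)
    dp = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dp[i][0] = i * gap_cost
    for j in range(1, m + 1):
        dp[0][j] = j * gap_cost
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            dp[i][j] = min(
                dp[i - 1][j - 1] + angle_diff(a[i - 1], b[j - 1]),  # match
                dp[i - 1][j] + gap_cost,                            # skip in a
                dp[i][j - 1] + gap_cost,                            # skip in b
            )
    return dp[n][m]

# Usage: compare a query trajectory with a stored one.
query = segment_angles([(0, 0), (1, 0), (2, 1)])
stored = segment_angles([(5, 5), (6, 5), (7, 6), (8, 6)])
distance = trajectory_distance(query, stored)
```

Because only segment directions enter the comparison, the distance is unchanged when a trajectory is translated within the frame; a smaller value means more similar motion.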
ISBN: (print) 0818679832
The proceedings contain 12 papers. The topics discussed include: ImageRover: a content-based image browser for the World Wide Web; distinguishing photographs and graphics on the World Wide Web; locating deciduous trees; increasing retrieval efficiency by index tree adaptation; models and algorithms for efficient color image indexing; region-based image querying; efficient content extraction in compressed images; a Bayesian video modeling framework for shot segmentation and content characterization; retrieving images by similarity of visual appearance; a relevance feedback architecture for content-based multimedia information retrieval systems; and training templates for scene classification using a few examples.
The explosive growth of the Internet has come with increasing diversity and heterogeneity in terms of client device capability, network bandwidth, and user preferences. To date, most Web content has been designed with desktop computers in mind, and often contains rich media such as images, audio, and video. In many cases, this content is not suitable for devices like netTVs, handheld computers, personal digital assistants, and smart phones with relatively limited display capability, storage, processing power, and network access. Thus, Internet access is still constrained on these devices and there is a need to develop alternative approaches for information delivery. In this paper, we propose a framework for adaptive content delivery in heterogeneous environments. The goal is to improve content accessibility and perceived quality of service for information access under changing network and viewer conditions. The framework includes content adaptation algorithms, client capability and network bandwidth discovery methods, and a Decision Engine for determining when and how to adapt content. We describe this framework, initial system implementations based upon this framework, and the issues associated with the deployment of such systems based on different architectures.
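The role of such a Decision Engine can be pictured with a toy rule-based sketch. The abstract does not describe the engine's rules, inputs, or outputs, so everything below is hypothetical: the `ClientProfile` and `MediaItem` types, their fields, and the four action names are invented for illustration only.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class ClientProfile:
    """Hypothetical result of client capability discovery."""
    screen_width_px: int
    supports_video: bool

@dataclass(frozen=True)
class MediaItem:
    """Hypothetical description of one piece of Web content."""
    kind: str          # "image", "audio", or "video"
    width_px: int      # 0 for audio
    bitrate_kbps: int  # 0 for still images

def decide(item: MediaItem, client: ClientProfile, bandwidth_kbps: int) -> str:
    """Return the name of an adaptation action for one media item."""
    # Video that cannot be played or streamed is replaced by a still image.
    if item.kind == "video" and (
        not client.supports_video or item.bitrate_kbps > bandwidth_kbps
    ):
        return "replace_with_keyframe_image"
    # Images wider than the display are scaled down.
    if item.kind == "image" and item.width_px > client.screen_width_px:
        return "downscale"
    # Any remaining stream that exceeds the measured bandwidth is transcoded.
    if item.bitrate_kbps > bandwidth_kbps:
        return "transcode_lower_bitrate"
    return "deliver_as_is"

# Usage: a small-screen client without video support on a slow link.
pda = ClientProfile(screen_width_px=240, supports_video=False)
action = decide(MediaItem("video", 640, 1500), pda, bandwidth_kbps=56)
```

A fixed rule list keeps the sketch readable; a deployed engine would presumably also weigh user preferences and perceived quality, which the abstract names as goals but does not specify.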
ISBN: (print) 0819442437
Facilitating efficient video manipulation and access in a web-based environment is becoming a popular trend for video applications. In this paper, we present a web-oriented video management and application processing system, based on our previous work on multimedia database and content-based retrieval. In particular, we extend the VideoMAP architecture with specific web-oriented mechanisms, which include: (1) Concurrency control facilities for the editing of video data among different types of users, such as Video Administrator, Video Producer, Video Editor, and Video Query Client; different users are assigned various priority levels for different operations on the database. (2) A versatile video retrieval mechanism which employs a hybrid approach by integrating a query-based (database) mechanism with content-based retrieval (CBR) functions; its specific language (CAROL/ST with CBR) supports spatiotemporal semantics of video objects, and also offers an improved mechanism to describe the visual content of videos through content-based analysis methods. (3) A query profiling database which records the "histories" of various clients' query activities; such profiles can be used to provide the default query template when a similar query is encountered by the same kind of users. An experimental prototype system is being developed based on the existing VideoMAP prototype system, using Java and VC++ on the PC platform.