The availability of inexpensive CMOS cameras and microphones that can ubiquitously capture multimediacontent from the environment is fostering the development of wireless multimedia sensor networks (WMSNs), i.e., dis...
详细信息
The availability of inexpensive CMOS cameras and microphones that can ubiquitously capture multimediacontent from the environment is fostering the development of wireless multimedia sensor networks (WMSNs), i.e., distributed systems of wirelessly networked devices that can retrieve video and audio streams, still images, and scalar sensor data. A new cross-layer rate control scheme for WMSNs is introduced in this paper with a twofold objective: (i) maximize the video quality of each individual video stream;(ii) maintain fairness in the domain of video quality between different video streams. The rate control scheme is based on analytical and empirical models of video distortion and consists of a new cross-layer control algorithm that jointly regulates the end-to-end data rate, the video quality, and the strength of the channel coding at the physical layer. The end-to-end data rate is regulated to avoid congestion while maintaining fairness in the domain of video quality rather than data rate. Once the end-to-end data rate has been determined, the sender adjusts the video encoder rate and the channel encoder rate based on the overall rate and the current channel quality, with the objective of minimizing the distortion of the received video. Simulations show that the proposed algorithm considerably improves the received video quality with respect to state-of-the art rate control algorithms, without sacrificing on fairness. (C) 2010 Elsevier B.V. All rights reserved.
The availability of inexpensive CMOS cameras and microphones that can ubiquitously capture multimediacontent from the environment is fostering the development of wireless multimedia sensor networks (WMSNs), i.e., dis...
详细信息
multimedia information and embedded systems are two major technological advances that have significantly changed the way people interact with systems and information in recent years. In this context, audio proves to b...
详细信息
multimedia information and embedded systems are two major technological advances that have significantly changed the way people interact with systems and information in recent years. In this context, audio proves to be the most advantageous media for interacting with embedded systems and their content. Advantages include: hands-free operation;unattended interaction;and simple, cheap devices for capture and playback. The use of embedded systems to seek information stored locally or on the web points up several difficulties inherent in the nature of multimedia-information signals. These difficulties are especially evident when palmtop or deeply embedded devices are used for such purposes. Developing a set of digital-signal-processing- based algorithms for extracting audio information is a primary step toward providing user-friendly access to multimedia information and developing powerful communication interfaces. The algorithms aim to extract semantic and syntactic information from audio signals, including voice. Extracted audio features are employed to access information in multimedia databases, as well as to index it. More extensive, higher-level information, such as audio-source identification (speaker identification) and genre (in the case of music), must be extracted from the audio signal. One basic task involves transforming audio into symbols (e.g. music transformed into a score, speech transformed into text) and transcribing symbols into audio (e.g. score transformed into musical audio, text transformed into speech). The purpose is to search for and access any kind of multimedia information by means of audio. To attain these results, digital audio-processing, digital speech-processing, and soft-computing methods need to be integrated. Neural networks are used as classifiers and fuzzy logic is used for making smart decisions.
The proceedings contain 15 papers. The topics discussed include: discriminative genre-independent audio-visual scene change detection;a random walk through human behavior;flexible user interface for efficient content-...
The proceedings contain 15 papers. The topics discussed include: discriminative genre-independent audio-visual scene change detection;a random walk through human behavior;flexible user interface for efficient content-based video surveillance retrieval: design and evaluation;an automated object-level video editing tool;ImageSeeker: a content-based image retrieval system;extraction of salient regions of interest using visual attention models;research on subjective stereoscopic image quality assessment;image quality assessment in multimedia applications;document description: what works for images should also work for text?;an annotation database for multimedia scientific data;a model of multimodal fusion for medical applications;and binary and nonbinary description of hypointensity for search and retrieval of brain MR images.
作者:
Ciocca, G.Marini, F.Schettini, R.DISCo
Dipartimento di Informatica Sistemistica e Comunicazione Universita Degli Studi di Milano-Bicocca Viale Sarca 336 20126 Milano Italy
In the framework of multimedia applications image quality may have different meanings and interpretations. In this paper, considering the quality of an image as the degree of adequacy to its function/goal within a spe...
详细信息
作者:
Ciocca, G.Cusano, C.Schettini, R.DISCo
Dipartimento di Informática Sistemistica e Comunicazione Universita degli Studi di Milano-Bicocca Viale Sarca 336 20126 Milano Italy
Although traditional content-based retrieval systems have been successfully employed in many multimedia applications, the need for explicit association of higher concepts to images has been a pressing demand from user...
详细信息
In many areas of commerce, government, academia, and medicine, large collections of digital images are being used. Usually, the only way of searching these collections is by their name, or by browsing which is unpract...
详细信息
The major drawback of interactive retrieval systems is the potential frustration of the user that is caused by an excessive labelling work. Active learning has proven to help solving this issue, by carefully selecting...
详细信息
QoS-oriented content delivery is an important component in en-learning systems. In this paper we study the object placement problem in the parallel-access context for QoS-oriented content delivery. The problem is form...
详细信息
ISBN:
(纸本)9781424439294
QoS-oriented content delivery is an important component in en-learning systems. In this paper we study the object placement problem in the parallel-access context for QoS-oriented content delivery. The problem is formulated in terms of bandwidth maximization and further proved to be equivalent to the Fault Tolerant Facility Location Problem (FTFL). In this problem the bandwidth that violates the triangle inequality is measured by a cost function which forms a metric when the shortest-path routing is deployed. Though there have been efficient UFL algorithms with good performance guarantees, the challenge of the FTFL Problem arises due to the conflict of its connectivity requirement (multiple distinct connections) and cost minimization (which leads to identical connections with the minimum cost). We investigate possible solutions for addressing this conflict after giving an overview of the existing techniques. Extension of the problem to the context of QoE-aware multimedia delivery which has important applications in e-learning systems is also discussed.
In the context of the European Cantata project (ITEA project, 2006-2009), within Barco, a complete Multi-content Analysis framework was developed for detection and analysis of compound images. The framework consists o...
详细信息
暂无评论