The proceedings contains 95 papers from the SPIE 2001 Conference on visualcommunications and imageprocessing. The topics discussed include: image coding;image analysis;video coding algorithms;stereo and multiview pr...
详细信息
The proceedings contains 95 papers from the SPIE 2001 Conference on visualcommunications and imageprocessing. The topics discussed include: image coding;image analysis;video coding algorithms;stereo and multiview processing;video coding implementations;motion estimation;error-resilent coding;image sequencing analysis;face tracking and recognition and wireless video.
This paper presents the importance of error resilience tools in visualcommunications. Error resilience tools provide a mechanism enabling transmission of visual information over channels with residual random bit erro...
详细信息
This paper presents the importance of error resilience tools in visualcommunications. Error resilience tools provide a mechanism enabling transmission of visual information over channels with residual random bit errors in the received bit stream. The benefits of using error resilience tools are proven by devising the analytic relationship between the time delay in a transparent channel using automatic repeat request (ARQ) and the equivalent residual bit error, rate in a nontransparent channel. The error resilience tools make it possible to achieve an acceptable visual quality even in the presence of these residual errors. Our work is related and compared to the standardization work of the next-generation still image compression system JPEG2000. The results show that partial and complete substitution of the quantization and symbol encoding in visual compression systems by robust error resilience tools provides a significant increase in robustness. Three error resilience tools are discussed: (1) substitution of the quantization and symbol encoding by a fixed length coding scheme, (2) substitution by a mixed fixed length coding and variable length coding scheme, and (3) substitution of the variable length coding by reversible variable length coding.
The article focuses on the audio and video analysis for multimedia interactive services. It describes a system that automates home video editing. It automatically extracts a set of highlight segments from a set of raw...
详细信息
The article focuses on the audio and video analysis for multimedia interactive services. It describes a system that automates home video editing. It automatically extracts a set of highlight segments from a set of raw home videos and aligns them with user-supplied incidental music based on the content of the video and incidental music. Finally, it introduces a method for interactive image retrieval using query feedback. It learns the user query as well as the correspondence between high-level user concepts and their low-level machine representation by performing retrievals according to multiple queries supplied by the user during the course of a retrieval session.
With desktop imaging devices becoming ubiquitous, effectively managing the images in large collections has become a challenge. The requirements for a modem imaging system now demand not only efficient storage (low bit...
详细信息
ISBN:
(纸本)0819439886
With desktop imaging devices becoming ubiquitous, effectively managing the images in large collections has become a challenge. The requirements for a modem imaging system now demand not only efficient storage (low bit rate coding), but also easy manipulation, indexing and retrieval of images. In this paper, we introduce a new method for colour image coding based on a visual appearance model of local colour image patterns. The visual appearance of small image patterns is characterised by their spatial pattern, colour direction and local energy strength. To encode the local visual appearance, an approach based on vector quantisation (VQ) is introduced. A separate VQ is designed for the spatial pattern and colour direction respectively. It is shown that the method not only achieves good image coding results in terms of rate distortion criterion, it also enables content-based retrieval to be performed in the compressed domain easily and conveniently.
Pre-processing algorithms improve on the performance of a video compression system by removing spurious noise and insignificant features from the original images. This increases compression efficiency and attenuates c...
详细信息
ISBN:
(纸本)0819439886
Pre-processing algorithms improve on the performance of a video compression system by removing spurious noise and insignificant features from the original images. This increases compression efficiency and attenuates coding artifacts. Unfortunately, determining the appropriate amount of pre-filtering is a difficult problem, as it depends on both the content of an image as well as the target bit-rate of compression algorithm. In this paper, we explore a pre-processing technique that is loosely coupled to the quantization decisions of a rate control mechanism. This technique results in a pre-processing system that operates directly on the Displaced Frame Difference (DFD) and is applicable to any standard-compatible compression system. Results explore the effect of several standard filters on the DFD. An adaptive technique is then considered.
An image sequence stabilization system that removes translational jitter while preserving intentional camera pan is presented. The video sequence is processed to acquire global camera translations from frame to frame ...
详细信息
ISBN:
(纸本)0819439886
An image sequence stabilization system that removes translational jitter while preserving intentional camera pan is presented. The video sequence is processed to acquire global camera translations from frame to frame (global interframe motion vectors) by motion estimation. The resulting motion vectors an accumulated to construct an absolute frame position vs. frame number signal. This signal is low-pass filtered to remove high frequency components caused by jitter, and retain low frequency parts representing the intentional camera pan. Correction vectors for image frames are obtained by subtracting the absolute frame position from the low-pass filtered value, and stabilization is achieved by the corresponding translation of image frames.
New robust image-filtering algorithms based on RM point estimates and on the known KNN-filter technology are presented. These RM-KNN filters show a sufficiently high efficiency in rejecting pulse noise and preserving ...
New robust image-filtering algorithms based on RM point estimates and on the known KNN-filter technology are presented. These RM-KNN filters show a sufficiently high efficiency in rejecting pulse noise and preserving object boundaries and small features of the image. The filters were checked with test and real images typical of remote-sensing problems. The proposed filters provide a good visual quality of the filtered images and feature better characteristics than the standard median filter.
Block-based disparity compensation is an efficient prediction scheme for encoding multi-view image data. Available scene geometry can be used to further enhance prediction accuracy In this paper, three different strat...
详细信息
ISBN:
(纸本)0819439886
Block-based disparity compensation is an efficient prediction scheme for encoding multi-view image data. Available scene geometry can be used to further enhance prediction accuracy In this paper, three different strategies are compared that combine prediction based on depth maps and 3-D geometry. Three real-world image sets are used to examine prediction performance for different coding scenarios. Depth maps and geometry models are derived from the calibrated image data. Bit-rate reductions up to 10% are observed by suitably augmenting depth map-based with geometry-based prediction.
This paper presents a regularized smoothing algorithm for 3D reconstruction from image sequence. Depth data estimated from a stereo pair or multiple image frames can easily be corrupted by various types of noise such ...
详细信息
ISBN:
(纸本)0819439886
This paper presents a regularized smoothing algorithm for 3D reconstruction from image sequence. Depth data estimated from a stereo pair or multiple image frames can easily be corrupted by various types of noise such as quantization and imperfect matching. We propose a regularized image restoration algorithm which enhances the surface of depth maps based on spatially adaptive image fusion. We can also enhance the resolution of the surfaces and preserve discontinuities.
In this paper we investigate the use of a fully rate scalable wavelet codec known as SAMCoW (Scalable Adaptive Motion Compensated Wavelet) for use in robust video streaming. We develop a theory based on the notion of ...
详细信息
ISBN:
(纸本)0819439886
In this paper we investigate the use of a fully rate scalable wavelet codec known as SAMCoW (Scalable Adaptive Motion Compensated Wavelet) for use in robust video streaming. We develop a theory based on the notion of additive temporal distortion to predict the performance of the bit stream under error conditions. Due to the regular nature of SAMCoW a closed-form solution is found and compared experimentally to a SAMCoW stream in a simulated channel.
暂无评论