In this demonstration session, we will demonstrate several examples of the state-of-the-art videocompression standards MPEG-2, MPEG-4 and H.263. These demonstrations captured some of the recent work developed in the ...
详细信息
Future desktop and portable computing systems will have as their core an integrated multimedia system. Such a system will seamlessly combine digitalvideo, digital audio, computer animation, text, and graphics. Furthe...
详细信息
ISBN:
(纸本)0819424323
Future desktop and portable computing systems will have as their core an integrated multimedia system. Such a system will seamlessly combine digitalvideo, digital audio, computer animation, text, and graphics. Furthermore, such a system will allow for mixed-media creation, dissemination, and interactive access in real time. Multimedia architectures that need to support these functions have traditionally required special display and processing units for the different media types. This approach tends to be expensive and is inefficient in its use of silicon. Furthermore, such media-specific processing units are unable to cope with the fluid nature of the multimedia market wherein the needs and standards are changing and system manufacturers may demand a single component media engine across a range of products. This constraint has led tot a shift towards providing a single-component multimedia specific computing engine that can be integrated easily within desktop systems, tethered consumer appliances, or portable appliances. In this paper, we review some of the recent architectural efforts in developing integrated media systems. We primarily focus on two efforts, namely the evolution of multimedia-capable general purpose processors and a more recent effort in developing single component mixed media cc-processors. Design considerations that could facilitate the migration of these technologies to a portable integrated media system will also be presented.
Linear filters banks are being used extensively in image and video applications. New research results in wavelet applications for compression and de-noising are constantly appearing in the technical literature. On the...
详细信息
ISBN:
(纸本)0819424374
Linear filters banks are being used extensively in image and video applications. New research results in wavelet applications for compression and de-noising are constantly appearing in the technical literature. On the other hand, non-linear filter banks are also being used regularly in image pyramid algorithms. There are some inherent advantages in using non-linear filters [1] instead of linear filters when non-Gaussian processes are present in images. However, a consistent way of comparing performance criteria between these two schemes has not been fully developed yet. In this paper a recently discovered tool, sample selection probabilities [16], is used to compare the behavior of linear and non-linear filters. In [16] the conversion from weights of order statistics (OS) filters to coefficients of the impulse response is obtained through these probabilities, However, the reverse problem: the conversion from coefficients of the impulse response to the weights of OS filters is not yet fully understood. One of the reasons for this difficulty is the highly non-linear nature of the partitions and generating function used in [16]. In the present paper the problem is posed as an optimization of integer linear programming subject to constraints directly obtained from the coefficients of the impulse response. Although the technique to be presented is not completely refined, it certainly appears to be promising. Some results will be shown.
The proceedings contains 71 papers. Following topics are discussed: video coding;pre- and postprocessing;image/video coding and processing;video transmission;medical and very high quality images;object-based motion co...
详细信息
ISBN:
(纸本)0819423564
The proceedings contains 71 papers. Following topics are discussed: video coding;pre- and postprocessing;image/video coding and processing;video transmission;medical and very high quality images;object-based motion compensation;motion estimation and compensation;MPEG-2 optimization and applications;image compression;optimization, implementation, and applications of codecs;and fractal-based coding.
In this talk, I will describe two aspects of videocompressionalgorithms: the first area has to do with very low bit rate video coding, and the second one with scalable videocompression.
ISBN:
(纸本)0819420425
In this talk, I will describe two aspects of videocompressionalgorithms: the first area has to do with very low bit rate video coding, and the second one with scalable videocompression.
Developments in videocompression and VLSI technologies are producing revolutionary changes in the television industry. Television signals worldwide are beginning to be compressed, encoded and transmitted using digita...
详细信息
Developments in videocompression and VLSI technologies are producing revolutionary changes in the television industry. Television signals worldwide are beginning to be compressed, encoded and transmitted using digital means. Using digitalcompression, encoding and transmission techniques has the advantage of better bandwidth utilization, flexibility and ease of integration with other digital data, and an overall higher quality of video at the receiver. The purpose of this paper is two-fold. Firstly, an introduction to videocompression in the context of emerging digital television standards is provided. The important role played by International Standards, specifically MPEG, in the development of digital television systems is discussed. Secondly, current trends in the use of compression in the development of digital television for deployment in the United States and Europe, and the participation of Philips in these new applications, are discussed.
The large quantity of data associated with visual information necessitates the use of compression techniques. In this paper, we propose a novel approach (compressed domain technique) to implement spatial scalability d...
详细信息
ISBN:
(纸本)0819420425
The large quantity of data associated with visual information necessitates the use of compression techniques. In this paper, we propose a novel approach (compressed domain technique) to implement spatial scalability directly on the compressed image/video data. In contrast to the spatial domain technique (baseline for comparison), the proposed technique removes the unnecessary decompression and re-compression procedures. The computational complexity is greatly reduced by using certain approximations. We note that depending on the image/video content, only marginal quality degradation (almost unnoticeable subjectively) may be introduced. Simulation results confirm the substantial reductions in computational complexity of the proposed technique at a comparable performance to the spatial domain technique.
This paper describes an implementation of a software H.261 codec for PC, that takes an advantage of the fast computational algorithms for DCT-based videocompression, which have been presented by the author at the Feb...
详细信息
ISBN:
(纸本)0819420425
This paper describes an implementation of a software H.261 codec for PC, that takes an advantage of the fast computational algorithms for DCT-based videocompression, which have been presented by the author at the February's 1995 SPIE/IS&T meeting. The motivation for developing the H.261 prototype system is to demonstrate a feasibility of real time software- only videoconferencing solution to operate across a wide range of network bandwidth, frame rate, and resolution of the input video. As the bandwidths of current network technology will be increased, the higher frame rate and resolution of video to be transmitted is allowed, that requires, in turn, a software codec to be able to compress pictures of CIF (352 X 288) resolution at up to 30 frame/sec. Running on Pentium 133 MHz PC the codec presented is capable to compress video in CIF format at 21 - 23 frame/sec. This result is comparable to the known hardware-based H.261 solutions, but it doesn't require any specific hardware. The methods to achieve high performance, the program optimization technique for Pentium microprocessor along with the performance profile, showing the actual contribution of the different encoding/decoding stages to the overall computational process, are presented.
A crucial operation in image and video processing applications is affine transforms. Typical applications of affine transforms include fractal block coding, camera operation detection, affine motion estimation, etc. A...
详细信息
ISBN:
(纸本)0819420425
A crucial operation in image and video processing applications is affine transforms. Typical applications of affine transforms include fractal block coding, camera operation detection, affine motion estimation, etc. Affine transforms involve complex operations and are hence difficult to implement in real-time. In this paper, we present a novel architecture for real-time implementation of affine transforms. First, we derive two fundamental operations from affine transforms and then propose an efficient method of implementing these operations. As an example of the application of ATP (Affine Transform Processor), we propose a high performance videocompression algorithm mapped onto the proposed architecture. This algorithm is based on combined affine transform and vector quantization (ATVQ), where the infra-frame and inter-frame redundancy in the video sequence are exploited through piecewise self-similarity on a block-wise basis within a frame and between frames. ATVQ has the advantages of superior coding performance at a significantly reduced computational complexity. ATVQ has been mapped onto the ATP and real-time execution is demonstrated using a VHDL (VHSIC Hardware Description Language) implementation of ATP.
Block matching algorithms (BMAs) are often employed for motion estimation (ME) in video coding. Most conventional BMAs treat the ME problem as an optimization problem and employ certain search schemes to find a soluti...
详细信息
ISBN:
(纸本)0819420425
Block matching algorithms (BMAs) are often employed for motion estimation (ME) in video coding. Most conventional BMAs treat the ME problem as an optimization problem and employ certain search schemes to find a solution. Except the time-consuming full search algorithms (FS), other fast algorithms such as the three step search (TSS), searching on a reduced search range, cannot guarantee optimal solutions, i.e., search is often trapped at local minima; the ME results are thus usually unsatisfactory. Few of them makes use of the information inherent in the images explicitly. We propose a new ME algorithm which can reduce the search range while guaranteeing global optimality in most cases. Microblock visual patterns are designed to extract edge information to guide block matching: searching is only carried out at places where the real match most likely happens; that is where similar edge features present. The proposed algorithm obtains a speed about 7 - 8 times as fast as that of FS with same search range. The prediction quality is very close to that of FS and much better than that of TSS. It can produce MPEG-1 or MPEG-2 compatible motion vectors, and can be extended to model-based ME. It is suitable for parallel implementation too. Moreover, the visual patterns are potential resources for video indexing to facilitate content-based information retrieval, which is important for applications such as VOD.
暂无评论