Low power signal processing is critical in portable imaging applications where power translates directly to more images, longer videos, or lighter batteries. Complimentary-metal-oxide-semiconductor (CMOS) imagers are ...
Low power signal processing is critical in portable imaging applications where power translates directly to more images, longer videos, or lighter batteries. Complimentary-metal-oxide-semiconductor (CMOS) imagers are capable of lower power due to its ability to integrate analog to-digital conversion and logic circuitry on-chip. This dissertation presents an effective architectural solution to integrated digitalvideo (DV) videocompression that achieves a processing power efficiency of 42 giga-operations-per-Watt and allows for 23 hours of continual operation of a wireless imager generating compressed color video at 25 frames/second, 720 x 576 pixels per frame, and using a 5 Watt-hour Li-ion battery. First, an array of 40 identical processing elements (PEs) is chosen to support the 3.2 giga-operations-per-second computational requirement by partitioning the sensor array into distinct regions each clocked at 80 mega-Hertz. The low clocking requirement allows individual PEs to operate at a supply voltage of l.2 Volts in a 0.18 micron CMOS process, which results in a power reduction by a factor of two. Second, a hierarchical memory architecture is implemented to reduce power and to sustain the data throughput required for real-time processing. The memory systems power is reduced by more than a factor of four through an efficient mapping of the storage and access requirements to different levels of the memory hierarchy. Two memory technologies, the embedded DRAM and the SRAM, are studied for this mapping to take advantage of their area and access characteristics. Finally, parallel PEs are well suited for image and video processing where high instruction-level parallelism (ILP) exists. A non-negligible portion of DV compressionalgorithms are characterized by low ILP, which complicates instruction issuing. This is solved by introducing a dual-streaming mechanism for controlling the processor array in which a second control stream is used to feed coefficients as well as t
Content-based video indexing and retrieval has its foundations in the analyses of the prime video temporal structures. Consequently, technologies for video segmentation and key-frame extraction have become crucial for...
详细信息
Content-based video indexing and retrieval has its foundations in the analyses of the prime video temporal structures. Consequently, technologies for video segmentation and key-frame extraction have become crucial for the development of advanced digitalvideo systems. Conventional algorithms for video partitioning and key-frame extraction are mainly implemented autonomously. By focusing the analysis on compressed video features, this paper introduces a real-time algorithm for scene change detection and key-frame extraction that generates frame difference metrics by analysing the statistics of the macro-block features extracted from an MPEG compressed stream. The key-frame extraction method is implemented using difference metrics in curve simplification by means of a discrete contour evolution algorithm. This approach resulted in a fast and robust algorithm. Results of computer simulations are reported.
The idea of distance education, or learning at home has become very popular in recent years. Most major Universities offer classes on-line, and several smaller institutes exclusively offer distance classes. The distan...
详细信息
ISBN:
(纸本)9628576623
The idea of distance education, or learning at home has become very popular in recent years. Most major Universities offer classes on-line, and several smaller institutes exclusively offer distance classes. The distance courses can be offered either by mail (videotapes and books) or over the Internet (Web pages). Web-based courses are limited by bandwidth restrictions. This makes complex, interactive presentation of information virtually impossible. videotapes are able to present complex information, however the production and distribution cost of videotapes makes the idea very prohibitive. This paper presents a system for compressing video lectures for Internet transmission, without losing classroom information. In addition, we present algorithms that improve the interactivity of the presentation by use of optical character recognition and speech recognition technologies.
Whereas current debate concerns more object-oriented technologies, such as the MPEG-4 standardization process, this paper adresses the state of the art of MPEG-2 technology. The MPEG-2 standard was adopted four years ...
详细信息
Whereas current debate concerns more object-oriented technologies, such as the MPEG-4 standardization process, this paper adresses the state of the art of MPEG-2 technology. The MPEG-2 standard was adopted four years ago and is currently used for broadcasting digital television worldwide. Some comments and requirements from the first field trials require a new generation of MPEG-2 compression systems, and it was the objective of this paper to try to give some inputs for them. Whereas the first generations were mainly an optimization of the TM5 model with new motion estimators, new bit-rate control strategies and sometimes new mode selection algorithms, this new generation will be adaptive to the picture content and complexity. This adaptation concerns not only the input data (such as picture resolution, noise reduction) or the encoder itself(GOP structure, local adaptation, multipass encoding), but also the bandwidth (statistical multiplexing). From the results presented here, a 20 to 30% bit savings can be expected on average, and locally more than 50% (for special cases where the encoders fails, i.e,. noise, scene cuts, fading). After being in the cost and system software, the competition is now clearly on compression efficiency to add as many programs as possible into one single channel. The new techniques developed and introduced in this phase of optimization will certainly be exploited in future standards such as MPEG-4 or MPEG-7 where segmentation and scene characterization, respectively, will be key technologies.
During the past decade the living style of many people has changed quite a lot due to the rapid progresses in multimedia presentation and communications and the blooming of the Internet. As indispensable components of...
详细信息
During the past decade the living style of many people has changed quite a lot due to the rapid progresses in multimedia presentation and communications and the blooming of the Internet. As indispensable components of multimedia, image and video are capable of carrying huge amount of information. This makes it absolutely necessary to compress the image and video signals. In this thesis, we will present contributions in three different sub-areas of video and image compression namely, Vector Quantization (VQ), arbitrarily shaped Discrete Cosine Transform (DCT) and transcoder with arbitrarily sampling rate alternate. Vector quantization is theoretically attractive due to the results from rate-distortion theory. It does yield superior performance over scalar quantization schemes. Despite this, however, a big gap exists between the performance predicted by the theory and that which can actually be achieved. This is due to the failure to observe the two basic assumptions of rate-distortion theory, namely, that the block length can be infinitely large, and the source is stationary. In this thesis, two methods that can amend this big gap are presented. The two methods are dedicated to combating the invalidity of the two assumptions, respectively. A window-based fast search algorithm is proposed to speed up the VQ process. The MPEG-4 standard provides some key technologies that enable interactivity between the user and video objects. One of the prerequisite conditions is the ability to encode arbitrarily shaped video objects. Computationally complex shape-adaptive DCT algorithms have been proposed in the literature. An invention that can achieve shape-adaptivity via padding and regular base-N DCT (N is a power of 2) is presented in this thesis work. The biggest benefit is that many available DCT chipsets and fast algorithms can be used directly to handle arbitrary shapes. A rate-optimization technique is proposed in order to further improve the coding performance. More and m
Perceptual encoding of high-quality audio has found its way to many applications including digital radio, electronic music distribution (EMD) systems and portable audio devices. An overview on the basics of high-quali...
详细信息
Perceptual encoding of high-quality audio has found its way to many applications including digital radio, electronic music distribution (EMD) systems and portable audio devices. An overview on the basics of high-quality low-bitrate audio coding is followed by a look into currently widely used and newer, state-of-the-art coding systems like MP3 and MPEG-2 advanced audio coding (AAC). The rapid deployment of older (1992) technologies (like MP3) followed by the news of new and improved algorithms (like AAC) raises questions about future improvements. The paper analyses some candidates for such improvements and provides a view of some current research activities.
Perceptual encoding of high quality audio has found its way to many applications including digital radio, electronic music distribution (EMD) systems and portable audio devices. An overview on the basics of high quali...
详细信息
Perceptual encoding of high quality audio has found its way to many applications including digital radio, electronic music distribution (EMD) systems and portable audio devices. An overview on the basics of high quality low bitrate audio coding is followed by a look into currently widely used and newer, state-of-the-art coding systems like MP3 and MPEG-2 Advanced Audio Coding (AAC). The rapid deployment of older (1992) technologies (like MP3) followed by the news of new and improved algorithms (like AAC) raises the question about future improvements. The paper analyses some candidates for such improvements and provide a view of some current research activities.
In this paper, we seek to provide an introduction to the fast-moving field of digitalvideo on the Internet, from the viewpoint of the biological microscopist who might wish to store or access videos, for instance in ...
详细信息
In this paper, we seek to provide an introduction to the fast-moving field of digitalvideo on the Internet, from the viewpoint of the biological microscopist who might wish to store or access videos, for instance in image databases such as the BioImage Database (http://***),We describe and evaluate the principal methods used for encoding and compressing moving image data for digital storage and transmission over the Internet, which involve compromises between compression efficiency and retention of image fidelity, and describe the existing alternate software technologies for downloading or streaming compressed digitized videos using a Web browser. We report the results of experiments on video microscopy recordings and three-dimensional confocal animations of biological specimens to evaluate the compression efficiencies of the principal videocompression-decompressionalgorithms (codecs) and to document the artefacts associated with each of them. Because MPEG-1 gives very high compression while yet retaining reasonable image quality, these studies lead us to recommend that video databases should store both a high-resolution original version of each video, ideally either uncompressed or losslessly compressed, and a separate edited and highly compressed MPEG-1 preview version that can be rapidly downloaded for interactive viewing by the database user. (C) 1999 Academic Press.
暂无评论