This paper describes a new wavelet-based approach to the motion estimation problem for digital video. A complex-valued discrete wavelet transform is used to decompose each frame into a subsampled, directionally bandpass-filtered hierarchy. The transform is defined so that at each level there is an approximate correspondence between local translation and coefficient phase shift. This relationship is used to estimate motion within each orientation subband. The estimates are combined over all orientations and scales using a coarse-to-fine refinement strategy to produce a fractional-pel accurate motion field with a directional confidence measure. The technique is suitable for video compression schemes and can also be used for stereo vision and image registration.
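The correspondence between local translation and coefficient phase shift can be illustrated with a minimal one-dimensional sketch. It is not the paper's complex wavelet transform: it assumes a single complex Gabor filter at one scale, and the names `gabor_response`, `estimate_shift`, `OMEGA0`, and `TRUE_SHIFT` are hypothetical. The filter is applied to two "frames" at the same position, and the phase difference of the two responses, divided by the filter's centre frequency, gives a fractional-sample shift estimate.

```python
import cmath
import math


def gabor_response(samples, centre, omega0, sigma):
    """Complex Gabor filter response of `samples` around index `centre`."""
    radius = int(4 * sigma)  # truncate the Gaussian window at 4 sigma
    total = 0j
    for k in range(-radius, radius + 1):
        weight = math.exp(-(k * k) / (2.0 * sigma * sigma))
        total += samples[centre + k] * weight * cmath.exp(-1j * omega0 * k)
    return total


def estimate_shift(frame_a, frame_b, centre, omega0, sigma=4.0):
    """Estimate how far frame_b is translated relative to frame_a.

    Assumes the local signal energy lies near omega0, so a translation d
    appears as a phase shift of about omega0 * d. The estimate is only
    unambiguous for |d| < pi / omega0 because the phase wraps.
    """
    response_a = gabor_response(frame_a, centre, omega0, sigma)
    response_b = gabor_response(frame_b, centre, omega0, sigma)
    phase_difference = cmath.phase(response_a * response_b.conjugate())
    return phase_difference / omega0


# Illustrative test signal: a sinusoid at the filter's centre frequency,
# and the same sinusoid translated by a fraction of a sample.
OMEGA0 = math.pi / 4
TRUE_SHIFT = 0.3
frame_a = [math.cos(OMEGA0 * n) for n in range(100)]
frame_b = [math.cos(OMEGA0 * (n - TRUE_SHIFT)) for n in range(100)]

estimated_shift = estimate_shift(frame_a, frame_b, centre=50, omega0=OMEGA0)
print(f"estimated shift: {estimated_shift:.3f} samples")
```

A full estimator along the lines of the abstract would repeat this per orientation subband and refine the result from coarse to fine scales; the sketch covers only the single phase-to-displacement step.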
This paper presents a video compression format for HDTV video to be used in studio applications. Digital HDTV signals need to be compressed in order to be routed or stored within a studio. The bit rates targeted support the use of viable networking technologies such as SMPTE-259 and OC-3. The requirements of a studio are considered in the choice of the format. These requirements include excellent picture quality, good quality even in the presence of cascaded coding, adequate sampling of the chroma signal, support for studio operations such as chroma-keying, and frame- or field-accurate editing. We will also examine techniques for switching between highly compressed video signals, a requirement in the studio, both for production purposes and for transitioning between programs or commercials.
This paper discusses novel multimedia (video and audio) techniques for accomplishing equipment test and diagnosis in the depot and tactical environment of the soldier technician. This approach can enhance diagnostic capability in ways that cannot be matched through any other currently deployed technology. The paper begins by describing the test environment and the requirements of the soldier technician necessitating the use of multimedia techniques. Examples of multimedia use in ATE are cited that include providing the technician with dynamic visual clues pertaining to system assembly, disassembly, and operation in live motion, and using high quality audio to assist the technician in detecting and discerning sound clues pertaining to faulty system operation. A description of current video compression and audio technologies is provided. Efforts currently underway to implement multimedia in current ATE systems are also described. This paper concludes with a discussion of future directions and trends in this area.
This paper presents the trade-off made during the IC partitioning of the source-decoding subsystem of a digital set-top box, which leads to a solution with a single-chip MPEG-2 audio/video/graphics decoder that reduces the overall memory used. The architecture of the decoder is then described, with emphasis on the on-chip communications.
The two-dimensional discrete cosine transform (DCT) is probably the most popular transform in image and video coding systems, as it achieves a very high compression ratio with fast implementations. Various still-image and video compression algorithms using this block-based transform coding method cause an artifact, called blocking, in the reconstructed images of a video sequence. This artifact is more prominent at low bit rates. A new eight-neighbour symmetric post-filtering technique is introduced in this paper to reduce the blocking effect commonly found in the reconstructed images of block-based transform coding schemes. In this technique, the weights of the eight neighbouring pixels of the pixel of interest are determined by a modified Lagrange interpolation algorithm. The proposed technique reduces the blocking effect while maintaining the sharpness of the reconstructed image. The results obtained by the proposed technique are compared with those obtained by several well-known neighbourhood low-pass filters: Reeve and Lim's method, the Laplacian eight neighbour accounting filter (LENAF), the Laplacian four-neighbour filter with + form, the Laplacian four-neighbour filter with x form, and a two-neighbour filtering method. These comparisons use the following measures: peak signal-to-noise ratio (PSNR), a block-edge impairment metric, and subjective quality evaluation.
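Of the three measures named above, PSNR has a standard definition that can be sketched directly. The example below assumes 8-bit samples (peak value 255) and images given as lists of rows; the function name `psnr` and the sample values are illustrative and not taken from the paper. The block-edge impairment metric and the post-filter weights are not specified in the abstract, so they are not reproduced here.

```python
import math


def psnr(reference, reconstructed, peak=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized images.

    Images are lists of rows of pixel values; `peak` is the largest
    possible sample value (255 for 8-bit video, assumed here).
    """
    squared_error = 0.0
    count = 0
    for ref_row, rec_row in zip(reference, reconstructed):
        if len(ref_row) != len(rec_row):
            raise ValueError("rows must have the same length")
        for ref_pixel, rec_pixel in zip(ref_row, rec_row):
            squared_error += (ref_pixel - rec_pixel) ** 2
            count += 1
    if count == 0 or len(reference) != len(reconstructed):
        raise ValueError("images must be non-empty and the same size")
    mean_squared_error = squared_error / count
    if mean_squared_error == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(peak * peak / mean_squared_error)


# Illustrative 2x2 example: every pixel is off by one grey level.
original = [[100, 100], [100, 100]]
decoded = [[101, 99], [101, 99]]
print(f"PSNR: {psnr(original, decoded):.2f} dB")
```

PSNR alone can rate a smoothed image highly even when block edges remain visible, which is presumably why the abstract pairs it with an edge-specific metric and subjective evaluation.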
CCD image sensors are capable of delivering at least 600% of linear video to the signal processing. This headroom is needed to reproduce large scene contrasts convincingly on displays that cannot handle such contrasts. Today's cameras have 10-bit A/D conversion in the video path. This calls for some form of nonlinear analog processing, such as (pre-)gamma or (pre-)knee, to handle these 600% signals. It also weakens the well-known arguments of better accuracy and stability for these digital cameras: nonlinear circuits such as knee, gamma, and white-shading multipliers are not the most accurate and stable analog circuits. The move to 12-bit A/D conversion in the camera allows digital video processing with sufficient dynamic range, with all nonlinear processing in the digital signal path. This also opens the way to reconsider processing, given the new possibilities of the digital domain. Examples will be given in the paper. Some items that will be addressed are dynamic pixel correction, automatic skin contours, and a new digital highlight compression with improved color fidelity. The architecture of a 12-bit camera, the Philips BTS LDK20P, will be explained in the paper, together with the algorithms used.
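A rough arithmetic sketch shows why the 600% headroom strains a 10-bit converter. It assumes purely linear coding from black (code 0) to 600% at full scale, with no black offset or reserved codes; these assumptions, and the name `codes_up_to_nominal_white`, are illustrative and not taken from the paper.

```python
import math


def codes_up_to_nominal_white(bits, headroom_percent=600.0):
    """Code values available between black and 100% video.

    Assumes linear A/D coding in which full scale corresponds to
    `headroom_percent` of nominal white (an illustrative simplification).
    """
    full_scale = 2 ** bits - 1
    return full_scale * 100.0 / headroom_percent


for bits in (10, 12):
    codes = codes_up_to_nominal_white(bits)
    print(f"{bits}-bit A/D: about {codes:.1f} codes for 0-100% video "
          f"(roughly {math.log2(codes):.1f} bits)")
```

Under these assumptions a 10-bit converter leaves about 170 codes for the nominal 0-100% range, while a 12-bit converter leaves about 682, four times as many.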
We have developed a 1/2-in. high-definition television (HDTV) digital VTR for program production, editing, and broadcasting that incorporates bit rate reduction (BRR) technology. The objectives in developing the HDTV VTR are high picture quality, editing and trick-play functions equal to those of conventional definition television (CDTV) VTRs, and a low running cost. Based on the results of a subjective assessment test using compressed moving pictures, we have employed an intra-field, fixed-length discrete cosine transform (DCT) in BRR and set the compression factor at 4.1 for 8-bit video signal quantization and at 5.4 for 10-bit quantization. To hold down the cost of development and bring the system into use early, we developed an HD processor, which encodes and decodes HDTV video signals, and added it to a D-5 VTR used in CDTV. The newly developed BRR HDTV VTR provides improved picture quality as well as effective error concealment through the incorporation of such new technologies as adaptive processing, overlap blocking, and individual sync block allocation for low/high frequency component codes.
Progress in optical information processing is driven by two sources: optoelectronic technology benefits from the development of compound semiconductor circuit technology, and the continued growth of electronic chip speed and density increases the need for new interconnect technologies. We advocate the use of these advances for solving complex low-level image processing problems using dedicated optoelectronic processor arrays. These massively parallel, special-purpose machines would not be conceivable without a suitable combination of optics with high-density hybrid (digital and analog, electronic and optical) smart pixels. One optical operation of particular interest here is the generation of random number arrays for the implementation of stochastic algorithms. We describe the general structure of optoelectronic "smart retinae" that combine laser speckle, optical convolution, and VLSI circuits to implement parallel simulated annealing. We report on our present results, which demonstrate one simple case of such an operation performing parallel simulated annealing at video rate.
ISBN: (print) 0819417661
In this paper we review the Split-Merge video displacement estimation technique and show how this can be used, given a classification with integer labels of the objects in the first frame of a video sequence, to track this classification along the sequence while doing motion estimation.