Block matching has been used for motion estimation and motion compensation in MPEG standards for years. While it has an acceptable performance in describing motion between frames, it requires quite a few bits to repre...
详细信息
Block matching has been used for motion estimation and motion compensation in MPEG standards for years. While it has an acceptable performance in describing motion between frames, it requires quite a few bits to represent the motion vectors. In certain circumstances, the use of whole frame affine motion models would perform equally well or even better than block matching in terms of motion accuracy, while it results in the coding of only 6 parameters. In this paper, we modify an MPEG-4 codec by adding: (1) 6 affine model parameters to the frame header; and (2) mode selection among INTRA, SKIP, INTER-16/spl times/16, INTER-8/spl times/8, and GLOBAL-AFFINE modes by Lagrange optimal rate-distortion criteria. Simulation results demonstrate 10-20% decrease in bit-rate, compared to the MMS codec for an average coded P-frame with the same reconstruction PSNR.
The move from analog to digital media in recent years has given rise to the flourishing of digital signalprocessing techniques. In addition to the many benefits of this flourishing, the plenitude and sophistication o...
详细信息
The move from analog to digital media in recent years has given rise to the flourishing of digital signalprocessing techniques. In addition to the many benefits of this flourishing, the plenitude and sophistication of these techniques has caused learning and teaching the essence of this subject to be a more difficult task. In this paper, a software tool for teaching multimedia signalprocessing is presented. This software tool is intended for assisting teaching and demonstrating speech and audio signalprocessing techniques. Using this tool one can select from a variety of techniques and parameters in a straightforward and simple way. The results of this processing can be played and viewed, numerically and graphically, in order to ease the analysis of the data. The purpose of this tool is to enable easy experimentation and demonstration of different algorithms and thus improve the quality of teaching.
The characteristic (transfer function) of a dispersive M-ary channel equalizer designed through a Bayesian estimator, a performance indicator that is not trivial to obtain for M>2 due to intersymbol interference (I...
详细信息
The characteristic (transfer function) of a dispersive M-ary channel equalizer designed through a Bayesian estimator, a performance indicator that is not trivial to obtain for M>2 due to intersymbol interference (ISI), is investigated. A set of curves is obtained and interpreted. Implementation through a radial basis function neural network is considered. It is shown that because network centers endure updating with different rates, the equalizer characteristic errates off the optimum, causing thus the symbol error rate and/or the training time to increase. A solution, based on incorporating an underlying symmetry in the channel response levels into the updating algorithm, brings the characteristic uniformly closer to the optimal one. It is also provided a strategy endowing the network with self-initializing. Simulation results are presented for a channel with sufficient ISI strength.
This paper proposes a novel fast architecture for two-dimensional discrete wavelet transform by using lifting scheme. The parallel and embedded decimation techniques are employed to optimize the architecture, which is...
详细信息
This paper describes a new corner detection algorithm based on the Radon Transform. The basic idea is to find the straight lines in the images and then search for their intersections, which are the corner points of th...
详细信息
This paper describes real time object tracking of 3D objects in 2D image sequences. The moving objects are segmented by the method of differential image followed by the process of morphological dilation. The moving ob...
详细信息
The residue number system (RNS) has computational advantages in addition and multiplication compared with weighted number systems, such as the binary number system (BNS), since operations on residue digits are perform...
详细信息
The residue number system (RNS) has computational advantages in addition and multiplication compared with weighted number systems, such as the binary number system (BNS), since operations on residue digits are performed independently and these processes can be performed in parallel. Thus they are widely used in digital signalprocessing etc. Since residue to binary conversion is critical and difficult for the practicality of RNS, in this paper, a novel residue to binary (R/B) conversion algorithm for the restricted moduli set (2/sup n/ -1, 2/sup n/, 2n+1), based on exploring the periodicity of modulo (2/sup n/ /spl plusmn/ 1) operations is presented. A new 2n-bit adder based R/B converter is also proposed. The performance comparison results demonstrate that the new converter is faster and requires less area compared with the others reported in the previous literature.
With rapidly increasing storage and computational capacity, a common PC can store and index hundreds of hours of speech. This suggests that new approaches based on database techniques might be useful in speech recogni...
详细信息
With rapidly increasing storage and computational capacity, a common PC can store and index hundreds of hours of speech. This suggests that new approaches based on database techniques might be useful in speech recognition and speech indexing. This paper presents a first step in such a direction. The algorithm developed relies on an indexed single-speaker database. The database consists of spoken utterances transcribed into text. The waveforms of these utterances are converted off-line to binary symbols called fingerprints through a nonlinear frequency-domain transform. The fingerprints are associated with the transcribed text. Given the fingerprint of a new waveform, the best word match from the database can be retrieved. A 3255 word database is used as a test bed. All the words from this database are mixed with white noise and time-scale modified to provide test data. The database is queried with the fingerprint of the test words and the best match is retrieved. The results of the experiments conducted are promising, showing a 99.5% recognition rate for a 20 dB signal to noise ratio (SNR).
In this paper, we propose a new automated approach to extract the centerlines from 2-D angiography. The centerline extraction is the basis of 3-D reconstruction of the blood vessels, so the accurate localization of ce...
详细信息
An important class of radiometric degradations we are faced with often in practice is image blurring. Special attention is paid to the recognition of the blurred image by moment invariant approach. Some important rule...
详细信息
ISBN:
(纸本)0780385543
An important class of radiometric degradations we are faced with often in practice is image blurring. Special attention is paid to the recognition of the blurred image by moment invariant approach. Some important rules of complex moments for the blurred image are presented. Based on these rules, a useful subset of moment invariants is introduced, that are not affected by the blur, rotation, scale, and translation of the images. The experiments have shown that these invariants can be successfully used in recognition of the blurred image.
暂无评论