A computer simulation of an FFT-based digital LFM pulse compressor using vector floating point arithmetic is presented, showing the effects of retaining various number of mantissa bits at the input quantizer. Plots of...
详细信息
A computer simulation of an FFT-based digital LFM pulse compressor using vector floating point arithmetic is presented, showing the effects of retaining various number of mantissa bits at the input quantizer. Plots of the compressed pulse waveforms and rms and peak sidelobe levels as a function of the number of mantissa bits are shown.
Recently, Kumaresan and Tufts (KT) presented a method for estimating the parameters of damped exponential waveforms in additive white noise. The KT method uses singular value decomposition (SVD) of the data matrix, wi...
详细信息
Recently, Kumaresan and Tufts (KT) presented a method for estimating the parameters of damped exponential waveforms in additive white noise. The KT method uses singular value decomposition (SVD) of the data matrix, with truncation and backward prediction to improve the accuracy of the estimates. The KT method was demonstrated to have a very good performance, in comparison to traditional methods used for the same problem (e.g., Prony's method). Kumaresan and Tufts also showed, by numerical simulations, that the variances of the estimates obtained by their method approaches the Cramer-Rao lower bounds for selected test cases. In this correspondence, we provide a quantitative accuracy analysis of the KT method. The analysis is based on first-order Taylor series approximations of the estimated parameters around their true values. No assumptions are made on the number of data points used, but it is assumed that the noise level is small enough for the first-order approximations to be valid. Results of the analysis are illustrated by some numerical examples. These results confirm the good performance of the KT method, and show the effect of the user-chosen parameters on the accuracy of the estimates.
The adaptive predictivecoding with transform domain quantization (APC-TQ) technique was proposed by Bhaskar (1991) for the compression of audio signals. Since then, significant developments have taken place leading t...
详细信息
The adaptive predictivecoding with transform domain quantization (APC-TQ) technique was proposed by Bhaskar (1991) for the compression of audio signals. Since then, significant developments have taken place leading to a reduction in the coding rate. While enhancing the audio quality. These developments include (i) the use of block size adaptation to exploit the variations in the stationarity of the signal, (ii) high resolution spectral modeling using LPC analysis orders up to 64, and (iii) an adaptive bit-allocation procedure to minimize coding noise power as well as minimize the perception of coding noise. The result is a near transparent quality compression of 5 kHz bandwidth audio at a rate of 17 kbit/s. This technology will find applications in the distribution and transmission of AM quality audio programming over low rate channels such as the INMARSAT Standard A, B and aeronautical systems.< >
This investigation presents a speaker adaptation scheme which transforms the prototype speaker's Hidden Markov word models to those of a new speaker. Transformations are applied to both the state transition matrix...
详细信息
This investigation presents a speaker adaptation scheme which transforms the prototype speaker's Hidden Markov word models to those of a new speaker. Transformations are applied to both the state transition matrix and the probability distribution functions of a Hidden Markov word model. These transformations are optimized through maximizing the joint probability of a set of input pronunciations of the new speaker. Details of these parameter transformations and experimental verification are presented. The test uses a 210-word vocabulary with each having a four-state Hidden Markov word model. The test speaker consists of three males and two females with one male heavily accented. By having the system retrained up to a four-minute adaptation speech, a subset of the 210-word vocabulary, the performance shows an improvement of recognition accuracy from 22.5% to 92.1%.
A microprocessor realization for a linearpredictive vocoder is presented. The goal was a low-power, low-cost, compact special-purpose realization of a narrow-band speech terminal. The resultant design is a general-pu...
详细信息
A microprocessor realization for a linearpredictive vocoder is presented. The goal was a low-power, low-cost, compact special-purpose realization of a narrow-band speech terminal. The resultant design is a general-purpose two-bus structure running at a 150 ns cycle time, using as the basic signal processing element, four of the AMD 2901 CPE chips. This basic structure is augmented by a four-cycle multiplier to allow for sufficient signal processing power. The design concessions that mark the linear predictive coding microprocessor (LPCM) as a special-purpose machine designed to be a speech terminal are: limited I/O and limited memory. The present design requires 162 dual-in-line packages, dissipates less than 45 W and occupies about \fraclinear predictive codinglinear predictive coding ft 3 .
This article reports the design and implementation of a graphical display that presents an approximation to vocal tract area in real time for voiced vowel articulation. The acoustic signal is digitally sampled by the ...
详细信息
This article reports the design and implementation of a graphical display that presents an approximation to vocal tract area in real time for voiced vowel articulation. The acoustic signal is digitally sampled by the system. From these data a set of reflection coefficients is derived using linear predictive coding. A matrix of area coefficients is then determined that approximates the vocal tract area of the user. From this information a graphical display is then generated. The complete cycle of analysis and display is repeated at approximately 20 times/s. Synchronised audio and visual sequences can be recorded and used as dynamic targets for articulatory development. Use of the system is illustrated by diagrams of system output for spoken cardinal vowels and for vowels sung in a trained and untrained style.
A low cost voice response system is presented, which performs text-to-speech conversion of any English text. The system is built around an LPC synthesizer chip and a microprocessor. Text-to-allophone rules are used to...
详细信息
A low cost voice response system is presented, which performs text-to-speech conversion of any English text. The system is built around an LPC synthesizer chip and a microprocessor. Text-to-allophone rules are used to convert an input string of ASCII characters into allophonic codes. LPC parameters are then drawn from an allophone library, which takes very little storage space, and concatenated using a fast and simple algorithm to produce natural sounding speech.
Two speech compression systems based on codebooks of inverse filters produced by off-line linear predictive coding (LPC) and vector quantization (VQ) techniques are considered. The first system is a pitch excited voco...
详细信息
Two speech compression systems based on codebooks of inverse filters produced by off-line linear predictive coding (LPC) and vector quantization (VQ) techniques are considered. The first system is a pitch excited vocoder that is a variation on a speech coding system based upon vector quantization. The encoder selects an LPC reverse filter from a finite codebook that best "matches" an observed frame of sampled speech. This filter is in turn used to determine the voicing and digitized pitch information. Unlike LPC systems, the digitization is performed in a single step on the data rather than separate modeling and digitization steps. The second system is a tree encoding system that uses the filter selected by an inverse filter matching vocoder to "color" a tree that is then searched for a minimum distortion path for the original sampled speech waveform. This system can be viewed as a hybrid between an adaptive predictive coder and a universal tree encoder. The two systems are described, simulated, and compared with other similar systems.
A method for the extraction of features for pattern recognition by system identification is presented. A test waveform is associated with a parameterized process model (PM) which is an inverse filter. The structure of...
详细信息
A method for the extraction of features for pattern recognition by system identification is presented. A test waveform is associated with a parameterized process model (PM) which is an inverse filter. The structure of the PM corresponds to the redundant information in a waveform, and the parameter values correspond to the discriminatory information. The PM used in this research is a linearpredictive system whose parameters are the linearpredictive coefficients (LPC's). This technique is applied to feature extraction of electrocardiograms (ECG's) for differential diagnosis. The LPC's are calculated for each ECG and used as a feature vector in a hypergeometric affine N-space spanned by the LPC's. The efficacy of this feature extraction technique is tested by three different perturbation methods, namely noise, matrix distortion, and a newly developed method called directed distortion. Both the Euclidean and Itakura distances between feature vectors in N-space are shown in increase with increasing perturbation of the template waveform. The monotonic behavior of a distance measure is a necessary attribute of a valid feature space. Thus the perturbation analyses done in this research verify the viability of using the parameters of a process model as a feature vector in a pattern recognition scheme.
A single chip speech synthesizer was designed using a switched-capacitor multiplier to implement the LPC algorithm. The chip contains the LPC-10 filter, 20 kbit ROM, all control logic, a three-pole switched-capacitor ...
详细信息
A single chip speech synthesizer was designed using a switched-capacitor multiplier to implement the LPC algorithm. The chip contains the LPC-10 filter, 20 kbit ROM, all control logic, a three-pole switched-capacitor low-pass filter, and an audio amplifier capable of driving a speaker directly. The chip was fabricated in 5 μm CMOS technology and is 218 mils on the side.
暂无评论