Dynamic textures are image sequences which contain spatial-varying and time varying phenomenon. There are many examples of visual patterns in the real-world scenes that are called dynamic textures. In this paper, we p...
详细信息
Dynamic textures are image sequences which contain spatial-varying and time varying phenomenon. There are many examples of visual patterns in the real-world scenes that are called dynamic textures. In this paper, we propose a novel analytical approach for segmentation of dynamic textures. The key idea is to exploit the properties of both frame texture and motion. In our approach, we employ linear predictive coding to extract special features from sequences of mixtures of dynamic texture.
The use of the SIMD (single instruction stream-multiple data stream) mode of parallelism to perform linear predictive coding analysis is explored. Parallel algorithms for the autocorrelation formulation of linear pred...
详细信息
The use of the SIMD (single instruction stream-multiple data stream) mode of parallelism to perform linear predictive coding analysis is explored. Parallel algorithms for the autocorrelation formulation of linear prediction are presented and analyzed. The algorithms are evaluated in terms of the number of arithmetic operations and interprocessor data transfers required.
A hybrid approach in determining the excitation vector in a low-delay code excited linearpredictive coder is proposed. By a judicious division of the composite excitation vector into long-term and short-term componen...
详细信息
A hybrid approach in determining the excitation vector in a low-delay code excited linearpredictive coder is proposed. By a judicious division of the composite excitation vector into long-term and short-term components, and the use of switched quantisation, substantial improvement in coding quality is obtained.
An electrocardiogram (ECG) reconstruction method based on a linear prediction technique is proposed in this paper. The method can reconstruct a rather long missing parts of ECG signals. Each missing data segment may c...
详细信息
An electrocardiogram (ECG) reconstruction method based on a linear prediction technique is proposed in this paper. The method can reconstruct a rather long missing parts of ECG signals. Each missing data segment may cover 1 to 8 beats. The data used in the experiments are from the MIT-BIH normal sinus rhythm database. The experimental results show that our method can perform very well. The reconstructed signals are visually very close to the ground truths. The numerical evaluation also shows that the proposed method yields good results on the heart rate variability (HRV) measure derivation. It gives the time-domain HRV measures that are very close to the ground truths. Its performance is also better than the method commonly used by experts in which the abnormal beats are removed before calculating the HRV measures.
A NMOS vocoder IC, compatible with the LPC-10-2400-b/s speech coding standard, is described. The IC implements an adaptive linear predictive coding (LPC) spectral analyzer, a pitch decoder using Gold's algorithm, ...
详细信息
A NMOS vocoder IC, compatible with the LPC-10-2400-b/s speech coding standard, is described. The IC implements an adaptive linear predictive coding (LPC) spectral analyzer, a pitch decoder using Gold's algorithm, and a speech synthesizer. The algorithms, architecture, and circuit design methods have been clearly optimized to allow a single-chip implementation.
This paper is about the reduction of the computational complexity of a speech codec. A linear predictive coding procedure is developed to allow its implementation with number theoretic transforms. The use of fermat nu...
详细信息
This paper is about the reduction of the computational complexity of a speech codec. A linear predictive coding procedure is developed to allow its implementation with number theoretic transforms. The use of fermat number transform can reduce, in a significant way, the cost of linearpredictive algorithm implantation on digital signal processor.
Systems for high-quality low-bit-rate speech coding based on linear prediction, such as CELP, multipulse LPC, and self-excited vocoders (SEV) are among the best techniques for coding audio at medium-to-low bit rates (...
详细信息
Systems for high-quality low-bit-rate speech coding based on linear prediction, such as CELP, multipulse LPC, and self-excited vocoders (SEV) are among the best techniques for coding audio at medium-to-low bit rates (9.6 kb/s to 4.8 kb/s). For rates of 4.8 kb/s and below, vector quantization (or block coding methods) applied to the excitation signal has been shown to be very promising. An approach is presented that is similar to the CELP and SEV systems in that it uses an analysis-by-synthesis scheme but differs greatly in its method of representing the excitation information of the audio signal. In particular, the excitation system is based on a multistate model where the state transition parameters are determined by an external classifier. Experiments are reported that demonstrate the feasibility of the class structuring approach used and provide some basic information about the system's adaptation processes.< >
A near-optimum linear-predictive speech coding scheme is proposed. Near-optimum performance is defined in terms of minimizing a perceptually weighted mean-squared-error distortion measured directly between the origina...
详细信息
A near-optimum linear-predictive speech coding scheme is proposed. Near-optimum performance is defined in terms of minimizing a perceptually weighted mean-squared-error distortion measured directly between the original speech and the reconstructed speech. This is achieved by an efficient method which involves iteratively applying closed-loop analysis methods (or analysis-by-analysis methods) to the spectrum filter and the excitation signal. It is shown that this method can be applied to any linearpredictive coder (LPC) with any perceptual distortion measure and it is especially useful for low-rate speech coders (less than 4.8 kb/s) where analysis-by-synthesis techniques become essential.< >
The paper presents a high quality harmonic excitation linearpredictive (HE-LPC) speech coder operating at 2 kb/s based on a harmonic excitation model with two bands. The system incorporates novel features such as: co...
详细信息
ISBN:
(纸本)0780386787
The paper presents a high quality harmonic excitation linearpredictive (HE-LPC) speech coder operating at 2 kb/s based on a harmonic excitation model with two bands. The system incorporates novel features such as: combined pitch detection; residual harmonic matching voicing determination; extraction and interpolation of residual harmonic magnitudes. Subjective listening tests indicate that this coder has the same quality as that of the Federal Standard MELP (mixed excitation linear prediction) coder at 2.4 kb/s, whether the training database is from Chinese or English.
The authors report on the use of the codebook-excited linear-predictive (CELP) algorithm for 32 kb/s low-delay (LD-CELP) coding of wideband speech. The main problem associated with wideband coding, namely, spectral no...
详细信息
The authors report on the use of the codebook-excited linear-predictive (CELP) algorithm for 32 kb/s low-delay (LD-CELP) coding of wideband speech. The main problem associated with wideband coding, namely, spectral noise weighting, is discussed. The authors propose an enhanced noise weighting technique and demonstrate its efficiency via subjective listening tests. In these tests, involving 20 listeners and 8 test sentences, the average rating for the proposed 32 kb/s LD-CELP was essentially equal to that of the 65 kb/s standard (G.722) CCITT wideband coder.< >
暂无评论