The design of the half-rate coder for the North-American cellular communication standard poses a challenging problem. The desired speech quality is that of the full-rate coder, but the total bit rate specified for the...
详细信息
The design of the half-rate coder for the North-American cellular communication standard poses a challenging problem. The desired speech quality is that of the full-rate coder, but the total bit rate specified for the half-rate is only 6.5 kb/s. Since the mobile communication channel is very noisy, the use of error-correcting codes is necessary, which leaves only about 4 kb/s for the actual speech information. This very restricted bit budget requires that fewer than 30 b be allotted to the quantization of the LSF parameters, which precludes the use of scalar quantization. In this work, a quantizer design procedure which is tailored for the transmission frame structure of the North-American (TIA) cellular communication standard is presented. It is basically a split vector quantizer which makes use of interframe prediction to lower the number of bits required for quantization. Since the prediction is performed only on every other frame, the error propagation is limited. The quantization technique yields good performance in the 26-27 b/frame range, and the complexity of its implementation is less than 0.5 Mip.
In this paper, a new subband-based classification scheme is developed for classifying underwater mines and mine-like targets From the acoustic backscattered signals. The system consists of a feature extractor using wa...
详细信息
In this paper, a new subband-based classification scheme is developed for classifying underwater mines and mine-like targets From the acoustic backscattered signals. The system consists of a feature extractor using wavelet packets in conjunction with linear predictive coding (LPC), a feature selection scheme, and a backpropagation neural-network classifier, The data set used for this study consists of the backscattered signals from six different objects: two mine-like targets and four nontargets for several aspect angles. Simulation results on ten different noisy realizations and for signal-to-noise ratio (SNR) of 12 dB are presented. The receiver operating characteristic (ROC) curve of the classifier generated based on these results demonstrated excellent classification performance of the system, The generalization ability of the trained network was demonstrated by computing the error and classification rate statistics on a large data set, A multiaspect fusion scheme was also adopted in order to further improve the classification performance.
The range of user requirements on multicast protocols is so wide that no single protocol will ever satisfy them. The set of multicast protocols can be classified using the user requirements, and the architectures, mec...
详细信息
The range of user requirements on multicast protocols is so wide that no single protocol will ever satisfy them. The set of multicast protocols can be classified using the user requirements, and the architectures, mechanisms, communications patterns, and policies used to satisfy these requirements. We provide such a classification, and illustrate it with several example protocols chosen to cover the range of features described.
A technique is presented for quantizing the articulatory space, i.e. for replacing the continuum of all possible vocal tract shapes by a finite set of shapes that span the articulatory space. Vocal tract shapes are re...
详细信息
A technique is presented for quantizing the articulatory space, i.e. for replacing the continuum of all possible vocal tract shapes by a finite set of shapes that span the articulatory space. Vocal tract shapes are represented as vectors of parameters that control an articulatory model, so vector quantization are applicable for deriving this codebook of tract shapes. Samples of the articulatory space are generated by interpolating model parameters along paths between selected target configurations. These shapes constitute the training data that are clustered to form the codebook. Such a codebook of shapes has direct application when vocal tract shapes must be estimated from speech. One such application, which motivated the present study, is the use of the codebook to providing starting configurations for an optimizing articulatory speech synthesizer.< >
In this paper we propose a very simple and efficient weighting filter with which the computational complexity of CELP coders can be considerably reduced. Other algorithms using a weighting filter could also benefit fr...
详细信息
In this paper we propose a very simple and efficient weighting filter with which the computational complexity of CELP coders can be considerably reduced. Other algorithms using a weighting filter could also benefit from the advantages of this simplified weighting filter. The estimation of the long term prediction with the close-loop method is described. A binary codebook is used for the excitation vectors. It is shown how the excitation sequence can be obtained by a non-exhaustive method in two steps with a simplified algorithm and the simple weighting filter. Several coders have been implemented showing that the perceptual quality of the simplified algorithm is equivalent to that of the original CELP.
To promote energy efficiency during *** furnace smelting, four smelting states were introduced in the smelting stage: an unmelted state, semi-molten state, molten state, and overheating state. A smelting identificatio...
详细信息
To promote energy efficiency during *** furnace smelting, four smelting states were introduced in the smelting stage: an unmelted state, semi-molten state, molten state, and overheating state. A smelting identification system to distinguish these smelting states was developed through the use of linear predictive coding and a principal component analysis algorithm. A new smelting condition identification system was obtained. Corresponding pilot productions were conducted to compare the differences between employing the method and not employing the method. All of the pilot production data showed that feeding raw materials over time during the overheating state and decreasing current injection in the molten state could reduce energy consumption as well as increase crystal purity. (C) 2016 Published by Elsevier B.V.
We address the classical bearings-only tracking problem (BOT) for a single object, which belongs to the general class of nonlinear filtering problems. Recently, algorithms based on sequential Monte-Carlo methods (part...
详细信息
We address the classical bearings-only tracking problem (BOT) for a single object, which belongs to the general class of nonlinear filtering problems. Recently, algorithms based on sequential Monte-Carlo methods (particle filtering) have been proposed. As far as performance analysis is concerned, the posterior Cramar-Rao bound (PCRB) provides a lower bound on the mean square error. Classically, under a technical assumption named "asymptotic unbiasedness assumption," the PCRB is given by the inverse Fisher information matrix (FIM). The latter is computed using Tichavsky's recursive formula via Monte-Carlo methods. Two major problems are studied here. First, we show that the asymptotic unbiasedness assumption can be replaced by an assumption which is more meaningful. Second, an exact algorithm to compute the PCRB is derived via Tichavsky's recursive formula without using Monte-Carlo methods. This result is based on a new coordinate system named logarithmic polar coordinate (LPC) system. Simulation results illustrate that PCRB can now be computed accurately and quickly, making it suitable for sensor management applications.
The development of a 2400-b/s speech digitizer which provides an acceptable level of intelligibility and quality over land mobile satellite channels is described. Performance tests over simulated channels in the UHF b...
详细信息
The development of a 2400-b/s speech digitizer which provides an acceptable level of intelligibility and quality over land mobile satellite channels is described. Performance tests over simulated channels in the UHF band (800 MHz) are presented. The voice digitizer is a linear prediction (LPC) vocoder which uses a channel error correction and concealment procedure tailored to error statistics for a minimum-shift keyed (MSK) downlink to a moving vehicle. The error-handling technique is based on perceptual criteria and utilizes the parametric nature of LPC representation of speech. A single-error-correcting, single-burst-detecting (28, 20) fire code is shown to be the best choice for the application. The intelligibility of the vocoder is measured and compared to the standard LPC-10 algorithm. The major remaining sources of speech quality degradation due to channel errors are determined and ranked.< >
Heart sound analysis plays an important role in the auscultative diagnosis process to detect the presence of cardiovascular diseases. In this paper we propose a novel parametric heart sound model that accurately repre...
详细信息
Heart sound analysis plays an important role in the auscultative diagnosis process to detect the presence of cardiovascular diseases. In this paper we propose a novel parametric heart sound model that accurately represents normal and pathological cardiac audio signals, also known as phonocardiograms (PCG). The proposed model considers that the PCG signal is formed by the sum of two parts: one of them is deterministic and the other one is stochastic. The first part contains most of the acoustic energy. This part is modeled by the Matching Pursuit (MP) algorithm, which performs an analysis-synthesis procedure to represent the PCG signal as a linear combination of elementary waveforms. The second part, also called residual, is obtained after subtracting the deterministic signal from the original heart sound recording and can be accurately represented as an autoregressive process using the linear predictive coding (LPC) technique. We evaluate the proposed heart sound model by performing subjective and objective tests using signals corresponding to different pathological cardiac sounds. The results of the objective evaluation show an average Percentage of Root-Mean-Square Difference of approximately 5% between the original heart sound and the reconstructed signal. For the subjective test we conducted a formal methodology for perceptual evaluation of audio quality with the assistance of medical experts. Statistical results of the subjective evaluation show that our model provides a highly accurate approximation of real heart sound signals. We are not aware of any previous heart sound model rigorously evaluated as our proposal.
This paper brings light on the digital signal processing (DSP) roots of a modern concept, voice over IP (VoIP). An example is also provided in which developments in DSP - speech coding, in particular - had a profound ...
详细信息
This paper brings light on the digital signal processing (DSP) roots of a modern concept, voice over IP (VoIP). An example is also provided in which developments in DSP - speech coding, in particular - had a profound impact on the early development of the ARPANET, the ancestor of the Internet. The author shows how packet speech, recently rediscovered and made popular as VoIP, was first successfully demonstrated in 1974 on the ARPANET and how the Internet protocol (IP) emerged largely as a result of that effort.
暂无评论