作者:
Merouane, BouzidUSTHB
Elect Fac Speech Commun & Signal Proc Lab Algiers 16111 Algeria
In this paper, an optimized trellis coded vector quantization (OTCVQ) system designed for efficient and robust coding of LSF spectral parameters is presented. The aim of this system, called at the beginning "LSF-...
详细信息
ISBN:
(纸本)9781424444564
In this paper, an optimized trellis coded vector quantization (OTCVQ) system designed for efficient and robust coding of LSF spectral parameters is presented. The aim of this system, called at the beginning "LSF-OTCVQ Encoder", is to achieve a low bit rate transparent quantization of the FS1016 LSF parameters. Once the effectiveness of the LSF-OTCVQ encoder was proven in the case of ideal transmissions over noiseless channel, we were interested after in the improvement of its robustness for real transmissions over noisy channel. To protect implicitly the transmission indices of the LSF-OTCVQ encoder incorporated in the FS1016, we used a joint source-channel coding carried out by the channel optimized vector quantization.
<正>In September 1992,the recommendation G 728,which is a 16kbps LD-CELP speech coder submitted by AT&Twas standarized by *** the process of ratification test[1],the coder’s performances were equivalent to or bett...
详细信息
ISBN:
(纸本)0780329120
<正>In September 1992,the recommendation G 728,which is a 16kbps LD-CELP speech coder submitted by AT&Twas standarized by *** the process of ratification test[1],the coder’s performances were equivalent to or better than that of 32kbps ADPCM for all conditions *** paper,which is based on a G.728 encoding-decoding system simulated in software,studies and tests different parts of the algorithm,espacially that of the postfitter
For the purpose of low-cost or low power consumption, a cost-effective solution with good trade-off between quality and complexity must be estimated for small mobile machine. In this paper, we propose a condensed MP-M...
详细信息
ISBN:
(纸本)0780367200
For the purpose of low-cost or low power consumption, a cost-effective solution with good trade-off between quality and complexity must be estimated for small mobile machine. In this paper, we propose a condensed MP-MLQ algorithm to reduce search computation required for G723.1. Simulation results show that the proposed method can reduce about 50 similar to 82% computational load for stochastic codebook search with perceptually intangible degradation. This reductionism would be helpful to save the power consumption and relax CPU load for the mobile-machine in wireless communications, e.g. PALM, PDA.
The navigation autopilot system was developed using real-time voice command recognition system for the ship open water cruise. In this design, linear predictive coding (LPC) algorithm is used for the extraction of the...
详细信息
ISBN:
(纸本)9781467355636;9781467355629
The navigation autopilot system was developed using real-time voice command recognition system for the ship open water cruise. In this design, linear predictive coding (LPC) algorithm is used for the extraction of the voice command feature and Dynamic Time Warping (DTW) algorithm is used for attribute matching on MATLAB program.
This paper addresses the problem of improving the intelligibility of the synthesized speech in Tamil text-to-speech (TTS) synthesis system. The human speech is artificially generated by speech synthesis. The normal la...
详细信息
ISBN:
(纸本)9788132221265;9788132221258
This paper addresses the problem of improving the intelligibility of the synthesized speech in Tamil text-to-speech (TTS) synthesis system. The human speech is artificially generated by speech synthesis. The normal language text will be automatically converted into speech using TTS system. This paper deals with a corpus-driven Tamil TTS system based on the concatenative synthesis approach. Concatenative speech synthesis involves the concatenation of the basic units to synthesize an intelligent, natural sounding speech. In this paper, syllables are the basic unit of speech synthesis database and the modification of syllable pitch by timescale modification. The speech units are annotated with associated prosodic information about each unit, manually or automatically, based on an algorithm. An annotated speech corpus utilizes the clustering technique that provides way to select the suitable unit for concatenation, depending on the minimum total joint cost of the speech unit. The entered text file is analyzed first, this syllabication is performed based on the linguistics rules, and the syllables are stored separately. Then, the syllable corresponding speech file is concatenated and the silence present in the concatenated speech is removed. After that, discontinuities are minimized at syllable boundaries without degrading the quality. Smoothing at the concatenated syllable boundary is performed, changing the syllable pitches by timescale modification.
In this paper a new method is proposed which reduce the redundancy of data transmission in Wireless Sensor Network (WSN). In a dense environment of sensors, energy efficiency is key factor for prolonged battery life. ...
详细信息
ISBN:
(纸本)9781509034802
In this paper a new method is proposed which reduce the redundancy of data transmission in Wireless Sensor Network (WSN). In a dense environment of sensors, energy efficiency is key factor for prolonged battery life. Here in this paper temporal redundancy of sensor node is reduced using hybrid of two methods, one is compressive sensing using Discrete Cosine Transform (DCT) and other one is linear predictive coding (LPC). Given method is not suitable for real time data monitoring rather it is better for delay tolerant services as it require processing time before transmission. Here it is assumed that data monitored by sensors is piecewise linear. The relevancy in temporal data exploited here for data compression. First data monitored from each node for a long enough span of time which is further segmented according to their linearity. As compressive sensing compress the data and ARMA also use only few of the transmission bits, combination of both provide satisfactory compression of stored data. The segmented data used as input for compression and further processed to sink node. Result shows that this scheme provide satisfactory compression rate of monitored data.
This paper describes development of reliable gunshot detection system with emphasis on low power consumption for use in counter-poacher devices primarily protecting elephants in Africa. Intended system will work as a ...
详细信息
ISBN:
(纸本)9781479981175
This paper describes development of reliable gunshot detection system with emphasis on low power consumption for use in counter-poacher devices primarily protecting elephants in Africa. Intended system will work as a binary detector of gunfire without further classification of used fire-arm. Dominance of right gunshot detection over false alarms is crucial. Proposed recognition system is based on linearpredictive coefficients, correlation against template and comparison of spectral energy in sub-bands.
Accurate linear Prediction Coefficient (LPC) estimation is one of the key requirements for low bit-rate voice coding. Under harsh acoustic conditions, LPC estimation can become unreliable. This results in poor quality...
详细信息
ISBN:
(纸本)0780375491
Accurate linear Prediction Coefficient (LPC) estimation is one of the key requirements for low bit-rate voice coding. Under harsh acoustic conditions, LPC estimation can become unreliable. This results in poor quality of encoded speech and introduces annoying artifacts. This paper presents a two-branch speech enhancement preprocessing scheme for low bit-rate voice coders. This scheme consists of two parallel denoising blocks. One block will enhance the degraded speech for LPC estimation. Another block will increase the perceptual quality of the speech to be coded. The goal of this paper is to design the two-branch scheme. Test results show that the two-branch scheme can provide better perceptual quality compared to conventional one-branch speech enhancement techniques in noisy environments.
We propose a system that is capable of improving intelligibility of speech by those with hearing impairment. We have found the speech often had problems in the intonation, the duration of the unvoiced consonants, and ...
详细信息
ISBN:
(纸本)9781424425709
We propose a system that is capable of improving intelligibility of speech by those with hearing impairment. We have found the speech often had problems in the intonation, the duration of the unvoiced consonants, and the tone of the voiced phonemes. The system systematically compensates problematic components of the speech using the counterparts in normal speech. It corrects the intonation using TD-PSOLA and elongates consonants by repeating the original waveform using inverting technique for making the result continuously connected until the duration reaches a threshold. Experimental results show that the proposed method successfully improves the intelligibility of the speech from 28% to 35% by making phonemes more articulated, and making double consonants, V of syllabic consonant, and unvoiced consonants perceived clearer.
The first single chip speech synthesizer was introduced by Texas Instruments in June of 1978, and forms the basis for the very popular “Speak & Spell”* learning aid. The development of this unique integrated cir...
详细信息
暂无评论