The purpose of this paper is to comment on the degradation features of poly-emitter n-p-n BJTs after hot carrier injection. First, a physical model which quantitatively explains the experimentally observed gain degrad...
详细信息
The purpose of this paper is to comment on the degradation features of poly-emitter n-p-n BJTs after hot carrier injection. First, a physical model which quantitatively explains the experimentally observed gain degradation is presented and its suitability for Computer Aided Reliability (CAR) applications is demonstrated. Besides, a "charge-pumping" based technique, allowing the accurate evaluation of the total fast electronic Si/SiO/sub 2/ interface states located at the perimeter of the emitter-base junction is also proposed.
This paper describes algorithms to convert spectrograms, cochleagrams and correlograms back into sounds. Each of these representations converts sound waves into pictures or movies. Techniques for inversion, known as t...
详细信息
This paper describes algorithms to convert spectrograms, cochleagrams and correlograms back into sounds. Each of these representations converts sound waves into pictures or movies. Techniques for inversion, known as the pattern playback problem, are important because they allow these representations to be used for analysis and transformations of sound. The algorithms described here use convex projections and intelligent phases guesses to iteratively find the closest waveform consistent with the known information. Reconstructions from the spectrogram and cochleagram are indistinguishable from the original sound. In informal listening tests, the correlogram reconstructions are nearly identical.
This paper presents a design of an extensible and very compact digital multiplier, made in full-custom. It is characterized by regularity and it is particularly adapted to be used as a part of high-speed digital proce...
详细信息
This paper presents a design of an extensible and very compact digital multiplier, made in full-custom. It is characterized by regularity and it is particularly adapted to be used as a part of high-speed digital processors. The design has as a particular feature the short time in which it was developed.
Vocoder is a key component of the digital cellular system. A DSP architecture which supports the direct and immediate addressing modes in one instruction cycle, combined with a RISC-style instruction set, turns out to...
详细信息
Vocoder is a key component of the digital cellular system. A DSP architecture which supports the direct and immediate addressing modes in one instruction cycle, combined with a RISC-style instruction set, turns out to be effective for vocoder implementation in terms of proper performance and lower power consumption requirements. By adopting a dual bank memory system and an efficient ALU bus scheme, two 16-bit operand access and ALU operation can be executed simultaneously. Improved implementation methods applicable to CELP type vocoder in which the pitch search is performed by the analysis-by-synthesis, are also presented.
Speaker modification is the ability to change the perceived speaker identity of a recorded utterance. Basic to this is the capability to alter the vowel segments of speech. Not only do these segments comprise the majo...
详细信息
Speaker modification is the ability to change the perceived speaker identity of a recorded utterance. Basic to this is the capability to alter the vowel segments of speech. Not only do these segments comprise the majority of the voiced portion of speech but they are dominated by clearly defined acoustic parameters-formant frequencies and pitch. A method of altering the formant frequencies of vowel segments using LPC analysis/synthesis was investigated. Pole location modification based on statistical references provided individual control over formant frequencies and bandwidths but, in some transformations, lead to artifacts in the reconstructed speech.
This paper presents a new and efficient LPC quantisation scheme called split matrix quantisation (SMQ). The proposed method can be viewed as an extension of the conventional split vector quantisation process. It opera...
详细信息
This paper presents a new and efficient LPC quantisation scheme called split matrix quantisation (SMQ). The proposed method can be viewed as an extension of the conventional split vector quantisation process. It operates over N consecutive LPC frames and effectively divides a p/spl times/N LSP matrix into K submatrices which are then vector quantised independently. SMQ exploits the interframe redundancy that exists between consecutive sets of LSP coefficients and achieves "transparent" quantisation at 900 bits/sec. "High quality" LSP quantisation can be easily obtained at 750 bits/sec. These bit rates are based in a 20 msec LPC analysis frame size. Furthermore, SMQ is characterised by relatively low complexity and low storage requirements.
The cepstrum coefficients have been widely used for speech signal representation and play a very important role in recognition accuracies. We present a low cost architecture for VLSI implementation of LPC-based cepstr...
详细信息
The cepstrum coefficients have been widely used for speech signal representation and play a very important role in recognition accuracies. We present a low cost architecture for VLSI implementation of LPC-based cepstrum algorithm. The circuit performs the cepstrum operation for each frame of the speech data. A pipelining architecture leads to high speed performance up to speech recognition rate. The cepstrum chip is fabricated in 1.2 /spl mu/m double-metal CMOS technology after the physical design and circuit verification. On the whole, the chip can process 18.3 MHz sampled data and it contains about 24000 transistors which occupy 227.5/spl times/213.3 mils/sup 2/ area. It has been shown to be fully functional and is the first working cepstrum chip.
This paper describes a Korean continuous speech recognition system using phone based semi-continuous hidden Markov model (SCHMM) method for automatic interpretation. The task domain is hotel reservation. The system (c...
详细信息
ISBN:
(纸本)0780324315
This paper describes a Korean continuous speech recognition system using phone based semi-continuous hidden Markov model (SCHMM) method for automatic interpretation. The task domain is hotel reservation. The system (composed of speech recognition, machine translation and speech synthesis) has the following three features. First, an embedded bootstrapping training method is used that enables us to train each phone model without the need for a phoneme segmentation database. Second, a hybrid estimation method which is composed of the forward-backward algorithm and the Viterbi algorithm is proposed for the HMM parameter estimation. Third, a between-word modeling technique is used at the function word boundaries. The recognition results in speaker independent experiments are as follows. In the case of Version 1, the continuous speech recognition result is 89.1% and in Version 2, the result is 97.6%.
A new audio-coding method is proposed. This method is called transform-domain weighted interleave vector quantization (TwinVQ) and achieves high-quality reproduction at less than 64 kbit/s. The method is a transform c...
详细信息
A new audio-coding method is proposed. This method is called transform-domain weighted interleave vector quantization (TwinVQ) and achieves high-quality reproduction at less than 64 kbit/s. The method is a transform coding using modified discrete cosine transform (MDCT). There are three novel techniques in this method: flattening of the MDCT coefficients by the spectrum of linear predictive coding (LPC) coefficients; interframe backward prediction for flattening the MDCT coefficients; and weighted interleave vector quantization. Subjective evaluation tests showed that the quality of the reproduction of TwinVQ exceeded that of an MPEG Layer II coder at the same bitrate.
暂无评论