A fast convergence speech enhancement method is proposed in this paper. The noise estimation acceleration technique is applied to the conventional statistical model based algorithm to shorten the convergence time afte...
详细信息
A fast convergence speech enhancement method is proposed in this paper. The noise estimation acceleration technique is applied to the conventional statistical model based algorithm to shorten the convergence time after the sudden change of noise intensity. First, the burst detection of power spectrum is performed on the noisy spectrum. Next, the loglikelihood ratio (LLR) based VAD is used in the period when the noise power is stationary, and the spectral entropy based VAD is implemented in the hang-over frames after the burst of noisy spectrum. Then a flag is set to control the update of noise estimation. Finally, an attenuated version of the noisy spectrum will be used directly as noise estimation if the update flag is set to one. The performance of the proposed method is evaluated under ITU-T G.160. In comparison with the conventional method, the convergence time is reduced evidently, while the abilities of noise reduction and SNR improvement are preserved, and the impact on the objective speech quality is constrained to a low level.
A novel characteristic waveform (CW) decomposition method based on bi-orthogonal lifting wavelet transform (BLWT) is proposed for CWI speech codec in this paper. Firstly, the complicated CW alignment operations are no...
详细信息
A novel characteristic waveform (CW) decomposition method based on bi-orthogonal lifting wavelet transform (BLWT) is proposed for CWI speech codec in this paper. Firstly, the complicated CW alignment operations are not necessary by using this method, and as a result the computational complexity can be greatly reduced. Secondly, the additional delay of CW decomposition based on traditional wavelet transform is cancelled by using boundary treatment. Thirdly, by omitting filter operations and adopting the situ calculation technique, the memory consumption of the proposed algorithm will be much smaller than the traditional one. In addition, the multi-resolution analysis capability of wavelet transform is preserved, which will make the parameter quantization more flexible and the scalable coding much easier to be realized. The performance evaluation shows that the proposed decomposition algorithm could fulfill the requirements of high quality, multi-rate speech compression, and is suitable for the real-time communication systems. The results of MOS and A/B listening test show that the performance of 1.84kb/s CWI coder based on this CW decomposition is very close to that of 2.4kbit/s MELP coder.
Conventional pulse compression use a periodical echo of single receive antenna, which is modulated by a certain carrier-frequency, in other words, single spectrum is exploited. But for MIMO radar, as the multi-carrier...
详细信息
Conventional pulse compression use a periodical echo of single receive antenna, which is modulated by a certain carrier-frequency, in other words, single spectrum is exploited. But for MIMO radar, as the multi-carrier-frequency signals are transmitted simultaneously, if the spectrum of the target echo after channel separation can be combined to form the whole band spectrum echo, the corresponding range resolution can improve several times as compared with the conventional method, and it will be more convenient for follow-up detection and tracking. Considering the difference between the frequency modulation band and the interval between the adjacent frequencies, the spectrum joint after channel separation will be overlapped or spaced. The methods of spectrum moving of each echo and the spectrum extrapolation with Root-MUSIC algorithm are proposed, by which high-resolution range profile of the target is obtained. Simulation results verify the validity of these methods.
This paper proposes a multi-layer super -widebandembedded speech and audio coding algorithm extending bit rates from 36 to 64 kb/s on the basis of ITU-T Recommendation G.729.1 with a multi-stage coding structure. This...
详细信息
This paper proposes a multi-layer super -widebandembedded speech and audio coding algorithm extending bit rates from 36 to 64 kb/s on the basis of ITU-T Recommendation G.729.1 with a multi-stage coding structure. This codec consists of three embedded stages: G.729.1 wideband coding operating in the range from 8 to 32 kb/s, modified Modulated LappedTransform (MLT) coding of the band (7-14 kHz) at 36, 40 & 48 kb/s and MDCT transform coding for wideband residual signal at 56 and 64 kb/s. In addition, some methods are proposed in transform coding according to perception significance. The objective and subjective listening tests show that this codec has good performance compared with reference codec.
A new height measuring scheme for VHF radar is proposed to reduce the computational complexity due to multi-dimensional search in direction of arrival (DOA) estimation using the conventional maximum likelihood (ML) al...
详细信息
In this paper, by using the cyclostationary properties of speech signal, a voice activity detection (VAD) algorithm based on cyclic cumulant is proposed. The proposed scheme employs the thirdorder cyclic cumulant of t...
详细信息
This paper proposes an efficient method for frame erasure concealment in G.722.1 coding algorithm,which can mitigate the adverse impact of frame erasure on the reconstruction *** lost frame is reconstructed in the mod...
详细信息
This paper proposes an efficient method for frame erasure concealment in G.722.1 coding algorithm,which can mitigate the adverse impact of frame erasure on the reconstruction *** lost frame is reconstructed in the modulated lapped transform (MLT) domain in terms of magnitude and sign of the *** method of interpolation is employed for the magnitude *** sign information is recovered by extra information from *** test results indicate that improved quality is produced by the proposed algorithm.
This paper describes an embedded speech and audio codec which is based on ITU-T Recommendation G.722.1;it can process 7 kHz bandwidth speech and audio signal at scalable bit *** on the G.722.1 of ITU-T,this algorithm ...
详细信息
This paper describes an embedded speech and audio codec which is based on ITU-T Recommendation G.722.1;it can process 7 kHz bandwidth speech and audio signal at scalable bit *** on the G.722.1 of ITU-T,this algorithm adds two modules: the energy ordering of sub-band and the processing of bit-stream ***,it does some modification on the categorization and noise-fill *** makes sure that the codec could produce embedded bit-stream,so this codec had more robustness in the *** test results by ITUT PESQ show that this codec has good performance as G.722.1 at the same bit-rates.
A new 1kb/s waveform interpolation (WI) speech coding algorithm based on non-negative matrix factorization (NMF) is proposed and implemented in this ***-frame parameter joint vector or matrix quantization,parameter pr...
详细信息
A new 1kb/s waveform interpolation (WI) speech coding algorithm based on non-negative matrix factorization (NMF) is proposed and implemented in this ***-frame parameter joint vector or matrix quantization,parameter prediction and discrete cosine transform (DCT) are used to reduce the bit rate and to improve the quality of *** results of informal subjective listening test shows that the intelligibility and articulation of the algorithm are close to that of 2kb/s NMF-based WI coder (called “NMF-WI” as convenience).
Most of the current pitch detection algorithms can not work well under the high noise *** this reason,a pitch detection algorithm for noisy speech signal based on pre-filtering and weighted wavelet coefficients is ***...
详细信息
Most of the current pitch detection algorithms can not work well under the high noise *** this reason,a pitch detection algorithm for noisy speech signal based on pre-filtering and weighted wavelet coefficients is ***,the noisy speech signals are ***,the speech pre-filtered is decomposed by the quadratic spline ***,the wavelet coefficients of three consecutive scales are weighted to emphasize the sharp change ***,three candidate pitch periods are extracted from the weighted ***,the pitch period is calculated by autocorrelation *** show that this algorithm can increase the performance of pitch detection in noisy environment and decreases computational complexity compared with DWT-NCCF method.
暂无评论