Search Results - Inner Mongolia University Library

Pattern recognition methods applied to respiratory sounds classification into normal and wheeze classes

COMPUTERS IN BIOLOGY AND MEDICINE, 2009, Vol. 39, No. 9, pp. 824-843

Authors: Bahoura, Mohammed. Affiliations: Univ Quebec Dept Engn Rimouski PQ G5L 3A1 Canada

In this paper, we present the pattern recognition methods proposed to classify respiratory sounds into normal and wheeze classes. We evaluate and compare the feature extraction techniques based on Fourier transform, linear predictive coding, wavelet transform and Mel-frequency cepstral coefficients (MFCC) in combination with the classification methods based on vector quantization, Gaussian mixture models (GMM) and artificial neural networks, using receiver operating characteristic curves. We propose the use of an optimized threshold to discriminate the wheezing class from the normal one. A post-processing filter is also employed to considerably improve the classification accuracy. Experimental results show that our approach based on MFCC coefficients combined with GMM is well adapted to classifying respiratory sounds into normal and wheeze classes. McNemar's test demonstrated a significant difference between the results obtained by the presented classifiers (p < 0.05). (C) 2009 Elsevier Ltd. All rights reserved.

Keywords: Respiratory sounds; Wavelet transform; Mel-frequency cepstral coefficients; linear predictive coding; Vector quantization; Gaussian mixture models; Multi-layer perceptron; Receiver operating characteristic; Statistical significance; McNemar's test
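
Note: the abstract above reports McNemar's test for comparing classifiers. A minimal Python sketch of the continuity-corrected form of that test follows. The function name `mcnemar_test` and the disagreement counts used in the usage note are hypothetical and are not taken from the paper.

```python
import math


def mcnemar_test(b: int, c: int) -> tuple[float, float]:
    """Continuity-corrected McNemar test on paired classifier decisions.

    b: items classifier A labelled correctly and classifier B incorrectly.
    c: items classifier B labelled correctly and classifier A incorrectly.
    Returns (chi-square statistic, p-value) with 1 degree of freedom.
    """
    if b + c == 0:
        # The two classifiers never disagree, so there is no evidence of a difference.
        return 0.0, 1.0
    chi2 = max(abs(b - c) - 1, 0) ** 2 / (b + c)
    # Survival function of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value
```

With hypothetical counts b = 25 and c = 10, calling `mcnemar_test(25, 10)` gives a statistic of 5.6 and a p-value below 0.05, which would be read as a significant difference between the two classifiers.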


Data transmission over GSM voice channel using digital modulation technique based on autoregressive modeling of speech production

DIGITAL SIGNAL PROCESSING, 2009, Vol. 19, No. 4, pp. 612-627

Authors: Kotnik, Bojan; Mezgec, Zdenko; Svecko, Janja; Chowdhury, Amor. Affiliations: Margento BV NL-1043 BW Amsterdam Netherlands; Res Ctr Maribor Ultra Doo Maribor 2000 Slovenia

This paper presents a novel digital data modulation and demodulation algorithm, ARDMA, based on the principles of autoregressive (AR) modeling of speech production. In the first step, the characteristics of a sustained voiced speech signal are analyzed using autoregressive modeling, and two sets of linear prediction (LPC) coefficients are obtained and converted to line spectral frequencies (LSF). The input binary data stream drives the selection mechanism of the LSF coefficients, which are then applied as filter coefficients of the modulation signal synthesis filter. This filter is excited with a specially designed excitation signal that corresponds to the basic characteristics of the typical excitation signal of the human vocal tract. Finally, a speech-like modulation signal is produced. This modulation signal is then sent through the voice channel of the GSM system. The demodulator analyzes the incoming modulation signal using autoregressive modeling. The most likely LSF vector that modulated the particular symbol is determined by the demodulation process and converted to the respective string of binary data. The performance of the proposed modulation scheme was compared to the regular frequency shift keying (FSK) method. A performance improvement of ARDMA over FSK is observed at higher bit-rates for the three GSM speech coders compared. (c) 2008 Elsevier Inc. All rights reserved.

Keywords: Autoregressive modeling; linear predictive coding; Digital modulations; Speech production model; GSM; Speech codecs; Data transmission systems


APPLYING IMPROVED SPECTRAL MODELING FOR HIGH QUALITY VOICE CONVERSION


IEEE International Conference on Acoustics, Speech and Signal Processing

Authors: Villavicencio, Fernando; Roebel, Axel; Rodet, Xavier. Affiliations: Univ Pompeu Fabra Music Technol Grp Ocata 1 Barcelona 08003 Spain; CNRS IRCAM STMS Anal Synthesis Team F-75004 Paris France

ISBN: (print) 9781424423538

In this work, accurate spectral envelope estimation is applied to Voice Conversion in order to achieve High-Quality timbre conversion. True-Envelope based estimators allow model order selection, leading to an adaptation of the spectral features to the characteristics of the speaker. Optimal residual signals can also be computed following a local adaptation of the model order in terms of the F0. A new perceptual criterion is proposed to measure the impact of the spectral conversion error. The proposed envelope models show improved spectral conversion performance as well as increased converted-speech quality when compared to Linear Prediction.

Keywords: Speech synthesis; speech analysis; cepstral analysis; spectral analysis; linear predictive coding


Support Vector Machines and MLP for automatic classification of seismic signals at Stromboli volcano


19th Italian Workshop of the Italian-Society-for-Neural-Network (SIREN) on Neural Nets (WIRN)

Authors: Giacco, Ferdinando; Esposito, Antonietta Maria; Scarpetta, Silvia; Giudicepietro, Flora; Marinaro, Maria. Affiliations: Univ Salerno Dept Phys I-84081 Baronissi SA Italy; Osserv Vesuviano Ist Nazl Geofis & Vulcanol Naples Italy; Ist Nazl Fis Nucl Salerno Italy; INFM CNISM Salerno Italy; Inst Adv Sci Studies Vietri Sul Mare Italy

ISBN: (print) 9781607500728

We applied and compared two supervised pattern recognition techniques, namely the Multilayer Perceptron (MLP) and the Support Vector Machine (SVM), to classify seismic signals recorded on Stromboli volcano. The available data are first preprocessed in order to obtain a compact representation of the raw seismic signals. Each input vector is made up of 71 components, containing both spectral and temporal information extracted from the early signal. We implemented two classification strategies to discriminate three different seismic events: landslide, explosion-quake, and volcanic microtremor signals. The first method is a two-layer MLP network, with a Cross-Entropy error function and logistic activation function for the output units. The second method is a Support Vector Machine, whose multi-class setting is accomplished through a 1vsAll architecture with a Gaussian kernel. The experiments show that although the MLP produces very good results, the SVM accuracy is always higher, both in terms of best performance, 99.5%, and average performance, 98.8%, obtained with different sampling permutations of training and test sets.

Keywords: Seismic signals discrimination; linear predictive coding; Neural Networks; Support Vector Machine; Multilayer Perceptron


Audio Watermark Detection Using Undetermined ICA


8th International Conference on Independent Component Analysis and Signal Separation

Authors: Seok, Jongwon; Malik, Hafiz. Affiliations: Changwon Natl Univ Dept Informat & Commun Engn Chang Won Kyongnam South Korea; Univ Michigan Dept Elect & Com Dearborn MI USA

ISBN: (print) 9783642005985

This paper presents a blind watermark detection scheme for the additive watermark embedding model. The proposed estimation-correlation-based watermark detector first estimates the embedded watermark by exploiting the non-Gaussianity of the real-world audio signal and the mutual independence between the host signal and the embedded watermark; a correlation-based detector is then used to determine the presence or absence of the watermark. For watermark estimation, blind source separation (BSS) based on underdetermined independent component analysis (UICA) is used. A low watermark-to-signal ratio (WSR) is one of the limitations of blind detection for the additive embedding model. The proposed detector uses two-stage processing to improve the WSR at the blind detector: the first stage removes the audio spectrum from the watermarked audio signal using linear predictive (LP) filtering, and the second stage uses the resulting residue from the LP filtering stage to estimate the embedded watermark using BSS based on UICA. Simulation results show that the proposed detector performs significantly better than existing estimation-correlation-based detection schemes.

Keywords: Audio Watermark Detection; Blind Source Separation; Mean-Field Approaches; Independent Component Analysis; linear predictive coding


AN ERROR ROBUST ULTRA LOW DELAY AUDIO CODER USING AN MA PREDICTION MODEL


IEEE International Conference on Acoustics, Speech and Signal Processing

Authors: Wabnik, Stefan; Schuller, Gerald; Kraemer, Ferenc. Affiliations: Fraunhofer IDMT Inst Digital Media Technol Ehrenbergstr 31 D-98693 Ilmenau Germany; Tech Univ Ilmenau Inst Media Technol D-98693 Ilmenau Germany

ISBN: (print) 9781424423538

This paper compares two prediction structures for predictive perceptual audio coding in the context of the Ultra Low Delay (ULD) coding scheme. One structure is based on the commonly used AR signal model, leading to an IIR predictor in the decoder. The other structure is based on an MA signal model, leading to an FIR predictor in the decoder. We find that the AR-based predictor performs slightly better over an undisturbed transmission channel, but the MA-based predictor performs much better in the presence of transmission errors. For a Bit Error Rate (BER) of 1.0e-5, the proposed MA model predictor achieves a mean Objective Difference Grade (ODG) of -0.66, whereas the AR model predictor only reaches -3.42.

Keywords: Low Delay Audio coding; linear predictive coding; Moving average processes; Autoregressive processes; Robustness
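
Note: the robustness contrast in the abstract above (IIR predictor for the AR model, FIR predictor for the MA model) can be illustrated with a toy Python sketch. It is an open-loop simplification, not the ULD coder itself: the function names, the single coefficient 0.9, and the signal length are assumptions chosen only for illustration. One corrupted residual sample is injected and the number of affected output samples is counted.

```python
def iir_decode(residual, a):
    """AR-style decoder: predicts each sample from past *output* samples."""
    out = []
    for n, e in enumerate(residual):
        pred = sum(a[k] * out[n - 1 - k] for k in range(len(a)) if n - 1 - k >= 0)
        out.append(e + pred)
    return out


def fir_decode(residual, b):
    """MA-style decoder: predicts each sample from past *residual* samples."""
    out = []
    for n, e in enumerate(residual):
        pred = sum(b[k] * residual[n - 1 - k] for k in range(len(b)) if n - 1 - k >= 0)
        out.append(e + pred)
    return out


def affected_samples(decode, coeffs, length=50, error_at=10, tol=1e-12):
    """Count output samples that change when one residual sample is corrupted."""
    clean = [0.0] * length
    corrupted = list(clean)
    corrupted[error_at] = 1.0  # hypothetical single transmission error
    y_clean = decode(clean, coeffs)
    y_bad = decode(corrupted, coeffs)
    return sum(1 for u, v in zip(y_clean, y_bad) if abs(u - v) > tol)
```

In this toy setting, `affected_samples(iir_decode, [0.9])` reports that every sample from the error onward is disturbed (40 of 50), whereas `affected_samples(fir_decode, [0.9])` reports only 2, because the FIR structure forgets the error after as many samples as it has coefficients.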


Isolated Word Recognition using Low Dimensional Features and Kernel based Classification


International Conference on Advances in Recent Technologies in Communication and Computing

Authors: Nehe, N. S.; Holambe, R. S. Affiliations: Pravara Rural Engn Coll Dept Instrumentat & Control Engg Loni Maharashtra India; SGGS Inst Engn & Technol Dept Instrumentat Engg Nanded Maharashtra India

ISBN: (print) 9781424451043

This paper describes a polynomial kernel subspace approach to Isolated Word Recognition (IWR) systems. Linear predictive coding (LPC) coefficients derived from wavelet sub-bands of a speech frame were used as features. This approach represents a mapping of speech features (input space) into a feature space via a non-linear mapping onto the principal components, called Kernel Linear Discriminant Analysis (KLDA). The non-linear mapping between the input space and the feature space is implicitly performed using the kernel trick. This non-linear mapping using KLDA increases the discrimination ability of a pattern classifier. The use of wavelet sub-band based LPC features (WLPC) provides low dimensional features, which reduce the memory requirement, and KLDA provides fast classification and recognition. Experimental results obtained on an isolated word database show that the proposed technique is computationally efficient and performs well with less training data.

Keywords: linear predictive coding; Wavelet Transform; Kernel linear Discriminant Analysis
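
Note: LPC coefficients, the shared keyword of the records on this page, are commonly obtained with the autocorrelation method and the Levinson-Durbin recursion. The Python sketch below shows that generic procedure only. It is not the wavelet sub-band feature extraction of the paper above, and the function names are illustrative.

```python
def autocorrelation(x, max_lag):
    """Biased autocorrelation r[0..max_lag] of one (already windowed) frame."""
    n = len(x)
    return [sum(x[i] * x[i + lag] for i in range(n - lag)) for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the LPC normal equations from autocorrelation values r.

    Returns (a, err), where x[n] is predicted as sum(a[k-1] * x[n-k]) for
    k = 1..order and err is the final prediction error energy.
    """
    a = [0.0] * (order + 1)  # a[0] is unused; a[1..order] are the coefficients
    err = r[0]
    for i in range(1, order + 1):
        if err == 0.0:
            break  # degenerate frame (e.g. silence): stop the recursion early
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / err  # reflection coefficient for this order
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= 1.0 - k * k
    return a[1:], err
```

For a speech frame `x`, a typical call would be `levinson_durbin(autocorrelation(x, p), p)` with a model order `p` chosen by the user. For the autocorrelation sequence `[1.0, 0.5, 0.25]` of a first-order process, the recursion returns coefficients `[0.5, 0.0]` and an error of 0.75.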


JOINT ESTIMATION OF SHORT-TERM AND LONG-TERM PREDICTORS IN SPEECH CODERS


IEEE International Conference on Acoustics, Speech and Signal Processing

Authors: Giacobello, Daniele; Christensen, Mads Graesboll; Dahl, Joachim; Jensen, Soren Holdt; Moonen, Marc. Affiliations: Aalborg Univ Dept Elect Syst ES MISP Aalborg Denmark; Katholieke Univ Leuven Dept Elect Engn ESAT SCD Leuven Belgium

ISBN: (print) 9781424423538

In low bit-rate coders, the near-sample and far-sample redundancies of the speech signal are usually removed by a cascade of a short-term and a long-term linear predictor. These two predictors are usually found sequentially, which is suboptimal. In this paper we propose an analysis model that jointly finds the two predictors by adding a regularization term in the minimization process to impose sparsity constraints on a high order predictor. The result is a linear predictor that can be easily factorized into the short-term and long-term predictors. This estimation method is then incorporated into an Algebraic Code Excited Linear Prediction scheme and is shown to perform better than traditional cascade methods and other joint optimization methods, offering lower distortion and higher perceptual speech quality.

Keywords: Speech analysis; linear predictive coding


Robust Quantization of LPC Parameters for Speech Communication Over Noisy Channel


2nd International Conference on the Applications of Digital Information and Web Technologies

Authors: Merouane, Bouzid. Affiliations: USTHB Elect Fac Speech Commun & Signal Proc Lab Algiers 16111 Algeria

ISBN: (print) 9781424444564

In this paper, an optimized trellis coded vector quantization (OTCVQ) system designed for efficient and robust coding of LSF spectral parameters is presented. The aim of this system, initially called the "LSF-OTCVQ Encoder", is to achieve a low bit rate transparent quantization of the FS1016 LSF parameters. Once the effectiveness of the LSF-OTCVQ encoder had been proven for ideal transmissions over a noiseless channel, we sought to improve its robustness for real transmissions over a noisy channel. To implicitly protect the transmission indices of the LSF-OTCVQ encoder incorporated in the FS1016, we used joint source-channel coding carried out by channel optimized vector quantization.

Keywords: Robustness; Quantization; linear predictive coding; Oral communication; Decision support systems; Fiber reinforced plastics; Virtual reality


THE ESTIMATION OF LINE SPECTRAL FREQUENCIES TRAJECTORIES BASED ON UNSCENTED KALMAN FILTERING


6th International Multi-Conference on Systems, Signals and Devices

Authors: Boubakir, Chabane; Berkani, Daoud. Affiliations: Jijel Univ Dept Elect LAMEL Jijel Algeria; Natl Polytechn Sch LSC Dept Elect Ei Harrach Algeria

ISBN: (print) 9781424443451

In recent studies the Unscented Kalman Filter (UKF) has been applied to some nonlinear systems, including several speech processing problems such as the estimation of formant trajectories, state and parameter Kalman estimation for speech enhancement, and the estimation of Line Spectral Frequency (LSF) trajectories. In this paper we apply the UKF to the estimation of LSF trajectories for both synthetic and real noisy speech. The Expectation Maximization (EM) approach is used to iteratively estimate the LSF parameters. Furthermore, the Square-Root implementation of the UKF is used, as it provides numerical stability and guarantees positive semi-definiteness of the state covariance.

Keywords: Speech enhancement; Kalman filtering; Estimation; linear predictive coding
