检索结果-内蒙古大学图书馆

A fast LSF search algorithm based on interframe correlation in G.723.1

EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING 2004年第8期2004卷 1107-1112页

作者： Kibey, SA Kulkarni, JP Sarode, PD Tata Elxsi Ltd Digital Signal Proc & Multimedia Grp Bangalore 560048 Karnataka India Indian Inst Sci Ctr Elect Design & Technol Bangalore 560012 Karnataka India Honeywell Technol Solut Labs Pvt Ltd Bangalore 560076 Karnataka India

We explain a time complexity reduction algorithm that improves the line spectral frequencies (LSF) search procedure on the unit circle for low bit rate speech codecs. The algorithm is based on strong interframe correlation exhibited by LSFs. The fixed point C code of ITU-T Recommendation G.723.1, which uses the "real root algorithm" was modified and the results were verified on ARM-7TDMI general purpose RISC processor. The algorithm works for all test vectors provided by International Telecommunications Union-Telecommunication (ITU-T) as well as real speech. The average time reduction in the search computation was found to be approximately 20%.

关键词： line spectral frequencies linear predictive coding unit circle interframe correlation G.7.23.1

来源：评论

学校读者我要写书评

暂无评论

Optimal multistage vector quantization of LPC parameters over noisy channels

引用

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING 2004年第1期12卷 1-8页

作者： Krishnan, V Anderson, DV Truong, KK Georgia Inst Technol Sch Elect & Comp Engn Atlanta GA 30332 USA Polycom Inc Pleasanton CA 94588 USA

The direct use of vector quantization (VQ) to encode LPC parameters in a communication system suffers from the following two limitations: 1) complexity of implementation for large vector dimensions and codebook sizes and 2) sensitivity to errors in the received indices due to noise in the communication channel. In the past, these issues have been simultaneously addressed by designing channel matched multistage vector quantizers (CM-MSVQ). A sub-optimal sequential design procedure has been used to train the codebooks of the CM-MSVQ. In this paper, a novel channel-optimized multistage vector quantization (CO-MSVQ) codec is presented, in which the stage codebooks are jointly designed. The proposed codec uses a source and channel-dependent distortion measure to encode line spectral frequencies derived from segments of a speech signal. Extensive simulation results are provided to demonstrate the consistent reduction in both the mean and the variance of the spectral distortion obtained using the proposed codec relative to the conventional sequentially designed CM-MSVQ. Furthermore, the perceptual quality of the reconstructed speech using the proposed codec was found to be better than that obtained using the sequentially designed CM-MSVQ.

关键词： channel coding linear predictive coding speech coding vector quantization

来源：评论

学校读者我要写书评

暂无评论

A new Gabor filter based kernel for texture classification with SVM

引用

International Conference on Image Analysis and Recognition

作者： Sabri, M Fieguth, P Univ Waterloo Dept Syst Design Engn Waterloo ON N2L 3G1 Canada

ISBN: (纸本)3540232400

The performance of Support Vector Machines (SVMs) is highly dependent on the choice of a kernel function suited to the problem at hand. In particular, the kernel implicitly performs a feature selection which is the most important stage in any texture classification algorithm. In this work a new Gabor filter based kernel for texture classification with SVMs is proposed. The proposed kernel function is based on a Gabor filter decomposition and exploiting linear predictive coding (LPC) in each subband, and exploiting a filter selection method to choose the best filters. The proposed texture classification method is evaluated using several texture samples, and compared with recently published methods. The comprehensive evaluation of the proposed method shows significant improvement in classification error rate.

关键词： texture classification Support Vector Machine linear predictive coding Gabor filters segmentation

来源：评论

学校读者我要写书评

暂无评论

Text-to-speech synthesis integrated circuit

Text-to-speech synthesis integrated circuit

引用

IEEE Signal Processing and Communications Applications (SIU)

作者： I.F. Baskaya O. Aktan G. Dundar Elektrik ve Elektronik Mühendisligi Bolümü Boğazisi Üniversitesi Bebek Istanbul Turkey Elektrik ve Elektronik Mühendisliği Bölümü Boğaziçi Üniversitesi İstanbul

Geveze software is one of many implementations in text-to-speech synthesis for various languages. The program is based on vocal tract modeling and compresses speech by the LPC method. During synthesis, for each letter of a given word, the nearest combination of the letter sequences within the words used in training is searched and its parameters are used. As in other systems based on vocal tract modeling, a pulse train generates excitation for voiced sounds, while a noise signal is used for unvoiced sounds. The obtained signal is then amplified with a coefficient special to the sound at that instant and finally sent to an IIR filter, whose filter characteristics are determined by LPC coefficients, and the digitized waveform of the speech is obtained. During training, 10 LPC coefficients, 1 gain, and 1 period information bit are obtained for each 25 ms window, separated by 10 ms. During synthesis, these values change every 10 ms to the values of the following window. The digital signal at the output of the IIR filter is converted to analog, which has to be passed through a low pass filter (LPF) in order to smooth the transitions between windows. After filtering, the analog signal is ready to be amplified. Our objective is to design this system, already running on computer, as an integrated circuit and, if possible, to have a single chip solution with optimum cost and performance.

关键词： Integrated circuit synthesis Speech synthesis IIR filters linear predictive coding Signal synthesis Low pass filters Pulse amplifiers Pulse generation Signal generators Acoustic noise

来源：评论

学校读者我要写书评

暂无评论

A study of coordination within a road traffic environment

A study of coordination within a road traffic environment

引用

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT)

作者： S. El hadouaj A. Drogoul UPMC/CNRS UMR 7606 LIP6-OASIS/MIRIAD Paris France INRETS Arcueil France

Our objective consists in studying collaborative situations where an introduction of a new agent into the system increases the performance of the group. This work is a part of the road traffic simulation model ARCHISIM in which a model of the behavior of the drivers has already been developed and validated. Our idea is to re-use this structure to define a collaborative behavior. For this purpose, we add a coordination layer to the basic driver agent behavior. The use of a simple reactive coordination strategy called "situated coordination" allows the emergence of a coherent group of agents that coordinate their activities. Experiments earned to measure the performance of the group, each time a new agent is introduced, show satisfactory results. We also demonstrate that the agents succeeded in coordinating themselves even if the degree of the constraints imposed by the simulated environment is very high.

关键词： Roads Traffic control Collaborative work Psychology linear predictive coding Collaboration Time measurement Laboratories Autonomous agents Intelligent agent

来源：评论

学校读者我要写书评

暂无评论

Empirical mode decomposition of voiced speech signal

Empirical mode decomposition of voiced speech signal

引用

International Symposium on Communications Control and Signal Processing (ISCCSP)

作者： A. Bouzid N. Ellouze Computer Science Department Superior Institute of Technological Studies of Sfax Tunisia

This paper describes a new technique, called the empirical mode decomposition (EMD) that has recently been pioneered by N. E. Huang and al., for adaptively representing nonstationary signals as sums of zero-mean AM-FM components [N. E. Huang, et al., 1998]. The components, called intrinsic mode functions (IMFs), allow the analysis of frequency composition of one-dimensional signals. Applied to speech signal, the EMD allows us to study the different intrinsic oscillatory modes. Besides, computing the LPC analysis of each mode provides an estimation of formants. The presented method is firstly applied on a sum of pure frequency signals. Among different modes we can detect all frequencies taking a part of a signal.

关键词： Signal analysis linear predictive coding Speech analysis Iterative algorithms Signal processing Computer science Educational institutions Time frequency analysis Shape Wavelet analysis

来源：评论

学校读者我要写书评

暂无评论

Fault diagnosis of the machines in power plants using LPC

Fault diagnosis of the machines in power plants using LPC

引用

International Forum on Strategic Technology, IFOST

作者： Ui-Pil Chong Sung-Sang Lee Chang-Ho Sohn School of Computer Engineering University of Ulsan South Korea

Fault diagnosis and monitoring of the machines operation in the power plants play an important role in safety operation and maintenance of those operating machines. In this paper we propose the fault diagnosis algorithm using the LPC coefficients with sound acquisition system from the operating machines through the single LPC spectrum is possible.

关键词： Fault diagnosis Power generation linear predictive coding Condition monitoring Power system reliability Equations Diffusion processes Safety Fault detection Random processes

来源：评论

学校读者我要写书评

暂无评论

Spoken digits recognition by subspace decomposition method

Spoken digits recognition by subspace decomposition method

引用

IEEE Region 10 International Conference TENCON

作者： K. Kusakari K. Kurihara T. Murakami Y. Ishida Department of Electronics and Communication Meiji University Kawasaki Kanagawa Japan

In this paper, we propose a method for spoken digits recognition using dynamic programming (DP) matching combined with subspace decomposition, which linearly separates phonetic information from speech data based on principal component analysis (PCA). This method is capable of more robust speech recognition of less reference speech patterns. The use of the spectral envelope by linear predictive coding (LPC) in speech recognition is unable to avoid errors in recognition due to the uncertainty of personalities, the dynamic variation of features, and so on. By using the subspace method, the proposed method eliminates these problems and enables good recognition results of less standard speech patterns. We use DP matching in recognizing, because it is capable of more efficient pattern matching by normalizing the length of vowels. Simulation results show that the proposed method, using projection onto phonetic subspace with less speaker information, is superior to the conventional method using spectral envelopes, which is obtained by LPC, and DP matching. Projection onto phonetic subspace is a kind of feature vector that contains less speaker information.

关键词： Speech recognition Pattern matching Dynamic programming Robustness linear predictive coding Pattern recognition Principal component analysis Speech analysis Uncertainty Euclidean distance

来源：评论

学校读者我要写书评

暂无评论

/sup 6/Li doped glass scintillators for ultra cold neutrons detection

/sup 6/Li doped glass scintillators for ultra cold neutrons ...

引用

IEEE Symposium on Nuclear Science (NSS/MIC)

作者： G. Ban X. Flechard M. Labalme T. Lefort E. Lienard O. Naviliat P. Fierlinger K. Kirch P. Geltenbort K. Bodek LPC Caen Caen France Paul Scherrer Institut Switzerland Institut Laue-Langevin Grenoble France Jagiellonian University Cracow Poland

We report the results of test measurements aimed at determining the performances of /sup 6/Li doped glass scintillators for the detection of ultra-cold neutrons. Three types of scintillators, GS1, GS10 and GS20, which differ by their /sup 6/Li concentrations, have been tested. The signal to background separation is fully acceptable. The relative detection efficiencies have been determined as a function of the neutron velocity. We find that GS10 has a higher efficiency than the others for the detection of neutrons with velocities below 6 m/s (i.e. energies smaller than 250 neV). Two pieces of scintillators have been irradiated with a high flux of cold neutrons to test the radiation hardness of the glasses. No reduction in the pulse height has been observed up to an absorbed dose of 10/sup 13/ n/cm/sup 3/.

关键词： Glass Neutrons Testing Detectors Performance evaluation linear predictive coding Robustness Physics Absorption Helium

来源：评论

学校读者我要写书评

暂无评论

Performance comparison of source controlled GSM AMR and SMV vocoders

Performance comparison of source controlled GSM AMR and SMV ...

引用

IEEE International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)

作者： J. Makinen P. Ojala H. Toukomaa Multimedia Technologies Laboratory Nokia Research Center Tampere Finland

The adaptive multi-rate (AMR) speech codec offers substantial improvement over previous GSM speech codecs in error robustness by adapting speech and channel coding depending on channel conditions. In GSM AMR, the trade-off between speech quality and average bit rate can be further improved by source signal based rate adaptation (SBRA). Together with fast power control, SBRA GSM AMR can be used as a variable rate codec, bringing reduced average bit rate and contributing to an increase in system capacity. SBRA GSM AMR was tested against a currently standardised SMV (selectable mode vocoder) variable rate speech codec. The paper also presents the general descriptions of both SBRA GSM AMR and SMV codecs.

关键词： GSM Vocoders Bit rate Speech codecs Benchmark testing linear predictive coding Laboratories Electronic mail Robustness Speech coding

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：