Search Results - Inner Mongolia University Library

International Symposium on Chinese Spoken Language Processing

Authors: Changchun Bao, J. Lukaside, C. Ritz. Speech and Audio Signal Processing Lab, Beijing University of Technology, Beijing, China; Whisper Lab, TITR, University of Wollongong, NSW, Australia

ISBN: (print) 0780386787

The paper presents a high quality harmonic excitation linear predictive (HE-LPC) speech coder operating at 2 kb/s based on a harmonic excitation model with two bands. The system incorporates novel features such as: combined pitch detection; residual harmonic matching voicing determination; extraction and interpolation of residual harmonic magnitudes. Subjective listening tests indicate that this coder has the same quality as that of the Federal Standard MELP (mixed excitation linear prediction) coder at 2.4 kb/s, whether the training database is from Chinese or English.

Keywords: Speech coding; Delay estimation; Speech analysis; Speech processing; Interpolation; Linear predictive coding; Frequency domain analysis; Vocoders; Quantization; Performance analysis

On-line signature writer identification

AEJ - Alexandria Engineering Journal, 2007, Vol. 46, No. 4, pp. 509-518

Author: Emam, Ashraf M. Information Technology Dept., Institute of Graduate Studies and Research, University of Alexandria, Alexandria, Egypt

This paper presents a set of descriptors for on-line signature writer identification. These descriptors are intended for use in e-business and e-government to detect signature forgery where it is hard to identify the writer across the Internet. Some descriptors represent global signature features, while the rest are dynamic signature features derived from the pen's linear speed. The forged and genuine signatures used in this work look identical in form. Cepstral descriptors gave the higher rejection rate, 98% for all forged signatures, with 100% acceptance of genuine signatures. Linear predictive descriptors derived from de-noised signature data also delivered significant results, with a rejection rate of 95%. © Faculty of Engineering, Alexandria University.

Keywords: Biometric identification; Cepstral descriptors; Linear predictive coding; On-line signature verification; On-line signature writer identification

Top Downloads in IEEE Xplore [Reader's Choice]

IEEE Signal Processing Magazine, 2007, Vol. 24, No. 3, pp. 8-10

Spectral Dynamics as a Source of Discontinuity in Concatenative Speech Synthesis

International Conference on Digital Signal Processing (DSP)

Authors: Barry Kirkpatrick, Darragh O'Brien, Ronan Scaife, Andrew Errity. RINCE, Faculty of Engineering and Computing, University of Dublin, Dublin, Ireland

The quality of concatenative speech synthesis depends on the cost function employed for unit selection. Effective cost functions for spectral continuity have proven difficult to define and standard measures do not accurately reflect human perception of spectral discontinuity in concatenated speech. Previous studies on spectral join costs have focused predominantly on static spectral measures extracted from the unit boundary. In this paper spectral dynamic behaviour is investigated as a source of discontinuity in concatenated speech. A number of measures representing spectral dynamics are tested for the task of detecting discontinuities. The spectral dynamic measures tested contain information correlating with human perception of discontinuities, suggesting that spectral dynamics are a source of discontinuity in concatenated speech. A strategy to effectively combine dynamic and static measures is proposed using principal component analysis (PCA).

Keywords: Speech synthesis; Cost function; Testing; Humans; Concatenated codes; Databases; Principal component analysis; Character generation; Linear predictive coding; Probability density function

Vector Quantization-Block Constrained Trellis Coded Quantization of Speech Line Spectral Frequencies

IEEE Workshop on Signal Processing Systems (SIPS)

Authors: Jungeun Park, Yanghee Won, Sangwon Kang. School of Elc. Engineering and Computer Science, Hanyang University, Ansan, South Korea

In this paper, a vector quantization-block constrained trellis coded quantization (VQ-BCTCQ) is presented to quantize line spectrum frequency (LSF) parameters of the wideband speech codec. Both the predictive structure and safety-net concept are combined into VQ-BCTCQ to develop the predictive VQ-BCTCQ. The performance of this quantization is compared with that of the linear predictive coding (LPC) vector quantizer used in the AMR-WB codec, and reductions in spectral distortion (SD) and encoding complexity are demonstrated.

Keywords: Frequency; Encoding; Speech coding; Linear predictive coding; Vector quantization; Wideband; Rate-distortion; Costs; Algorithm design and analysis; Convolutional codes

Enhanced Multichannel Audio Resynthesis Through Residual Processing and Features Alignment

IEEE International Conference on Multimedia and Expo (ICME)

Authors: Demetrios Cantzos, Athanasios Mouchtaris, Chris Kyriakakis. Integrated Media Systems Center (IMSC), University of Southern California, Los Angeles, CA, USA; Institute of Computer Science (ICS-FORTH), Foundation for Research and Technology Hellas, Crete, Greece

Multichannel audio refers to a widespread technology that enables audio rendering through multiple channels. Audio reproduction with multiple channels has the advantage of recreating the acoustic scene with unprecedented fidelity and of immersing the listener in an acoustic environment that is virtually indistinguishable from reality. However, one of the greatest challenges of multichannel audio is its high storage and transmission requirements, especially since accurate rendering through as many channels as possible is the main purpose. Audio resynthesis addresses this issue by enabling us to recreate a set of channels at the receiver end by transmitting only one source channel. We propose a new, enhanced approach to multichannel audio resynthesis which involves a novel residual processing technique and a features alignment method that significantly increase the resynthesis accuracy. Our results show that this latest method leads to higher audio quality and allows for the robust treatment of any type of multichannel signal set.

Keywords: Audio recording; Cepstral analysis; Linear predictive coding; Layout; Microphones; Computer science; Robustness; Internet; Feature extraction; Parameter estimation

Computationally efficient optimum weighting function for vector quantization of LSF parameters

International Symposium on Signal Processing and Its Applications (ISSPA)

Authors: Saikat Chatterjee, T.V. Sreenivas. Department of Electrical Communication Engineering, Indian Institute of Science, Bangalore, India

We propose a new weighting function which is computationally simple and an approximation to the theoretically derived optimum weighting function shown in the literature. The proposed weighting function is perceptually motivated and provides improved vector quantization performance compared to several weighting functions proposed so far, for line spectrum frequency (LSF) parameter quantization of both clean and noisy speech data.

Keywords: Vector quantization; Linear predictive coding; Distortion measurement; Frequency; Speech; Nonlinear filters; Humans; Ear; Euclidean distance; Degradation
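
To illustrate what a weighting function does in LSF vector quantization, here is a minimal sketch of a weighted Euclidean codebook search. The abstract does not give the proposed weighting function, so the sketch substitutes a common inverse-distance heuristic (closely spaced LSFs, which tend to mark spectral peaks, receive larger weights). The function names, the (0, pi) frequency range, and the toy codebook are all assumptions made for illustration.

```python
import math


def inverse_distance_weights(lsf):
    """Stand-in weighting (NOT the paper's function): each LSF is weighted by
    the inverse distances to its two neighbours, with 0 and pi as end points."""
    padded = [0.0] + list(lsf) + [math.pi]
    return [
        1.0 / (padded[i] - padded[i - 1]) + 1.0 / (padded[i + 1] - padded[i])
        for i in range(1, len(padded) - 1)
    ]


def weighted_distance(lsf, codeword, weights):
    """Weighted squared Euclidean distance between an LSF vector and a codeword."""
    return sum(w * (x - y) ** 2 for w, x, y in zip(weights, lsf, codeword))


def nearest_codeword(lsf, codebook, weights=None):
    """Index of the codeword with the smallest weighted distance.
    If no weights are given, they are derived from the input LSF vector."""
    if weights is None:
        weights = inverse_distance_weights(lsf)
    return min(
        range(len(codebook)),
        key=lambda i: weighted_distance(lsf, codebook[i], weights),
    )


# Hypothetical 3-dimensional example: LSFs in radians, strictly increasing.
lsf = [0.30, 0.35, 1.50]
codebook = [
    [0.30, 0.35, 1.70],  # larger error, but on an isolated LSF
    [0.30, 0.45, 1.50],  # smaller error, but on a closely spaced LSF pair
]
weighted_choice = nearest_codeword(lsf, codebook)
unweighted_choice = nearest_codeword(lsf, codebook, weights=[1.0, 1.0, 1.0])
```

With this toy data the unweighted search picks the second codeword, while the weighted search picks the first, because the error on the closely spaced pair is penalized more heavily.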

A Preliminary Study on Vocal Tract System of Chinese Whispered Vowels

The Second International Conference on Bio-Inspired Computing: Theories and Applications

Authors: Chenghui Gong, Heming Zhao, Jianxin Liu, Gang Lu. Institute of Physics, Soochow University, 215006 Suzhou, China; Institute of Electronics and Information Engineering, Soochow University, 215021 Suzhou, China

This paper concentrates on extracting parameters from the vocal tract transfer function of Chinese whispered vowels. As there is no fundamental frequency in whispered speech, these parameters become more prominent in speech analysis and synthesis. The proposed algorithm for formant estimation is shown to be effective, and the gain of the vocal tract transfer function can be utilized for tone analysis. The comparison of these parameters between Chinese whispered vowels and voiced ones is the basis for whisper recognition and conversion. The ratios of formant excursion, bandwidth movement, gain and energy variation are calculated as scalar weight coefficients for voice personality transformation.

Keywords: Transfer functions; Lungs; Frequency; Speech recognition; Linear predictive coding; Educational institutions; Speech synthesis; Speech analysis; Bandwidth; Poles and zeros

Chip Design of LPC-cepstrum for Speech Recognition

International Conference on Computer and Information Science (ACIS)

Authors: Gin-Der Wu, Zhen-Wei Zhu. Department of Electrical Engineering, National Chi Nan University, Puli, Taiwan

This paper proposes an ASIC for LPC-cepstrum (LPCC) computation in speech recognition. The proposed LPCC ASIC reduces the calculation load on the processor in a speech recognition system. In addition, a resource sharing method is adopted in the design to reduce the chip size. The design therefore emphasizes a high-performance, low-cost solution rather than sophistication. Finally, experiments compare the design with other DSP and ASIC designs and show that the proposed LPCC ASIC efficiently reduces the computation load.

Keywords: Chip scale packaging; Speech recognition; Application specific integrated circuits; Linear predictive coding; Digital signal processing chips; Autocorrelation; Resource management; Costs; Hardware; Computer architecture
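
For readers unfamiliar with LPCC, the computation such a chip offloads can be sketched in software as three textbook steps: autocorrelation of a speech frame, the Levinson-Durbin recursion for the LPC coefficients, and the standard LPC-to-cepstrum recursion. This is a minimal sketch and not the ASIC design from the paper; the function names are hypothetical, and the predictor convention x_hat[n] = sum_k a_k x[n-k] is an assumption.

```python
def autocorrelation(frame, order):
    """Autocorrelation r[0..order] of one (already windowed) frame."""
    n = len(frame)
    return [
        sum(frame[i] * frame[i + lag] for i in range(n - lag))
        for lag in range(order + 1)
    ]


def levinson_durbin(r, order):
    """Solve the LPC normal equations by the Levinson-Durbin recursion.
    Returns (a, err) where a[k-1] is the predictor coefficient a_k and
    err is the final prediction error energy."""
    a = [0.0] * (order + 1)  # a[0] is unused padding
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:  # degenerate frame (e.g. silence): stop early
            break
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / err  # reflection coefficient for stage i
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= 1.0 - k * k
    return a[1:], err


def lpc_to_cepstrum(a, n_ceps):
    """Cepstral coefficients c_1..c_n_ceps of the all-pole model
    1 / (1 - sum_k a_k z^-k), using the recursion
    c_n = a_n + sum_{k=1}^{n-1} (k/n) c_k a_{n-k}  (a_n = 0 for n > p)."""
    p = len(a)
    c = [0.0] * (n_ceps + 1)  # c[0] (gain term) is not computed here
    for n in range(1, n_ceps + 1):
        acc = a[n - 1] if n <= p else 0.0
        for k in range(max(1, n - p), n):
            acc += (k / n) * c[k] * a[n - k - 1]
        c[n] = acc
    return c[1:]
```

A quick sanity check is the single-pole model with a_1 = 0.5, for which the cepstrum has the closed form c_n = 0.5^n / n; `lpc_to_cepstrum([0.5], 3)` should reproduce those values up to rounding.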

A Kalman Smoothing Algorithm for Speech Enhancement Based on the Properties of Vocal Tract Varying Slowly

ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing (SNPD)

Authors: Hui Li, Xin Wang, Bei-qian Dai, Wei Lu. MOE-Microsoft Key Laboratory of Multimedia Computing and Communication, University of Science and Technology China

In speech enhancement based on the Kalman smoother, the linear prediction coefficients obtained from noisy speech have an important impact on the quality of the enhanced speech. Exploiting the slowly varying nature of the vocal tract, this paper proposes a novel Kalman smoothing algorithm for speech enhancement based on a vocal tract parameter smoother. First, the linear prediction coefficients are converted into line spectrum frequency parameters. Then, these parameters are smoothed across adjacent frames before they are transformed into the state transition matrix. Experimental results indicate that the proposed algorithm can suppress sudden changes in residual noise energy and improve the quality of the enhanced speech. Quality is evaluated by means of segmental SNR and ITU-PESQ scores, and the proposed algorithm achieves obvious improvements compared with the conventional Wiener filter.

Keywords: Kalman filters; Smoothing methods; Speech enhancement; Signal processing algorithms; Noise shaping; Wiener filter; Linear predictive coding; Degradation; Signal processing; Additive noise
