版权所有:内蒙古大学图书馆 技术提供:维普资讯• 智图
内蒙古自治区呼和浩特市赛罕区大学西街235号 邮编: 010021
作者机构:Laboratory for Artificial Perception Faculty of Electrical and Computer Engineering Tržaška 25 Ljubljana Slovenia
出 版 物:《PATTERN RECOGNITION LETTERS》 (模式识别快报)
年 卷 期:1992年第13卷第12期
页 面:879-891页
核心收录:
学科分类:08[工学] 0812[工学-计算机科学与技术(可授工学、理学学位)]
主 题:SPEECH RECOGNITION PHONE COMPONENTS RECOGNITION FEATURE EXTRACTION AND SELECTION CLASSIFICATION QUADRATIC DISCRIMINANT FUNCTIONS SUBSPACE METHOD HIDDEN MARKOV MODELS KOHONEN SELF-ORGANIZING MAP
摘 要:In this paper the comparison of performances of different feature representations of the speech signal and comparison of classification procedures for Slovene phoneme recognition are presented. Recognition results are obtained on the database of continuous Slovene speech consisting of short Slovene sentences spoken by female speakers. MEL-cepstrum and LPC-cepstrum features combined with the normalized frame loudness were found to be the most suitable feature representations for Slovene speech. It was found that determination of MEL-cepstrum using linear spacing of bandpass filters gave significantly better results for speaker dependent recognition. Comparison of classification procedures favours the Bayes classification assuming normal distribution of the feature vectors (BNF) to the classification based on quadratic discriminant functions (DF) for minimum mean-square error and subspace method (SM), which does not confirm the results obtained in some previous studies for German and Finn speech. Additionally, classification procedures based on hidden Markov models (HMM) and the Kohonen Self-Organizing Map (KSOM) were tested on a smaller amount of speech data (1 speaker only). Classification results are comparable with classification using BNF.