检索结果-内蒙古大学图书馆

A NEW ARTIFICIAL SPEECH SIGNAL FOR OBJECTIVE QUALITY EVALUATION OF SPEECH coding SYSTEMS

IEEE TRANSACTIONS ON COMMUNICATIONS 1994年第2-4期42卷 664-672页

作者： ITOH, K KITAWAKI, N IRII, H NAGABUCHI, H NIPPON TELEGRAPH & TEL PUBL CORP TELECOMMUN NETWORKS LABS TOKYO 190 JAPAN

This paper describes a new artificial speech signal (ASVQ: Artificial Speech by Vector Quantization technique) which reflects the average characteristics of the human voice. The ASVQ is intended for use as a test signal in the objective evaluation of speech coding system quality. To obtain the average characteristics, a very large speech data base is analyzed. The ASVQ generation method which reflects the extracted average characteristics of the human voice is formulated. This method applies vector quantizing analysis to the speech data base. The LPC speech synthesis circuit is used to reproduce the average characteristics. Finally, the new artificial speech signal is compared with a human voice and the estimation accuracy of the subjective quality of speech coding systems and nonlinear distortions is evaluated.

关键词： Speech analysis Human voice Speech coding Vector quantization System testing Data analysis Character generation Data mining linear predictive coding Speech synthesis

来源：评论

学校读者我要写书评

暂无评论

Algebraic vector quantization of LSF parameters with low storage and computational complexity

引用

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING 1996年第3期4卷 234-239页

作者： Xie, MJ Adoul, JP Department of Electrical and Computer Engineering University of Sherbrooke Sherbrooke Canada

This correspondence presents quantization scheme for encoding line spectral parameters used in linear predictive coding (LPC) of speech. The scheme is based on low-dimensionality regular-point lattices. The algebraic codebook need not be stored, and the optimum codevector is found through simple rounding of the input vector. Thus, the scheme results in significant savings of memory and reduced computational complexity when compared to traditional vector-quantizer solutions. The quantizer achieves an average spectral distortion of about 1 dB at 28 b/frame for the telephone bandwidth.

关键词： Vector quantization Computational complexity Hidden Markov models linear predictive coding Speech recognition Signal processing algorithms Frequency Speech coding Filters Parameter estimation

来源：评论

学校读者我要写书评

暂无评论

ENHANCEMENT OF ADPCM SPEECH coding WITH BACKWARD-ADAPTIVE ALGORITHMS FOR POSTFILTERING AND NOISE FEEDBACK

引用

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS 1988年第2期6卷 364-382页

作者： RAMAMOORTHY, V JAYANT, NS COX, RV SONDHI, MM AT and T Bell Laboratories Inc. Murray Hill NJ USA

It is shown that postfiltering circuits based on higher order LPC (linear predictive coding) models can provide very low distortion in terms of special tilt. Thus, they can provide better speech enhancement than circuits based on the backward-adaptive pole-zero predictor in ADPCM (adaptive digital pulse code modulation). Quantitative criteria for designing postfiltering circuits based on higher-order LPC models are discussed. These postfilters are particularly attractive for systems where high-order LPC analysis is an integral part of the coding algorithm. In a subjective test that used a computer-simulated version of these circuits, enhanced ADPCM obtained a mean opinion score of 3.6 at 16 kb/s.< >

关键词： Speech coding linear predictive coding Pulse modulation Circuit testing predictive models Speech enhancement Pulse circuits Digital modulation Modulation coding Algorithm design and analysis

来源：评论

学校读者我要写书评

暂无评论

A standard set of American-English voiced stop-consonant stimuli from morphed natural speech

引用

SPEECH COMMUNICATION 2011年第6期53卷 877-888页

作者： Stephens, Joseph D. W. Holt, Lori L. Carnegie Mellon Univ Dept Psychol Pittsburgh PA 15213 USA Carnegie Mellon Univ Ctr Neural Basis Cognit Pittsburgh PA 15213 USA

linear predictive coding (LPC) analysis was used to create morphed natural tokens of English voiced stop consonants ranging from /b/ to /d/ and /d/ to /g/ in four vowel contexts (/i/, /ae/, /a/, /u/). Both vowel consonant vowel (VCV) and consonant vowel (CV) stimuli were created. A total of 320 natural-sounding acoustic speech stimuli were created, comprising 16 stimulus series. A behavioral experiment demonstrated that the stimuli varied perceptually from /b/ to /d/ to /g/, and provided useful reference data for the ambiguity of each token. Acoustic analyses indicated that the stimuli compared favorably to standard characteristics of naturally-produced consonants, and that the LPC morphing procedure successfully modulated multiple acoustic parameters associated with place of articulation. The entire set of stimuli is freely available on the Internet (http://***/similar to lholt/php/***) for use in research applications. (C) 2011 Elsevier B.V. All rights reserved.

关键词： Speech stimuli Consonants linear predictive coding

来源：评论

学校读者我要写书评

暂无评论

AN EFFICIENT TECHNIQUE TO IMPROVE NORA CMOS TESTING

引用

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS 1987年第12期34卷 1609-1611页

作者： NAM, L BAYOUMI, MA Center of Advanced Computer Studies University of Southwestern Louisiana Lafayette LA USA

This paper presents a novel circuit technique to improve the testability of NORA (NO RAce) CMOS circuits. It is based on the structure, properties and operations of NORA CMOS. The precharge and evaluation properties of NORA CMOS enable one to design simple testing circuit for output stuck-at-zero, stuck-at-one, stuck-open and stuck-on faults. Area and time considerations, as well as the applications of this testability enhancement technique are also discussed.

关键词： Circuit testing Lattices linear predictive coding Finite impulse response filter System testing Clocks Equations CMOS technology Digital filters

来源：评论

学校读者我要写书评

暂无评论

New studies on adaptive predictive coding of images using multiplicative autoregressive models

New studies on adaptive predictive coding of images using mu...

引用

IEEE Region 10 International Conference TENCON

作者： M. Das N.K. Loh Department of Electrical and Systems Engineering Oakland University Rochester MI USA

Adaptive predictive coding of digitized images using multiplicative autoregressive (MAR) models is discussed. Three MAR models, designated as nonsymmetric half plane (NSHP) (3*3), quarter plane (QP) (2*3), and NSHP (2*3), are studied in detail. Results demonstrate that both NSHP (3*3) and QP (2*3) are very effective for coding and transmission of such images at bit rates less than one bit per pixel. Comparison with a 2-D model that has a quarter plane 2*2 region of support indicates that the performance of NSHP (3*3) and QP (2*3) either exceeds or matches that of the former. The proposed scheme has the following advantages. First, the signal-to noise ratio and the bit rate attainable with this method are comparable to those of two-dimensional (2-D) predictive techniques. Second, unlike the 2-D schemes, the stability of the predictive coder is easily guaranteed.< >

关键词： predictive coding predictive models Image coding Two dimensional displays Stability linear predictive coding Polynomials Image reconstruction Systems engineering and theory Signal to noise ratio

来源：评论

学校读者我要写书评

暂无评论

A SYSTEMATIC-APPROACH TO THE EXTRACTION OF DIPHTHONG ELEMENTS FROM NATURAL SPEECH

引用

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING 1986年第2期34卷 264-271页

作者： KAESLIN, H Institute of Electronics Swiss Federal Institute of Technology Zurich Switzerland

Synthetic speech can be generated with an unrestricted vocabulary by concatenating stored units such as diphone elements. When joining speech segments that were not adjacent in the original context they were taken from, discontinuities in the spectral envelope may arise that impair intelligibility. The method proposed here attempts to find optimum diphone boundaries in order to minimize these discontinuities, Steady-state zones of all phones carrying a diphone boundary are specified by means of a centroid vector. Based on the centroids and on an objective distance measure, hypothetical boundary cost functions are defined. Their minimization together with the evaluation of a set of additional rules determines the boundary locations. A rhyme test carried out with speech generated by concatenating diphone elements extracted according to this method yielded an intelligibility score of 96.7 percent for isolated words.

关键词： Natural languages Steady-state Speech synthesis Vocabulary linear predictive coding Cost function Acoustic testing Speech processing Interpolation Stability

来源：评论

学校读者我要写书评

暂无评论

Validation of Computer Models for Evaluating the Efficacy of Cognitive Stimulation Therapy

引用

WIRELESS PERSONAL COMMUNICATIONS 2017年第3期94卷 301-314页

作者： Pham, Tuan D. Univ Aizu Res Ctr Adv Informat Sci & Technol Aizu Res Cluster Med Engn & Informat Aizu Wakamatsu Fukushima 9658580 Japan

The notion of using computational methods for evaluating cognitive stimulation therapy (CST) based on the synchronized recording of photoplethysmographic (PPG) signals of care-givers and participants offers an objective and cost-effective analysis in health care to improve the patient's quality of life. While computer models are promising as a useful tool for such a purpose, a question of interest is how the model reliability, which is the degree to which an assessment tool produces stable and consistent results, can be established. This paper addresses this issue with the application of dynamic-time warping and resampling to measure the performance of two PPG features known as the largest Lyapunov exponent and linear predictive coding, which have been applied for studying the efficacy of CST. The potential success of this computerized evaluation can be a precursor to the development of a personalized e-therapy system that operates on mobile devices.

关键词： Cognitive stimulation therapy Cognitive decline Model performance assessment Therapeutic communication Dynamic-time warping Photoplethysmograph Largest Lyapunov exponent linear predictive coding

来源：评论

学校读者我要写书评

暂无评论

Unified pulse-replacement search algorithms for algebra codebooks of speech coders

引用

IET SIGNAL PROCESSING 2010年第6期4卷 658-665页

作者： Chen, F-K Chen, G-M Su, B-K Tsai, Y-R So Taiwan Univ Dept Comp Sci & Informat Engn Tainan Taiwan

The algebraic code excited linear prediction (ACELP) algorithm, because of low complexity and high quality in its analysis-by-synthesis optimisation, has been adopted by many speech coding standards. This study proposes the unified generalised pulse replacement (UPR) search algorithm for the stochastic codebook of ACELP speech coders. The proposed UPR algorithm discusses the search breadth, the order of the search direction and the update frequency based on the pulse replacement method. In addition, there are many derivative types of UPR algorithms discussed. The proposed approaches can achieve the lowest computational complexity with imperceptible degradation of the speech quality. Furthermore, the normalised degradation ratio based on the standard subjective quality measurement is proposed to fairly compare the performance. The experimental results will verify the claims.

关键词： algebraic code excited linear prediction algorithm linear predictive coding speech coders vocoders Codecs, coders and decoders unified pulse-replacement search algorithms algebra codebooks stochastic codebook algebraic codes Codes

来源：评论

学校读者我要写书评

暂无评论

OBJECTIVE QUALITY EVALUATION FOR LOW-BIT-RATE SPEECH coding SYSTEMS

引用

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS 1988年第2期6卷 242-248页

作者： KITAWAKI, N NAGABUCHI, H ITOH, K NIPPON TELEGRAPH & TEL PUBL CORP HUMAN INTERFACE LABS TOKYO 100 JAPAN

An LPC (linear predictive coding) cepstrum distance measure (CD) is introduced as an objective measure for estimating the subjective quality of speech signals. Good correspondence between LPC CD and the subjective quality, expressed in terms of both opinion equivalent Q and mean opinion score, are shown. Good repeatability of objective quality evaluation using LPC CD is also shown. A method for generating an artificial voice signal that reflects the characteristics of real speech signals is described. The LPC CD values calculated using this artificial voice are almost the same as those calculated using real speech signals. The speaker-dependency of the coded-speech quality is shown to be an important factor in low-bit-rate speech coding. Even taking this factor into consideration, LPC CD is shown to be effective for estimating the subjective quality.< >

关键词： Speech coding Distortion measurement Frequency measurement linear predictive coding Nonlinear distortion Cepstrum Speech analysis Time measurement Speech codecs Character generation

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：