ISBN: 9781617821233 (print)
This paper proposes using non-negative matrix factorization (NMF) based speech enhancement for robust automatic recognition of mixtures of speech and music. We represent the magnitude spectra of noisy speech signals as a non-negative weighted linear combination of speech and noise spectral basis vectors that are obtained from training corpora of speech and music. We use overcomplete dictionaries consisting of random exemplars of the training data. The method is tested on the Wall Street Journal large-vocabulary speech corpus, which is artificially corrupted with polyphonic music from the RWC music database. Various music styles and speech-to-music ratios are evaluated. The proposed methods are shown to produce a consistent, significant improvement in recognition performance compared with the baseline method. Audio demonstrations of the enhanced signals are available at http://***.f/-tuomasv.
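The decomposition described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function name `enhance_magnitude`, the KL-divergence multiplicative update, and the Wiener-style soft mask are all assumptions, since the abstract does not state which cost function or reconstruction rule is used. The dictionaries are held fixed (as exemplar dictionaries would be) and only the activations are estimated.

```python
import numpy as np

def enhance_magnitude(V, B_speech, B_noise, n_iter=100, eps=1e-12):
    """Hypothetical helper: estimate the speech part of a noisy spectrogram.

    V        : (freq, frames) non-negative magnitude spectrogram of the mixture
    B_speech : (freq, K_s) fixed speech dictionary (e.g. sampled exemplars)
    B_noise  : (freq, K_n) fixed noise/music dictionary
    """
    B = np.hstack([B_speech, B_noise])           # joint dictionary
    k_s = B_speech.shape[1]
    rng = np.random.default_rng(0)
    H = rng.random((B.shape[1], V.shape[1])) + eps   # non-negative activations
    col_sums = B.sum(axis=0)[:, None] + eps      # B^T 1, one value per atom

    # Assumed cost: generalized KL divergence, dictionary kept fixed.
    for _ in range(n_iter):
        ratio = V / (B @ H + eps)
        H *= (B.T @ ratio) / col_sums

    speech = B_speech @ H[:k_s]                  # speech part of the model
    noise = B_noise @ H[k_s:]                    # music/noise part of the model
    mask = speech / (speech + noise + eps)       # assumed Wiener-style soft mask
    return mask * V
```

With real data, `B_speech` and `B_noise` would hold columns drawn from training spectrograms of speech and music; here they can be any non-negative arrays with matching frequency dimension. The returned magnitude would then be combined with the mixture phase before resynthesis, a step omitted from this sketch.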
Authors: Carel, Léna; Alquier, Pierre
CREST, ENSAE, Université Paris Saclay, 3 avenue Pierre Larousse, 92245 Malakoff Cedex, France
TRANSDEV Group, 32 boulevard Gallieni, 92130 Issy-les-Moulineaux, France
We propose to use non-negative matrix factorization (NMF) to build a dictionary of travelers' temporal profiles. Clustering based on decomposition in this dictionary rather than on the full profiles (as in previous wor...
Discovering a discriminative feature representation together with a suitable distance measure is key to a successful speaker recognition system. In this paper, we propose a new approach for automatic speaker veri...
Non-negative matrix factorization (NMF) has received considerable attention in various areas for its psychological and physiological interpretation of naturally occurring data whose representation may be parts-based in the human brain. Despite its good practical performance, one shortcoming of the original NMF is that it ignores the intrinsic structure of the data set. On the one hand, samples might lie on a manifold, so geometric information could be exploited to improve NMF's performance. On the other hand, features might be correlated with each other, so the conventional L2 distance cannot measure the distance between samples well. Although some methods have been proposed to solve each of these problems, few address them together. In this paper, we propose a novel method that exploits knowledge of both the data manifold and feature correlation. We adopt an approximation of the Earth Mover's Distance (EMD) as the metric and add a graph regularization term based on EMD to NMF. Furthermore, we propose an efficient multiplicative iteration algorithm to solve it. Our empirical study shows encouraging results for the proposed algorithm compared with other NMF methods.
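To make the idea of a graph regularization term with multiplicative updates more concrete, here is a minimal sketch of ordinary graph-regularized NMF. It deliberately swaps in a simpler technique: the neighbor graph is built from plain Euclidean distances and the data term is the Frobenius norm, not the EMD approximation the abstract describes. The function names, the binary k-nearest-neighbor affinity, and the parameter `lam` are assumptions made for illustration only.

```python
import numpy as np

def knn_graph(X, n_neighbors=3):
    """Symmetric binary k-NN affinity over the columns (samples) of X.

    Assumption: Euclidean distance, not the EMD-based metric of the abstract.
    """
    n = X.shape[1]
    sq = np.sum(X ** 2, axis=0)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X.T @ X)
    np.fill_diagonal(d2, np.inf)                 # a sample is not its own neighbor
    idx = np.argsort(d2, axis=1)[:, :n_neighbors]
    W = np.zeros((n, n))
    W[np.repeat(np.arange(n), n_neighbors), idx.ravel()] = 1.0
    return np.maximum(W, W.T)                    # symmetrize

def graph_regularized_nmf(X, k, lam=1.0, n_neighbors=3, n_iter=200,
                          eps=1e-12, seed=0):
    """Approximate X (features x samples) by U @ V.T with U, V >= 0.

    Objective sketched here: ||X - U V^T||_F^2 + lam * tr(V^T (D - W) V).
    """
    rng = np.random.default_rng(seed)
    m, n = X.shape
    W = knn_graph(X, n_neighbors)
    D = np.diag(W.sum(axis=1))                   # degree matrix of the graph
    U = rng.random((m, k)) + eps
    V = rng.random((n, k)) + eps
    for _ in range(n_iter):
        # Multiplicative updates keep both factors non-negative.
        U *= (X @ V) / (U @ (V.T @ V) + eps)
        V *= (X.T @ U + lam * (W @ V)) / (V @ (U.T @ U) + lam * (D @ V) + eps)
    return U, V
```

Setting `lam=0` reduces the updates to the standard Euclidean NMF rules, which is a convenient way to see what the graph term contributes.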
In this paper, we propose a novel method called nonnegative matrix factorization based on Locally Linear Embedding (LLE-NMF). The idea is to factorize the nonnegative matrix considering the intrinsic geometric struc...
The nonnegative matrix decomposition algorithm is discussed, and its objective function based on Euclidean distance proposed by Lee & Seung is simplified. A decomposition factor is identified in the primitive ma...
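For reference, the Euclidean-distance objective of Lee & Seung that this abstract refers to is usually written as below, together with its multiplicative update rules. This is the standard form, not the simplified version the abstract mentions, which is not recoverable from the truncated text. The symbols $V$ (data matrix), $W$ and $H$ (non-negative factors) are notation chosen here; $\odot$ denotes elementwise multiplication and the fractions are elementwise.

```latex
\min_{W \ge 0,\; H \ge 0} \; \lVert V - WH \rVert_F^2
  = \sum_{i,j} \bigl( V_{ij} - (WH)_{ij} \bigr)^2 ,
\qquad
H \leftarrow H \odot \frac{W^{\top} V}{W^{\top} W H},
\qquad
W \leftarrow W \odot \frac{V H^{\top}}{W H H^{\top}} .
```

Under these updates the objective is non-increasing, and the factors stay non-negative provided they are initialized with non-negative values.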
Recently, a novel matrix factorization method, named non-negative matrix factorization (NMF), has attracted much attention in the field of signal processing. A matrix with non-negative elements can be decomposed into a product of...
Mid-infrared (wavelengths of 2-25μm) astronomy has progressed significantly in the last decades, thanks to space and ground based telescopes. Space observatories benefit from the absence of atmospheric absorption, al...
A non-negative matrix factorization approach to dimensionality reduction is proposed to aid classification of images. The original images can be stored as lower-dimensional columns of a matrix that hold degrees of bel...
Credit risk assessment of financial intermediaries is an essential problem in finance. The key is to find accurate predictors of individual risk in the credit portfolios of institutions. However, accessing credit risk...