Detection of speech in noisy recordings becomes a challenging problem when the noise does not follow the usual whiteness, stationarity and high signal-to-noise ratio assumptions. A robust speech detector can affect si...
详细信息
Detection of speech in noisy recordings becomes a challenging problem when the noise does not follow the usual whiteness, stationarity and high signal-to-noise ratio assumptions. A robust speech detector can affect significantly the performance of several speechprocessing tasks, such as endpoint detection, segmentation, and finally recognition, if we deal with real life data, as opposed to laboratory or controlled environment recordings. The detector proposed is based on a Gaussianity test that employs third-order cumulants of the data to decide on the binary hypotheses of noise only versus speech plus noise. speech intervals are detected by exploiting the third-order information present in the speech signal. The detector can handle a large family of additive noises, thanks to its third-order statistics basis. The sample-adaptive and decision feedback variations proposed, provide the detector with a tracking ability both with respect to the time variations of speech and the possible nonstationarity of noise. Experiments carried out using real data, recorded in a moving car interior, show satisfactory performance of the proposed algorithms down to -6 dB signal-to-noise ratio.< >
An overview is presented of results obtained in listeners with real and simulated impairments. Spectral processing is assessed by psychoacoustical tuning curves and growth-of-masking functions; temporal processing is ...
详细信息
An overview is presented of results obtained in listeners with real and simulated impairments. Spectral processing is assessed by psychoacoustical tuning curves and growth-of-masking functions; temporal processing is assessed by temporal integration and gap detection thresholds. When comparisons are made to normal listeners tested in the quiet, impaired listeners generally show deficits on all these tasks. When the comparisons are made to simulated impairments, some impaired listeners show no deficits, whereas others still show deficits. These results indicate that abnormal intensity perception is responsible for some, but not all, of impaired listeners' apparent deficits in spectral and temporal processing. In many cases, however, it appears that spectral and temporal processing, per se, are altered in the impaired auditory system.< >
A method for recognition of speech of the nonverbal is presented. Results of the application of the procedure to simple utterances by cerebral palsied individuals and implications for an overall communications device ...
详细信息
A method for recognition of speech of the nonverbal is presented. Results of the application of the procedure to simple utterances by cerebral palsied individuals and implications for an overall communications device are addressed.
暂无评论