This paper presents methods for prominence classification in conversational speech. Most existing tools rely on prosodic features extracted at syllable- or phone-level, performing well on read speech. This is not the ...
详细信息
Belief propagation (BP) is an effective approximate inference method but lacks theoretical guarantees for loopy graphs. We discuss the optimization landscape and the message dynamics and how this helps to understand t...
详细信息
Generalized cross-correlation is considered as the most straightforward time delay estimation *** on various weighting function,different methods were derived and a straightforward method,named phase transform(PHAT)ha...
详细信息
Generalized cross-correlation is considered as the most straightforward time delay estimation *** on various weighting function,different methods were derived and a straightforward method,named phase transform(PHAT)has been widely *** is well-known for its robustness to reverberation and its sensitivity to noise,which is partly due to the fact that PHAT distributes same weights to the frequencies dominated by signal or *** alleviate this problem,two weighting functions are proposed in this *** taking a posteriori signal-to-noise ratio(SNR)into account to classify reliable and unreliable frequencies,different weights could be *** first proposed weighting function borrows the idea of binary mask and distributes same weights to frequencies in same set,whereas,the second one assigns weights based on coherence *** showed the robustness of proposed methods to reverberation and noise for improving the performance of time delay estimation through various criteria.
Algorithms for mutual interference mitigation and object parameter estimation are a key enabler for automotive applications of frequency-modulated continuous wave (FMCW) radar. In this paper, we introduce a signal sep...
详细信息
Recent progress in Single Channel Source Separation (SCSS) using deep neural networks led to impressive performance gains while also increasing the model sizes, requiring tremendous data resources. This demand is cove...
详细信息
Vectorized language embeddings of raw audio data improve tasks like language recognition, automatic speech recognition, and machine translation. Although embeddings exhibit high effectiveness in their respective tasks...
详细信息
The deep learning (DL) based direction-of-arrival (DOA) estimation is one of the research hotspots, and many methods have been proposed recently. However, most of those methods will face serious performance degradatio...
详细信息
In this work, we investigate causal learning of independent causal mechanisms (ICMs) from a Bayesian perspective. Confirming previous claims from the literature, we show in a didactically accessible manner that unlabe...
详细信息
To enhance the perceptual quality of speechsignal, the Packet Loss Concealment (PLC) technique focuses on recovering the lost speech caused by network latency and jitter. In practical applications, the PLC methods ty...
详细信息
Due to the network constrain, the packet loss is inevitable in the real-Time speech communication. The packet loss often leads to the short interruption of the voice communication, which seriously affects the quality ...
详细信息
暂无评论