We describe the KELVIN system for extracting entities and relations from large text collections and its usc in the TAC Knowledge Base Population Cold Start task run by the U.S. National Institute of Standards and Tech...
详细信息
In this paper, we analyse the emotion of children's stories in sentence level by considering the context information. We demonstrate that the emotion of a sentence is not only dependent on its content, but also af...
详细信息
ISBN:
(纸本)9781479942503
In this paper, we analyse the emotion of children's stories in sentence level by considering the context information. We demonstrate that the emotion of a sentence is not only dependent on its content, but also affected by its neighbours in a story. A Hidden Markov Model (HMM) based method is proposed to model the emotion sequence and to detect whether a sentence is neutral or not. We show the important features for emotion detection by studying a children's story corpus. An empirical evaluation is conducted to investigate the efficiency of the model. The results demonstrate that the proposed method can achieve competitive performance with the state-of-the-art methods, and it is affected more slightly by the training set than traditional classification methods. Classifier fusion is applied to combine different methods to achieve better results.
We present a novel approach to learning phrasal inversion transduction grammars via Bayesian MAP (maximum a posteriori) or information-theoretic MDL (minimum description length) model optimization so as to incorporate...
详细信息
We present the first ever results showing that tuning a machine translation system against a semantic frame based objective function, MEANT, produces more robustly adequate translations than tuning against BLEU or TER...
详细信息
The linguistically transparent MEANT and UMEANT metrics are tunable, simple yet highly effective, fully automatic approximation to the human HMEANT MT evaluation metric which measures semantic frame similarity between...
详细信息
We propose using AdaBoost with decision stumps to implement multimodal music emotion classification (MEC) as a more appropriate alternative to the conventional SVMs. By modeling the presence or absence of salient phra...
详细信息
ISBN:
(纸本)9781479903573
We propose using AdaBoost with decision stumps to implement multimodal music emotion classification (MEC) as a more appropriate alternative to the conventional SVMs. By modeling the presence or absence of salient phrases in the lyric texts and seeking for proper thresholds for certain audio signal features, it exploits interdependencies between aspects from both modalities in the multimodal MEC system to make the final classification. It can especially prevent the "short text problem" in lyrics. Our accuracy reached an average of 78.19% for classifying 3766 unique songs into 14 emotion categories, with a statistically significant improvement over the audio-only and lyrics-only monomodal MEC systems. We also show that the proposed AdaBoost with decision stumps method performs statistically better on multimodal MEC than the well-known SVM classifier, which only has an average accuracy of 72.08%.
We propose an integrated framework for large vocabulary continuous mixed language speech recognition that handles the accent effect in the bilingual acoustic model and the inversion constraint well known to linguists ...
详细信息
ISBN:
(纸本)9781479903573
We propose an integrated framework for large vocabulary continuous mixed language speech recognition that handles the accent effect in the bilingual acoustic model and the inversion constraint well known to linguists in the language model. Our asymmetric acoustic model with phone set extension improves upon previous work by striking a balance between data and phonetic knowledge. Our language model improves upon previous work by (1) using the inversion constraint to predict code switching points in the mixed language and (2) integrating a code-switch prediction model, a translation model and a reconstruction model together. This integration means that our language model avoids the pitfall of propagated error that could arise from decoupling these steps. Finally, a WFST-based decoder integrates the acoustic models, code-switch language model and a monolingual language model in the matrix language all together. Our system reduces word error rate by 1.88% on a lecture speech corpus and by 2.43% on a lunch conversation corpus, with statistical significance, over the conventional bilingual acoustic model and interpolated language model.
Speech data and their accurate transcriptions are essential to acoustic models training of modern Automatic Speech Recognition (ASR) systems including ones applied to the parliament meeting speech transcription system...
详细信息
Speech data and their accurate transcriptions are essential to acoustic models training of modern Automatic Speech Recognition (ASR) systems including ones applied to the parliament meeting speech transcription system. In such a task, the amount of speech data is in abundance but there are discrepancies between what were actually uttered, or word-for-word transcription, and their corresponding texts in the official meeting reports. This work proposes a method for automatically detecting locations of such discrepancies. Rules derived from a handbook for Thai parliament stenographer and patterns of discrepancies were used to generate alternative hypotheses supplied, with texts obtained directly from the reports, to a forced-alignment procedure. The forced-alignment procedure selects the best hypothesis for each speech utterance. Experimental results show that 72.6% of syllabic discrepancies were correctly detected while interference to correctly transcribed syllables was kept minimal. Moreover, the proposed method shows a premise in increasing accuracy of word-for-word transcription and achieves 96.5% of accuracy.
In Thai, it is not easy to find transcriptions of Roman characters such as English words. This leads to problems in Thai text-to-speech when facing with English words among Thai writing. Automatic English to Thai Grap...
详细信息
In Thai, it is not easy to find transcriptions of Roman characters such as English words. This leads to problems in Thai text-to-speech when facing with English words among Thai writing. Automatic English to Thai Grapheme-to-Phoneme conversion is proposed to tackle this problem. This paper presents an application of English syllable features to tone prediction model which is capable of classifying ambiguous tones and a refinement of post-processing module for phone prediction model. Experimental results show that the syllable and word accuracy are 76.03% and 53.93% respectively, which outperform all previous models.
暂无评论