This paper proposes a word-by-word model selection approach to domain adaptation for Word Sense Disambiguation. By this approach, the model for a target word is automatically selected from a candidate model set, which...
详细信息
This paper proposes a word-by-word model selection approach to domain adaptation for Word Sense Disambiguation. By this approach, the model for a target word is automatically selected from a candidate model set, which is comprised of improved self-training models and a supervised model. The improved self-training uses sense priors to prevent its iteration from converging into undesirable states. Experimental results on a domain-specific corpus show that: (1) our improved self-training model is effective for the words which have target domain linked senses; (2) the selected models obtain higher accuracies than each single model and effectively improve the performance compared to the state-of-the-art supervised model.
Neurobiological research has uncovered the existence of cortical neurons in various animal species tuned to particular spectro-temporal modulations (STM) in the auditory stimulus. Other findings indicate that temporal...
详细信息
Neurobiological research has uncovered the existence of cortical neurons in various animal species tuned to particular spectro-temporal modulations (STM) in the auditory stimulus. Other findings indicate that temporal statistics of the resulting neural spike trains may encode the underlying content of species-specific communication calls. With this motivation, we present an alternative approach to speech recognition based on point process statistical models of the local maxima events produced by a cortically-inspired spectro-temporal filter bank. We demonstrate the computational adequacy of this approach on the practical task of keyword spotting.
Many statistical translation models can be regarded as weighted logical deduction. Under this paradigm, we use weights from the expectation semiring (Eisner, 2002), to compute first-order statistics (e.g., the expecte...
详细信息
Hypergraphs are used in several syntax-inspired methods of machine translation to compactly encode exponentially many translation hypotheses. The hypotheses closest to given reference translations therefore cannot be ...
详细信息
Statistical models in machine translation exhibit spurious ambiguity. That is, the probability of an output string is split among many distinct derivations (e.g., trees or segmentations). In principle, the goodness of...
详细信息
Speaker segmentation is widely applied in many domains such as multi-peaker detection and speaker tracking. However, the performance of the conventional metric-based methods is neither good enough nor stable due to th...
Speaker segmentation is widely applied in many domains such as multi-peaker detection and speaker tracking. However, the performance of the conventional metric-based methods is neither good enough nor stable due to the stability of the between-window distance calculation. In order to enhance the stability and hence to improve the performance, a new method based on the between-window correlation over speakers' characteristics is proposed. In this method, a set of reference speaker models are trained which can represent the whole speaker model space. The between-window correlation of likelihood vectors of scores against these reference models is taken as the metric. The gender information and the Peak and Valley information are also used. Experiments over NIST SRE 2002 Segmentation BNEWS and SWBD Datasets show that better performance can be achieved compared with the BIC and the GLR methods. What's more, the proposed method can achieve approximately the best performance in a wider value range of predefined thresholds than the BIC and the GLR methods, which reduces the threshold sensitivity.
BanglaOCR is currently the only open source optical character recognition (OCR) software for the Bangla (Bengali) script developed by the center for Research on Bangla languageprocessing (CRBLP). Tesseract, maintaine...
详细信息
Generative statistical models with a very large number of parameters are frequently used in real-world data applications, such as large-vocabulary speech recognition (LVCSR). Complex models are needed in order to capt...
详细信息
ISBN:
(纸本)9781424439904
Generative statistical models with a very large number of parameters are frequently used in real-world data applications, such as large-vocabulary speech recognition (LVCSR). Complex models are needed in order to capture the ubiquitous variability in the observed signal, but data sparsity causes significant problems in their training. One way of dealing with data sparsity is to perform dimensionality reduction of the observed features, with the goal of reducing the model parameter space without sacrificing performance. When the data are Gaussian distributed, the dimensionality reduction can be done efficiently using the maximum likelihood criterion; this leads to the heteroscedastic linear discriminant analysis (HLDA), which is a natural extension of linear discriminant analysis (LDA) to the case where the class-conditional Gaussians have unequal covariance matrices. A further extension of HLDA to multiple transforms (MLDA) can also be tackled efficiently. This paper presents the theory behind HLDA and MLDA, and demonstrates their performance with synthetic data.
BanglaOCR is currently the only open source optical character recognition (OCR) software for the Bangla (Bengali) script developed by the center for Research on Bangla languageprocessing (CRBLP). Tesseract, maintaine...
详细信息
BanglaOCR is currently the only open source optical character recognition (OCR) software for the Bangla (Bengali) script developed by the center for Research on Bangla languageprocessing (CRBLP). Tesseract, maintained by Google, is considered to be one of the most accurate free open source OCR engines currently available. In this paper, we present a new OCR for the Bangla/Bengali script that combines the recognition power of Tesseract and the Bangla script processing power of BanglaOCR by integrating the Tesseract recognition engine into BanglaOCR. We first present the complete methodology to build the combined OCR, followed by the implementation strategy. In this paper, we focus on the training data preparation process, Tesseract integration procedure and the post-processing techniques. The techniques described in this paper can be readily applied to build OCRs for other scripts as well.
Sentiment analysis often relies on a semantic orientation lexicon of positive and negative words. A number of approaches have been proposed for creating such lexicons, but they tend to be computationally expensive, an...
详细信息
暂无评论