In this paper, we present a practical solution for identifying pornographic images based on multiple low-level image features and a support vector machine (SVM). First, the region of interest (ROI) is obtained from an or...
The support vector machine (SVM) is a machine learning algorithm based on statistical learning theory (SLT). It can solve problems characterized by nonlinearity, high dimensionality, small samples, and local m...
We propose a structure called a dependency forest for statistical machine translation. A dependency forest compactly represents multiple dependency trees. We develop new algorithms for extracting string-to-dependency rules and training dependency language models. Our forest-based string-to-dependency system obtains significant improvements ranging from 1.36 to 1.46 BLEU points over the tree-based baseline on the NIST 2004/2005/2006 Chinese-English test sets.
As tokenization is usually ambiguous for many natural languages such as Chinese and Korean, tokenization errors can introduce translation mistakes in translation systems that rely on 1-best tokenizations. While using lattices to offer more alternatives to translation systems has elegantly alleviated this problem, we take a further step and tokenize and translate jointly. Taking as input a sequence of atomic units that can be combined to form words in different ways, our joint decoder produces a tokenization on the source side and a translation on the target side simultaneously. By integrating tokenization and translation features in a discriminative framework, our joint decoder significantly outperforms the baseline translation systems using 1-best tokenizations and lattices on both Chinese-English and Korean-Chinese tasks. Interestingly, as a tokenizer, our joint decoder achieves significant improvements over monolingual Chinese tokenizers.
Fast stereoscopic video encoding has become a highly desired technique because stereoscopic video is now feasible for applications such as TV broadcasting and consumer electronics. The stereoscopic video has high in...
This paper tries to fill the gap between Traditional Chinese Pulse Diagnosis (TCPD) and Doppler diagnosis by applying digital signal analysis and pattern classification techniques to wrist radial arterial Doppler bloo...
Minimum Error Rate Training (MERT), an effective parameter learning algorithm, is widely applied in machine translation and system combination. However, there exists an ambiguity problem with respect to the train...
A slideshow is a popular way to display a digital album automatically. However, it is time-consuming and boring when the album is large. Actually, while browsing an album manually, an audience may spend different length...
Reducing the number of relevance judgments is an important issue in retrieval evaluation. In this paper, we propose a novel method using global statistics to rank retrieval systems without relevance judgments. In...
In this paper, we present a novel method to detect violent scenes in movies. The detection process involves two views: audio and video. For the audio view, a supervised method based on HCRFs is exploited to imp...