This paper looks at the problem of searching for Indian language (IL) content on the Web. Even though the amount of IL content that is available on the Web is growing rapidly, searching through this content using the ...
详细信息
ISBN:
(纸本)9781605584164
This paper looks at the problem of searching for Indian language (IL) content on the Web. Even though the amount of IL content that is available on the Web is growing rapidly, searching through this content using the most popular websearch engines poses certain problems. Since the popular search engines do not use any stemming / orthographic normalization for Indian languages, recall levels for IL searches can be low. We provide some examples to indicate the extent of this problem, and suggest a simple and efficient solution to the problem. Copyright 2008 ACM.
Spelling error is broadly classified in two categories namely non word error and real word error. In this paper a localized real word error detection and correction method is proposed where the scores of bigrams gener...
详细信息
A few studies of online Bangla handwriting recognition such as isolated character recognition or limited vocabulary cursive word recognition are found in the literature. However, development of an end-to-end recogniti...
详细信息
Over the last decade or so, remarkable developments in computer technology have given a major impetus to research in the field of multimedia. With the proliferation of the Internet and the increasingly widespread use ...
详细信息
Struck-out words are often found in handwritten manuscripts. A realistic off-line handwriting recognition system should take care of this common aspect. A simple but efficient approach to this problem is to subject ea...
详细信息
This article presents our recent study on fusion of information at feature and classifier output levels for improved performance of offline handwritten Devanagari word recognition. We consider here two state-of-the-ar...
详细信息
This paper deals with an OCR error detection and correction technique for a highly inflectional language script like Bangla (a major Indian language). This is the first report of its kind. Using two separate lexicons ...
详细信息
Considering the vast collection of handwritten documents in various archives, research studies for their automatic processing have major impact in the society. Line segmentation from images of such documents is a cruc...
详细信息
In this paper we describe a texture segmentation approach without feature computation based on a multilayer perceptron network (MLP). Thus, the users need not bother about the selection and then computation of feature...
详细信息
With the increasing popularity of digital cameras attached with various handheld devices, many new computational challenges have gained significance. One such problem is extraction of texts from natural scene images c...
详细信息
暂无评论