Optical character recognition (OCR) systems perform poorly on documents such as old books, newspapers, photocopied (Xerox) materials, and faxed documents. Such documents are considered degraded documents. On...
ISBN: (print) 076951695X
This paper presents an approach to online handwriting recognition for Indian scripts. The primary concern of the approach is modeling human motor functionality during character writing. This is achieved by looking at the whole pen trajectory, where the time evolution of the pen coordinates plays a crucial role. A low-complexity classifier was designed, and the proposed similarity measure appears to be quite robust against wide variations in writing styles. Initially, the approach was applied to online recognition of handwritten characters in Devnagari and Bangla, the two major Indian scripts. A test on a dataset of considerable size shows promising recognition rates: 97.29% for Devnagari and 96.34% for Bangla.
Polysemy implies the presence of more than one sense of a particular word in both its context-bound and context-free situations. The inherent aspect of polysemy is that a particular word will show multiple sense variations related by way of semantic extension and conceptual expansion. In the last fifty years or so, polysemy has been recognised as one of the central issues in lexical semantics, word sense disambiguation (WSD), actual sense extraction (ASE), language learning, conceptual categorisation of words, and computer processing of language. Language users can identify a polysemous word quite easily, but are not equipped to decipher all its possible sense variations without appropriate reference to a proper knowledge base and other relevant information embedded within contextual environments. We make an empirical effort to understand the basic nature of polysemy in Bangla. We also aim to learn how words denote sense variations, which factors are instrumental in making them polysemous, what impact they have on language understanding, and how sense variation can best be understood using information from various knowledge bases. Finally, we use a method to understand the role of various contexts, obtained from corpora, in maintaining an interface between words and their sense variations. Quick reference to local context is handy at times, but reference to focal, topical and global contexts as well as an extralinguistic knowledge base is necessary for understanding sense variation and obtaining the actual contextual sense.
This paper deals with the development of a spell-checker for Indian languages, using as an example Bangla, the second most popular language on the Indian subcontinent. A brief review of the problems and the current state of Indian-language spell-checkers is given. The approach for the Bangla spell-checker is then elaborated. The technique works in two stages. The first stage handles phonetic similarity errors: phonetically similar characters are mapped to a single character code, and a new dictionary $D_c$ is constructed over this reduced alphabet. A phonetically similar but wrongly spelt word can be easily corrected using this dictionary. The second stage handles errors other than phonetic similarity. A wrongly spelt word $S$ of $n$ characters is searched for in the dictionary $D_c$. If $S$ is a nonword, its first $k_1 \le n$ characters will match a valid word in $D_c$ (if $k_1 = n$, the word in $D_c$ must be longer than $n$). A reversed-word dictionary $D_r$ is also generated, in which the characters of each word are stored in reverse order. If the last $k_2$ characters of $S$ match a word in $D_r$, then a single error is located within the intersection of the first $k_1 + 1$ and the last $k_2 + 1$ characters of $S$. We observed that this region is very small compared to the word length in most cases, and the number of suggested corrections can be drastically reduced using this information. We have used our approach to correct Bangla text, where the problem of inflection is tackled by a simplified version of a morphological analyser. Another problem encountered in Indian languages is the existence of a large number of compound words formed by euphony and assimilation. The problem of compound words is also carefully tackled.
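The two stages described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the phonetic classes, the Latin-alphabet word list, and all function names are hypothetical stand-ins chosen for readability, and the second stage is run on the plain word list rather than on the coded dictionary $D_c$. The sketch only shows how a coded dictionary groups phonetically similar spellings, and how the prefix match ($k_1$) and suffix match ($k_2$) bound the position of a single error.

```python
from collections import defaultdict

# Toy phonetic classes: an assumption for illustration only,
# not the Bangla character mapping used in the paper.
PHONETIC_CLASS = {"c": "k", "q": "k", "z": "s", "y": "i"}

# Toy dictionary (hypothetical); a real system would load a Bangla lexicon.
WORDS = ["kitten", "sitting", "mitten", "kitchen"]


def phonetic_code(word):
    """Map each character to the representative of its phonetic class."""
    return "".join(PHONETIC_CLASS.get(ch, ch) for ch in word)


def build_coded_dictionary(words):
    """Stage 1: index valid words by their phonetic code (plays the role of D_c)."""
    coded = defaultdict(set)
    for w in words:
        coded[phonetic_code(w)].add(w)
    return coded


def suggest_phonetic(s, coded):
    """Valid words whose phonetic code equals that of the misspelt word s."""
    return sorted(coded.get(phonetic_code(s), ()))


def longest_prefix_match(s, words):
    """Length of the longest prefix of s shared with the start of any word."""
    best = 0
    for w in words:
        k = 0
        while k < min(len(s), len(w)) and s[k] == w[k]:
            k += 1
        best = max(best, k)
    return best


def error_region(s, words):
    """Stage 2: 0-based inclusive (start, end) bounds for a single error in s.

    k1 comes from the forward dictionary, k2 from the reversed one (D_r).
    The region is the overlap of the first k1+1 and the last k2+1 characters.
    """
    n = len(s)
    k1 = longest_prefix_match(s, words)
    reversed_words = [w[::-1] for w in words]  # plays the role of D_r
    k2 = longest_prefix_match(s[::-1], reversed_words)
    start = max(0, n - k2 - 1)
    end = min(n - 1, k1)
    return start, end
```

With this toy data, `suggest_phonetic("citten", build_coded_dictionary(WORDS))` returns `["kitten"]`, and `error_region("kitxen", WORDS)` returns `(3, 3)`, pinning the substituted character to a single position. A linear scan is used for the prefix match to keep the sketch short; a trie over the forward and reversed dictionaries would be the more natural structure for a large lexicon.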
Head detection is an important but difficult task when no restrictions such as static illumination, frontal face appearance, or uniform background can be assumed. We present a system that is able to perform head detect...
This paper proposes an automatic recognition scheme for hand-printed Bangla (an Indian script) numerals using neural network models. A Topology Adaptive Self Organizing Neural Network is first used to extract from a n...
In this paper we deal with performance improvement of robust PCA algorithms by replacing regular subsampling of images by an irregular image pyramid adapted to the expected image content. The irregular pyramid is a st...
Fast robotic unloading of piled deformable box-like objects (e.g. box-like sacks) is undoubtedly of great importance to industry. Existing systems, although fast, can only deal with layered, neatly placed configur...
In this paper we present an invisible spatial-domain watermarking technique. The technique divides the image into n small blocks, and the intensity of some of these blocks is modified depending on the key, which is a ...