This paper deals with an OCR error detection and correction technique for a highly inflectional language script like Bangla (a major indian language). This is the first report of its kind. Using two separate lexicons ...
详细信息
This paper deals with an OCR error detection and correction technique for a highly inflectional language script like Bangla (a major indian language). This is the first report of its kind. Using two separate lexicons of root words and suffixes, candidate root-suffix pairs of each input word are detected, their grammatical agreement are tested and the root/suffix part in which the error has occurred is noted. The correction is made on the corresponding error part of the input string by a fast dictionary access technique. To do so some alternative strings are generated for an erroneous word. Among the alternative strings, those satisfying grammatical agreement in root-suffix and also having smallest Levenstein-Damerau distance are finally chosen as the correct ones. The system has an accuracy of 75.61%.
Postal automation is a topic of research over the last few years. There are many works towards the postal automation in USA, UK, Japan and Australia, but for indian postal automation there is no significant work. This...
详细信息
Postal automation is a topic of research over the last few years. There are many works towards the postal automation in USA, UK, Japan and Australia, but for indian postal automation there is no significant work. This paper deals with word-wise handwritten script identification for indian postal automation. In the proposed scheme at first document skew is detected and corrected. Non-text parts are then segmented from the document using run length smoothing algorithm (RLSA). Next, using a piece-wise projection method the destination address block (DAB) is at first segmented into lines and then links into words. Using water reservoir concept we compute the busy-zone of the word. Finally, using matra/Shirorekha, water reservoir concept based feature, etc. a tree classifier is generated for word-wise Bangla/Devnagari and English scripts identification.
The following topics are dealt with: convolutional neural nets; learning (artificial intelligence); image classification; computervision; feature extraction; video signal processing; deep learning (artificial intelli...
详细信息
ISBN:
(数字)9781728185798
ISBN:
(纸本)9781728185804
The following topics are dealt with: convolutional neural nets; learning (artificial intelligence); image classification; computervision; feature extraction; video signal processing; deep learning (artificial intelligence); image segmentation; object detection; image recognition.
Writer identification is a vibrant research field, though a lot of work has been done on writer identification on normal text, writer identification in music score sheet has not been addressed in that large scale. Her...
详细信息
ISBN:
(纸本)9781479925735
Writer identification is a vibrant research field, though a lot of work has been done on writer identification on normal text, writer identification in music score sheet has not been addressed in that large scale. Here we propose a method to identify writers of music score sheets using Daubchies wavelet features along with SVM classifier. We have evaluated our proposed approach in a sub-set of CVC-MUSCIMA dataset. From the experiment on 140 score sheet images from 7 writers we obtained encouraging results.
This paper deals with segmentation and recognition of touching characters appearing in scanned mathematical expressions. The technique is based on multifactorial analysis that integrates several factors determining cu...
详细信息
This paper deals with segmentation and recognition of touching characters appearing in scanned mathematical expressions. The technique is based on multifactorial analysis that integrates several factors determining cut-positions in a touching character image. A predictive algorithm is developed for efficient selection of possible cut-positions for segmenting touching characters. Experiment has been carried out using a test-set of reasonable size and results show that a considerable improvement in recognition accuracy can be achieved with a modest increase in computations.
In a multi-lingual country like India, a document may contain more than one script forms. For such a document it is necessary to separate different script forms before feeding them to OCRs of individual script. In thi...
详细信息
In a multi-lingual country like India, a document may contain more than one script forms. For such a document it is necessary to separate different script forms before feeding them to OCRs of individual script. In this paper an automatic word segmentation approach is described which can separate Roman, Bangla and Devnagari scripts present in a single document. The approach has a tree structure where at first Roman script words are separated using the 'headline' feature. The headline is common in Bangla and Devnagari but absent in Roman. Next, Bangla and Devnagari words are separated using some finer characteristics of the character set although recognition of individual character is avoided. At present, the system has an overall accuracy of 96.09%.
An OCR system is proposed that can read two indian language scripts: Bangla and Devnagari (Hindi), the most popular ones in the indian subcontinent. These scripts, having the same origin in ancient Brahmi script, have...
详细信息
An OCR system is proposed that can read two indian language scripts: Bangla and Devnagari (Hindi), the most popular ones in the indian subcontinent. These scripts, having the same origin in ancient Brahmi script, have many features in common and hence a single system can be modeled to recognize them. In the proposed model, document digitization, skew detection, text line segmentation and zone separation, word and character segmentation, character grouping into basic, modifier and compound character category are done for both scripts by the same set of algorithms. The feature sets and classification tree as well as the knowledge base required for error correction (such as lexicon) differ for Bangla and Devnagari. The system shows a good performance for single font scripts printed on clear documents.
We propose an approach for understanding mathematical expressions in printed documents. The overall approach is divided into three main steps: (i) detection of mathematical expressions in a document, (ii) recognition ...
详细信息
ISBN:
(纸本)0769507506
We propose an approach for understanding mathematical expressions in printed documents. The overall approach is divided into three main steps: (i) detection of mathematical expressions in a document, (ii) recognition of the symbols present in the expression and (iii) arrangement of the recognized symbols. The detection of mathematical expressions is done through recognition of a few most common symbols and exploiting some structural features of the expressions. A hybrid of feature based and a template-based technique is used for the recognition of symbols. A two-pass approach is used for arrangement of the symbols. The first pass (scanning or lexical analysis) performs a micro-level examination of the symbols in order to identify the symbol groups occurring in them and to determine their categories or descriptors. The second pass (parsing or syntax analysis) processes the descriptors synthesized in the first pass, to determine the syntactic structure of the expression. A set of predefined rules guides the activities in both the passes. Experiments conducted using this approach on a large number of documents show high accuracy.
Air-writing refers to virtually writing linguistic characters through hand gestures in three-dimensional space with six degrees of freedom. This paper proposes a generic video camera-aided convolutional neural network...
详细信息
There are many documents where text lines are not parallel to each other i.e. these lines have different inclinations with the horizontal lines (multi-skew documents). For the OCR of such a document we have to estimat...
详细信息
ISBN:
(纸本)0769512631
There are many documents where text lines are not parallel to each other i.e. these lines have different inclinations with the horizontal lines (multi-skew documents). For the OCR of such a document we have to estimate the skew angle of individual text lines because a single rotation cannot de-skew all text lines of the document. In this paper, we describe a robust technique for multi-skew angle detection from indian documents containing the most popular indian scripts Devnagari and Bangla. Most characters in these scripts have horizontal lines at the top, called head-lines. The character head-lines usually connect one another in a word and the word appears as a single component. In the proposed method, the connected components are at first labeled and selected. The upper envelopes of selected components are found by column-wise scanning from the top of the component. Portions of the upper envelope satisfying the properties of a digital straight line are detected. They are then clustered into groups belonging to single text lines. Estimates from these individual clusters give the skew angle of each text line. The proposed multi-skew detection technique has an accuracy about 98.3%.
暂无评论