No significant research work towards recognition of handwritten Bangla characters has yet been done. Only a few works in this area are found in the literature which are based on small databases col-lected in laborator...
详细信息
recognition of handwritten characters is difficult because of variability involved in the writing style of different individuals. This paper deals with recognition of off-line Bangla handwritten characters using quadr...
详细信息
Autonomous model building is a crucial trend in model based methods like AAMs. This paper introduces an approach that deals with non-linearities by detecting distinct sub-parts in the data. Sub-models each representin...
详细信息
ISBN:
(纸本)1901725294
Autonomous model building is a crucial trend in model based methods like AAMs. This paper introduces an approach that deals with non-linearities by detecting distinct sub-parts in the data. Sub-models each representing an individual sub-part are derived from a minimum description length criterion. Thereby the resulting clique of models is more compact and obtains a better generalization behavior than a single model. The proposed AAM clique generation deals with non-linearities in the data in a generic information theoretic manner reducing the necessity of user interaction during training.
Postal automation is a topic of research over the last few years. There are many works towards the postal automation in USA, UK, Japan and Australia, but for Indian postal automation there is no significant work. This...
详细信息
Postal automation is a topic of research over the last few years. There are many works towards the postal automation in USA, UK, Japan and Australia, but for Indian postal automation there is no significant work. This paper deals with word-wise handwritten script identification for Indian postal automation. In the proposed scheme at first document skew is detected and corrected. Non-text parts are then segmented from the document using run length smoothing algorithm (RLSA). Next, using a piecewise projection method the destination address block (DAB), is at first segmented into lines and then into words. Using water reservoir concept we compute the busy-zone of the word. Finally, using matra/Shirorekha, water reservoir concept based feature, fractal based feature, etc. a neural network (NN) classifier is generated for word-wise Bangla and English scripts identification. Overall accuracy of the proposed system is at present 9 7.62%.
This paper deals with segmentation and recognition of touching characters appearing in scanned mathematical expressions. The technique is based on multifactorial analysis that integrates several factors determining cu...
详细信息
This paper deals with segmentation and recognition of touching characters appearing in scanned mathematical expressions. The technique is based on multifactorial analysis that integrates several factors determining cut-positions in a touching character image. A predictive algorithm is developed for efficient selection of possible cut-positions for segmenting touching characters. Experiment has been carried out using a test-set of reasonable size and results show that a considerable improvement in recognition accuracy can be achieved with a modest increase in computations.
Stemming is used in many information retrieval (IR) systems to reduce variant word forms to common roots, and thereby improving the overall retrieval efficiency. This paper presents an algorithm for stemming in the co...
详细信息
Stemming is used in many information retrieval (IR) systems to reduce variant word forms to common roots, and thereby improving the overall retrieval efficiency. This paper presents an algorithm for stemming in the context of document image retrieval system. The algorithm assumes that the documents are symbolically compressed and stemming has been attempted in the compressed domain itself. Experiments have been conducted on Indian language imaged documents for which efficient OCR still remains a challenging task. Results obtained from a set 150 document images (in Bangla script, the second most popular script in the Indian sub-continent) consisting of about 12K word show a promising performance of the proposed approach.
In this paper, we present a system towards Indian postal automation based on the recognition of pin-code and city name of the postal document. In the proposed system, at first, non-text blocks (postal stamp, postal se...
详细信息
In this paper, we present a system towards Indian postal automation based on the recognition of pin-code and city name of the postal document. In the proposed system, at first, non-text blocks (postal stamp, postal seal etc.) are detected and destination address block (DAB) is identified from the document. Next, lines and words of the DAB are segmented. Since India is a multi-lingual and multi-script country, the address part may be written by combination of two scripts. To identify the script by which a word is written, we propose a water reservoir based technique. It is very difficult to identify the script by which the pin-code portion is written. So, we have used two-stage artificial neural network (NN) based general classifiers for the recognition of pin-code digits written in English/Bangla. For recognition of city names, we propose an NSHP-HMM (non-symmetric half plane-hidden Markov model) based technique.
This paper proposes a method for robustly matching active appearance models (AAMs) on images with gross disturbances (outliers). The method consists of two steps. First, an initial residual is calculated by comparing ...
详细信息
ISBN:
(数字)9783540264316
ISBN:
(纸本)3540250522
This paper proposes a method for robustly matching active appearance models (AAMs) on images with gross disturbances (outliers). The method consists of two steps. First, an initial residual is calculated by comparing model and image appearance, and modes of the residual are analyzed. Second, all possible mode combinations are tested by evaluating an objective function. The objective function allows the selection of an outlier-free mode combination. Experiments demonstrate the ability of the robust matching method to successfully cope with outliers - compared to standard AAM matching, no degeneration of the model during matching occurs.
This paper deals with recognition of off-line unconstrained Oriya handwritten numerals. To take care of variability involved in the writing style of different individuals, the features are mainly considered from the c...
详细信息
This paper deals with recognition of off-line unconstrained Oriya handwritten numerals. To take care of variability involved in the writing style of different individuals, the features are mainly considered from the contour of the numerals. At first, the bounding box of a numeral is segmented into few blocks and chain code histogram is computed in each of the blocks. Features are mainly based on the direction chain code histogram of the contour points of these blocks. Neural network (NN) classifier and quadratic classifier are used separately for recognition and the results obtained from these two classifiers are compared. We tested the result on 3850 data collected from different individuals of various background and we obtained 90.38% (94.81%) recognition accuracy from NN (quadratic) classifier with a rejection rate of about 1.84% (1.31%), respectively.
Character segmentation is a necessary preprocessing step for character recognition in many handwritten word recognition systems. The most difficult case in character segmentation is the cursive script. Fully cursive n...
详细信息
暂无评论