Character segmentation is a necessary preprocessing step for character recognition in many handwritten word recognition systems. The most difficult case in character segmentation is the cursive script. Fully cursive n...
详细信息
Character segmentation is a necessary preprocessing step for character recognition in many handwritten word recognition systems. The most difficult case in character segmentation is the cursive script. Fully cursive nature of Bangla handwriting, the natural skewness in words poses some challenges for automatic character segmentation. In this article a novel approach to skew detection, correction as well as character segmentation has been presented for handwritten Bangla words as a test case. Segmenting points are extracted on the basis of some patterns observed in the handwritten words. With these segmenting points a graphical path (hereafter referred to as a candidate path) has been constructed. The handwritten words contain some consistent and also inconsistent skewness. Our algorithm can cope with both types of skewness at a time. Further the method is so direct that with the help of a candidate path one can handle both skew correction and segmentation successfully. the algorithm has been tested on a database prepared for laboratory use. The method yields fairly good results for this database.
Active shape models are powerful and widely used tool to interpret complex image data. By building models of shape variation they enable search algorithms to use a priori knowledge in an efficient and gainful way. How...
详细信息
Active shape models are powerful and widely used tool to interpret complex image data. By building models of shape variation they enable search algorithms to use a priori knowledge in an efficient and gainful way. However, due to the linearity of PCA, non-linearities like rotations or independently moving sub-parts in the data can deteriorate the resulting model considerably. Although non-linear extensions of active shape models have been proposed and application specific solutions have been used, they still need a certain amount of user interaction during model building. In this paper the task of building/choosing optimal models is tackled in a more generic information theoretic fashion. In particular, we propose an algorithm based on the minimum description length principle to find an optimal subdivision of the data into sub-parts, each adequate for linear modeling. This results in an overall more compact model configuration. Which in turn leads to a better model in terms of modes of variations. The proposed method is evaluated on synthetic data, medical images and hand contours.
ISITRA is a new scheme of signal decomposition and reconstruction. In ISITRA, the space of PRF sets is much larger and more well-behaved than that in the existing schemes like filter bank or wavelets. Since such a spa...
详细信息
This paper describes a novel fast correlation attack of stream ciphers. The salient feature of the algorithm is the absence of any pre-processing or iterative phase, an usual feature of existing fast correlation attac...
详细信息
One of the major challenges in speech synthesis and recognition is co-articulated unit segmentation. In this paper we present a novel technique for segmenting the basic co-articulated units using multifactorial analys...
详细信息
In this paper a rule based rough set decision system for development of a disease inference engine is described. For this purpose an off-line data acquisition system of paper electrocardiogram (ECG) records are develo...
详细信息
In this paper a rule based rough set decision system for development of a disease inference engine is described. For this purpose an off-line data acquisition system of paper electrocardiogram (ECG) records are developed using image processing techniques. A QRS detector is developed for detection of R-R interval from ECG waves. After detection of this R-R interval the P and T waves are detected based on syntactic approaches and different time-plane features are extracted from every ECG signals. From a knowledgebase which is developed from the feedback of different reputed cardiologists and consultation of different medical books the essential time plane features for ECG interpretation have been selected. Finally, a rule-based roughest decision system is generated for the development of an inference engine for disease identification from these time-plane features.
Postal automation is a topic of research over the last few years. There are many works towards the postal automation in USA, UK, Japan and Australia, but for Indian postal automation there is no significant work. This...
详细信息
Postal automation is a topic of research over the last few years. There are many works towards the postal automation in USA, UK, Japan and Australia, but for Indian postal automation there is no significant work. This paper deals with word-wise handwritten script identification for Indian postal automation. In the proposed scheme at first document skew is detected and corrected. Non-text parts are then segmented from the document using run length smoothing algorithm (RLSA). Next, using a piece-wise projection method the destination address block (DAB) is at first segmented into lines and then links into words. Using water reservoir concept we compute the busy-zone of the word. Finally, using matra/Shirorekha, water reservoir concept based feature, etc. a tree classifier is generated for word-wise Bangla/Devnagari and English scripts identification.
ISITRA is a new scheme of signal decomposition and reconstruction. In ISITRA, the space of PRF sets is much larger and more well-behaved than that in the existing schemes like filter bank or wavelets. Since such a spa...
详细信息
ISITRA is a new scheme of signal decomposition and reconstruction. In ISITRA, the space of PRF sets is much larger and more well-behaved than that in the existing schemes like filter bank or wavelets. Since such a space is constrained, it is mapped to an unconstrained space in which an optimization technique can be applied to find optimal PRF sets in terms of some criterion. Our criterion here is based on mean square error and the optimization technique used is genetic algorithms. Optimal PRF sets thus found perform better than the popular Daubechies' filters for a compression task.
One of the major challenges in speech synthesis and recognition is coarticulated unit segmentation. In this paper we present a novel technique for segmenting the basic coarticulated units using multifactorial analysis...
详细信息
One of the major challenges in speech synthesis and recognition is coarticulated unit segmentation. In this paper we present a novel technique for segmenting the basic coarticulated units using multifactorial analysis based approach. The proposed algorithm is applied on isolated spoken words in Bangla. The results obtained from a considerably large database show the strength of the approach.
Efficient extraction of mathematical expressions is considered as an important pre-processing step to apply existing OCR systems to convert scientific papers into their electronic format. In this correspondence, a tec...
详细信息
Efficient extraction of mathematical expressions is considered as an important pre-processing step to apply existing OCR systems to convert scientific papers into their electronic format. In this correspondence, a technique for extracting embedded (or in-line) expressions has been presented. The proposed method for expression extraction initially invokes an existing OCR to recognize the input document. Several features including word n-grams (a statistical analysis of a corpus of scientific documents reveals that the word level n-gram profile for sentences containing embedded expressions is quite different from that of the sentences without any expression) are computed on sentence level to spot sentences containing expressions. Expression zones are pin pointed by exploiting OCR inability to handle expressions and by using some common typographical aspects followed in typing mathematical expressions. Experimental results on a considerable size of dataset show high efficiency of the proposed technique.
暂无评论