We introduce a technique based on diagonal white runs and vertical edges, that divides a document image into columns and blocks which are subsequently classified as text or graphics. A diagonal white run (drun) is a s...
Presents a new approach called regional projection transformation (APT) which converts a compound pattern into an integral object. Diagonal-diagonal regional projection transformation (DDRPT), has been described and a...
详细信息
Presents a new approach called regional projection transformation (APT) which converts a compound pattern into an integral object. Diagonal-diagonal regional projection transformation (DDRPT), has been described and analyzed. The patterns transformed from this method possesses a couple of important characteristics which facilitate the recognition of compound patterns. Parallel algorithm for the DDRPT has been presented in this paper. It can speed up computation and the recognition process.< >
Knowing the structure of a document is the key to successful processing of that document. From different points of view, there exist different definitions for document structures. A survey which contains a collection ...
详细信息
Knowing the structure of a document is the key to successful processing of that document. From different points of view, there exist different definitions for document structures. A survey which contains a collection of many methods of describing document structures is presented. Several novel concepts and theoretical analyses are also presented in this survey.< >
The correlation calculation is a kind of frequency calculation. It has been used in many application fields, such as: image processing and patternrecognition, control and robots, etc. We present a new and highly effi...
详细信息
The correlation calculation is a kind of frequency calculation. It has been used in many application fields, such as: image processing and patternrecognition, control and robots, etc. We present a new and highly efficient algorithm. The algorithm, named variable parameter hierarchical discrete correlation (VPHDC), is an extension of HDC. It has some distinguishing features like HDC in its applications, such as efficiency, speed and so on, moreover it is more flexible than HDC, which makes complex correlation calculation fast and efficient. Because the windows can be increased to any size, it can be used for objects of all sizes.< >
With the ever-increasing amounts of published materials being made available, developing efficient means of locating target items has become a subject of significant interest. Among the approaches adopted for this pur...
详细信息
With the ever-increasing amounts of published materials being made available, developing efficient means of locating target items has become a subject of significant interest. Among the approaches adopted for this purpose is word spotting, which enables the identification of documents through the use of pertinent keywords. This paper reports on an effective method of word spotting for Arabic handwritten documents that takes into consideration the nature of Arabic handwriting. Parts of Arabic Words (PAWs) form the basic components of this search process, and a hierarchical classifier (consisting of a set of classifiers each trained on a different part of the input pattern) is implemented. For the first time in Arabic word spotting, language models are incorporated into the process of reconstructing words from PAWs. Details of the method and promising experimental results are also presented.
The correlation calculation is a kind of frequency calculation. It has been used in many application fields, such as: image processing and patternrecognition, control and robots, etc. We present a new and highly effi...
详细信息
Extraction of a stable and representative set of features is the heart of the design of a patternrecognition system. Knowing the distribution of information on the pixels of a character will be of great assistance to...
详细信息
Extraction of a stable and representative set of features is the heart of the design of a patternrecognition system. Knowing the distribution of information on the pixels of a character will be of great assistance to the study of feature extraction. In this paper, an analysis of the distribution of information on the pixels of binarized Chinese characters is presented. From the analysis, it is obvious that the information of a Chinese character tends to concentrate around the peripheries of the character. Several methods to extract peripheral shape features are presented. Some experiments are conducted on Chinese character recognition and the results show the advantages of the peripheral shape features.
A new method of separating touching unconstrained handwritten digits is proposed. A binary image containing a string of touching digits is scanned to give contour chains. The chains are analyzed and subdivided into fo...
详细信息
A new method of separating touching unconstrained handwritten digits is proposed. A binary image containing a string of touching digits is scanned to give contour chains. The chains are analyzed and subdivided into four kinds of regions: valleys, mountains, holes, and open regions. Individual points of interest in the outer contour are then identified, e.g., points of high curvature. The separating path is assumed to pass between some pair of these significant contour points (SCPs). Nine features of the SCPs are measured and are used to sort the list of all possible pairings of SCPs. Preliminary results show that the correct cut is sorted within the first three choices in 89% of tests.< >
A new courtesy amount recognition module of CENPARMIpsilas check reading system (CRS) is proposed in this paper. The module consists of 3 main segments: pre-processing, segmentation and recognition, and post-processin...
详细信息
ISBN:
(纸本)9781424421749
A new courtesy amount recognition module of CENPARMIpsilas check reading system (CRS) is proposed in this paper. The module consists of 3 main segments: pre-processing, segmentation and recognition, and post-processing. A new feedback-based segmentation algorithm is adopted for the segmentation task. Besides one individual numeral recognizer for numerals from dasia0psila to dasia9psila, one convolutional neural network(CNN) recognizer for ldquo00rdquo and ldquo000rdquo numeral strings is also integrated into our module for the recognition task. The experimental results on the Quebec Bell Check database show that the recognition rate of the courtesy amount has improved from 41.2% to 74.3%.
This paper proposes a general local learning framework to effectively alleviate the complexities of classifier design by means of "divide and conquer" principle and ensemble method. The learning framework co...
详细信息
ISBN:
(纸本)0769512631
This paper proposes a general local learning framework to effectively alleviate the complexities of classifier design by means of "divide and conquer" principle and ensemble method. The learning framework consists of quantization layer and ensemble layer. After GLVQ and MLP are applied to the framework, the proposed method is tested on MNIST handwritten digit database. The obtained performance is very promising, an error rate with 0.99%, which is comparable to that of LeNet5, one of the best classifiers on this database. Further, in contrast to LeNet5, our method is especially suitable for a large-scale real-world classification problem.
暂无评论