image shape restoration based on mathematical transformation is a successful approach to nonlinear distortions in computer vision, robot vision and patternrecognition. The key of this process is to find the distortio...
详细信息
image shape restoration based on mathematical transformation is a successful approach to nonlinear distortions in computer vision, robot vision and patternrecognition. The key of this process is to find the distortion function and its inverse function. Usually, the distortion function is unknown or unclear. Even in the case where the function is known, it remains difficult to compute or estimate the parameters necessary for the restoration. To overcome this problem, Coons transformation utilizing boundary functions for the distorted images have been used to approximate the exact distortion function. The boundary functions are calculated using B-spline curve interpolation which is coincided with the necessary condition of major elements that constitute a Coons transformation.
A new method of reading the handwritten zip codes in the U.S. Postal Services CD-ROM database is presented. Zip code images are binarized, segmented and recognised. A recognition driven method for splitting multiple c...
详细信息
A new method of reading the handwritten zip codes in the U.S. Postal Services CD-ROM database is presented. Zip code images are binarized, segmented and recognised. A recognition driven method for splitting multiple connected digits has been developed; for grouping together of broken digits, the system targets components with near-touching stroke tips, 5-hats, and 4-Ls. The digit recogniser is a majority vote combination of 3 neural networks with a zero rejection performance of 96.53% on the 2711 imperfectly segmented digits in the cedarbs test set. With digit splitting capability disabled, the system performance on the 930 whole zip codes of the test set is 61.0% correct with no errors when up to two rejected symbol positions are allowed. With digit splitting enabled the performance rises to 66.3%.
In this paper, we propose an integrated segmentation and recognition method for recognizing connected handwritten characters with recurrent neural network. It has been developed to both integrate segmentation and reco...
详细信息
ISBN:
(纸本)0818671289
In this paper, we propose an integrated segmentation and recognition method for recognizing connected handwritten characters with recurrent neural network. It has been developed to both integrate segmentation and recognition within a single recurrent neural network and recognize connected handwritten characters using the spatial dependencies in the images of connected handwritten characters. In order to verify the performance of the proposed method, experiments with the NIST database have been carried out and the performance of the proposed method has been compared with those of the previous integrated segmentation and recognition methods.
The generalized Hough transform is a useful method for detecting arbitrary 2-dimensional shapes. Based on the consideration that handwritten Chinese characters are complex 2-dimensional shapes, we make an attempt to r...
详细信息
ISBN:
(纸本)0818671289
The generalized Hough transform is a useful method for detecting arbitrary 2-dimensional shapes. Based on the consideration that handwritten Chinese characters are complex 2-dimensional shapes, we make an attempt to recognize them by using the generalized Hough transform. Features extracted from the parameter space are used to calculate the similarity measure between two character images. The algorithm is implemented in a look-up-table scheme which can be extended to other applications of patternrecognition. Experimental results on personal (writer-dependent) handwritten Chinese character recognition show that this algorithm is suitable for the pre-classification of personal handwritten Chinese character recognition.
A novel page segmentation algorithm is provided in this paper. Based on the extraction of the background, it offers the benefit of being adaptive to the context of the document and to be insensitive to the orientation...
详细信息
A novel page segmentation algorithm is provided in this paper. Based on the extraction of the background, it offers the benefit of being adaptive to the context of the document and to be insensitive to the orientation of the text blocks. It involves a two-dimensional isotropic structuring element used to characterized the white streams. This element is a disk approximated by a regular octagon which can be recursively generated. Another advantage of the proposed method is that a hierarchical segmentation can be derived from the image built upon the octagonal pattern. This tree allows to perform an isotropic multi-scale smearing, which leads to a physical segmentation. The algorithms are based on an input-time tracing principle and use a single scan of the image, they are very well suited to a real-time implementation.
Rumours of the death of the problem of machine-printed text recognition have been greatly exaggerated. Reported results can be good enough to lead one to believe that this is a "solved problem". Closer analy...
详细信息
Rumours of the death of the problem of machine-printed text recognition have been greatly exaggerated. Reported results can be good enough to lead one to believe that this is a "solved problem". Closer analysis reveals test data that is often limited in its range of fonts and point sizes. Worse still, results are commonly quoted for noise-free images, ignoring the problems of recognising "real" documents such as faxes. Various methods have been proposed for modelling characters with Hidden Markov Models. The authors, amongst others, have suggested representing a character by analysing the pixel pattern in columns of its image, and linking sequential column patterns together with a HMM. In this paper we propose a method of quantising the patterns by means of a Shift Invariant Hamming Distance. A full experimental evaluation (45 fonts, 5 point sizes) in typical noise results in a recognition accuracy of 99% in the top-3 choices, and 94% top-choice for the best font. The method has a significant advantage in recognising noisy wordimages, due to classification being achieved without a prior segmentation of the word into characters.
A new method is presented to the B-spline surface presentation of an object defined by a set of parallel slices. For the application of B-spline inversion procedure, the methods about the data points generated are mai...
详细信息
ISBN:
(纸本)3540606971
A new method is presented to the B-spline surface presentation of an object defined by a set of parallel slices. For the application of B-spline inversion procedure, the methods about the data points generated are mainly introduced. The mesh of the data points is generated from interpolation curves of the vertex of the contour, which enable the interpolating surface to approximate the original object. It has the advantage that the reconstructed surface keep the smoothness in total longitudinal direction. The proposed method is also capable of handling the branching problem. Several experimental results corroborate the theory. The results show the image with high fidelity and the rendering speed are satisfactory and pleasing.
This paper presents a new approach to document analysis. The proposed approach is based on modified fractal signature. Instead of the time-consuming traditional approaches (top-down and bottom-up approaches) where ite...
详细信息
This paper presents a new approach to document analysis. The proposed approach is based on modified fractal signature. Instead of the time-consuming traditional approaches (top-down and bottom-up approaches) where iterative operations are necessary to break a document into blocks to extract its geometric (layout) structure, this new approach can divide a document into blocks in only one step. This approach can be used to process documents with high geometrical complexity. Experiments have been conducted to prove the proposed new approach for document processing.
This paper presents a visualization method called the deformed cube for visualizing 3D velocity vector field. Based on the decomposition of the tensor which describes the changes of the velocity, it provides a techniq...
详细信息
ISBN:
(纸本)3540606971
This paper presents a visualization method called the deformed cube for visualizing 3D velocity vector field. Based on the decomposition of the tensor which describes the changes of the velocity, it provides a technique for visualizing local how. A deformed cube,a cube transformed by a tensor in a local coordinate frame, shows the local stretch, shear and rigid body rotation of the local flow corresponding to the decomposed component of the tenser. User can interactively view the local deformation or any component of the changes. The animation of the deformed cube moving along a streamline achieves a more global impression of the flow field. This method is intended as a complement to global visualisation methods.
An off-line recognition engine is proposed for handwritten Korean characters based on a distance matching and the neural network technique. The distance matching selects a set of several candidates from the large set ...
详细信息
ISBN:
(纸本)0818671289
An off-line recognition engine is proposed for handwritten Korean characters based on a distance matching and the neural network technique. The distance matching selects a set of several candidates from the large set of character classes, and the neural network performs a detailed classification on the candidates. As an approach for combining the two methodologies, a clustering method based on sample distributions has been devised. recognition accuracy of the engine on a public database, PE92, is 84.1%. About four character patterns can be processed in a second on PC.
暂无评论