This paper describes a skeletonization approach that has desirable characteristics for the analysis of static handwritten scripts. We concentrate on the situation where one is interested in recovering the parametric curve that produces the script. Using Delaunay tessellation techniques, static images are partitioned into sub-shapes, typical skeletonization artifacts are removed, and regions with a high density of line intersections are identified. An evaluation protocol that measures the efficacy of our approach is described. Although this approach is particularly useful as a pre-processing step for algorithms that estimate the pen trajectories of static signatures, it can also be applied to other static handwriting recognition techniques.
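As an illustration of the tessellation idea only (not the authors' exact algorithm), the sketch below triangulates sampled stroke-boundary points with SciPy's Delaunay routine and links the centroids of triangles that fall on ink to approximate a skeleton. The function name, contour sampling and inside test are assumptions made for this example.

```python
# Minimal sketch of a Delaunay-based skeleton; a simplification, not the paper's method.
# Assumes a binary image `stroke` (True = ink). Boundary points are tessellated and
# centroids of interior triangles are linked to approximate the skeleton.
import numpy as np
from scipy.spatial import Delaunay
from skimage import measure


def delaunay_skeleton(stroke: np.ndarray):
    # Sample sub-pixel boundary points of the stroke.
    contours = measure.find_contours(stroke.astype(float), 0.5)
    pts = np.vstack(contours)                      # (N, 2) boundary samples
    tri = Delaunay(pts)                            # tessellate the shape

    # Keep triangles whose centroid falls on ink; their centroids trace the skeleton.
    centroids = pts[tri.simplices].mean(axis=1)
    rows = np.clip(centroids[:, 0].round().astype(int), 0, stroke.shape[0] - 1)
    cols = np.clip(centroids[:, 1].round().astype(int), 0, stroke.shape[1] - 1)
    inside = stroke[rows, cols]

    # Link centroids of neighbouring interior triangles to form skeleton edges.
    edges = []
    for t, nbrs in enumerate(tri.neighbors):
        if not inside[t]:
            continue
        for n in nbrs:
            if n != -1 and inside[n] and n > t:
                edges.append((centroids[t], centroids[n]))
    return centroids[inside], edges
```

Working on triangles of the boundary rather than on a thinned raster is what makes it possible to treat sub-shapes and intersection-dense regions explicitly, which is the property the paper exploits.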
Advances in digital technologies have allowed us to generate more images than ever. Images of scanned documents are examples of these images and form a vital part of digital libraries and archives. Scanned degraded documents contain background noise and varying contrast and illumination; therefore, document image binarisation must be performed in order to separate foreground from background layers. Image binarisation is performed using either local adaptive thresholding or global thresholding, with local thresholding generally considered the more successful. This paper presents a novel method for global thresholding, where a neural network is trained on local threshold values of an image in order to determine an optimum global threshold value, which is then used to binarise the whole image. The proposed method is compared with five local thresholding methods, and the experimental results indicate that our method is computationally cost-effective and capable of binarising scanned degraded documents with superior results.
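A hedged sketch of the central idea follows, assuming per-tile Otsu thresholds as the local values and a small scikit-learn MLP as the network; the feature summary, tile size and training pairs are illustrative assumptions rather than the paper's actual design.

```python
# Learn a single global threshold from local threshold statistics (illustrative only).
import numpy as np
from skimage.filters import threshold_otsu
from sklearn.neural_network import MLPRegressor


def local_threshold_features(gray: np.ndarray, tile: int = 64) -> np.ndarray:
    """Per-tile Otsu thresholds summarised into a fixed-length feature vector."""
    locals_ = []
    for r in range(0, gray.shape[0] - tile + 1, tile):
        for c in range(0, gray.shape[1] - tile + 1, tile):
            patch = gray[r:r + tile, c:c + tile]
            if patch.std() > 1e-6:               # skip flat background tiles
                locals_.append(threshold_otsu(patch))
    locals_ = np.asarray(locals_)
    # Summary statistics stand in for whatever representation the network really uses.
    return np.array([locals_.mean(), locals_.std(),
                     np.percentile(locals_, 25), np.percentile(locals_, 75)])


def train(feature_rows, optimum_thresholds):
    # Training pairs (features, optimum global threshold) are assumed to exist.
    net = MLPRegressor(hidden_layer_sizes=(16,), max_iter=2000, random_state=0)
    net.fit(np.asarray(feature_rows), np.asarray(optimum_thresholds))
    return net


def binarise(gray: np.ndarray, net) -> np.ndarray:
    t = net.predict(local_threshold_features(gray)[None, :])[0]
    return gray < t                               # ink (dark foreground) below the global threshold
```

The cost advantage claimed in the abstract comes from applying only one predicted threshold to the whole image at run time, rather than computing a threshold per pixel or per window.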
Static handwritten scripts originate as images on documents and do not, by definition, contain any dynamic information. To improve the accuracy of static handwriting recognition systems, many techniques aim to estimate dynamic information from the static scripts. Mostly, the pen trajectories of the scripts are estimated. However, the efficacy of the resulting pen trajectories is rarely evaluated quantitatively. This paper proposes a protocol for the objective evaluation of automatically determined pen trajectories. A hidden Markov model is derived from a ground-truth trajectory. An estimated trajectory is then matched to the derived model. Statistics describing substitution, insertion and deletion errors are then computed from this match. The proposed algorithm is especially useful for performance comparisons between different pen trajectory estimation algorithms. (C) 2008 Elsevier Ltd. All rights reserved.
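The paper matches the estimated trajectory against an HMM derived from the ground truth; the sketch below substitutes a plain edit-distance alignment with a spatial tolerance, purely to illustrate how substitution, insertion and deletion counts can be read off an alignment. The tolerance and point representation are assumptions.

```python
# Substitution/insertion/deletion statistics from a point-wise alignment (illustrative).
import numpy as np


def trajectory_errors(truth: np.ndarray, estimate: np.ndarray, tol: float = 2.0):
    """truth: (N, 2) ground-truth pen positions; estimate: (M, 2) estimated positions."""
    n, m = len(truth), len(estimate)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, :] = np.arange(m + 1)                 # insertions only
    cost[:, 0] = np.arange(n + 1)                 # deletions only
    back = np.zeros((n + 1, m + 1), dtype=int)    # 0 = match/substitution, 1 = insertion, 2 = deletion
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = 0 if np.linalg.norm(truth[i - 1] - estimate[j - 1]) <= tol else 1
            cands = (cost[i - 1, j - 1] + sub, cost[i, j - 1] + 1, cost[i - 1, j] + 1)
            back[i, j] = int(np.argmin(cands))
            cost[i, j] = cands[back[i, j]]
    # Trace the alignment back and count each error type.
    subs = ins = dels = 0
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and back[i, j] == 0:
            subs += int(np.linalg.norm(truth[i - 1] - estimate[j - 1]) > tol)
            i, j = i - 1, j - 1
        elif j > 0 and (i == 0 or back[i, j] == 1):
            ins, j = ins + 1, j - 1
        else:
            dels, i = dels + 1, i - 1
    return {"substitutions": subs, "insertions": ins, "deletions": dels}
```

The HMM formulation in the paper serves the same role as this alignment, but it additionally tolerates local timing and sampling differences between the two trajectories.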
The problem of projecting multidimensional data into lower dimensions has been pursued by many researchers due to its potential application to data analyses of various kinds. This paper presents a novel multidimensional projection technique based on least square approximations. The approximations compute the coordinates of a set of projected points based on the coordinates of a reduced number of control points with defined geometry. We name the technique Least Square Projections (LSP). From an initial projection of the control points, LSP defines the positioning of their neighboring points through a numerical solution that aims at preserving a similarity relationship between the points given by a metric in mD. In order to perform the projection, a small number of distance calculations are necessary, and no repositioning of the points is required to obtain a final solution with satisfactory precision. The results show the capability of the technique to form groups of points by degree of similarity in 2D. We illustrate that capability through its application to mapping collections of textual documents from varied sources, a strategic yet difficult application. LSP is faster and more accurate than other existing high-quality methods, particularly where it was mostly tested, that is, for mapping text sets.
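A rough sketch of the LSP structure under simplifying assumptions: a handful of control points are laid out in 2D (classical MDS stands in for the initial projection), and the remaining points are placed by one least-squares solve that keeps each point near the average of its mD neighbours. Neighbour weights, control-point selection and the solver are illustrative choices, not the authors' exact formulation.

```python
# One linear solve places all points, constrained by a few projected control points.
import numpy as np
from sklearn.neighbors import NearestNeighbors
from sklearn.manifold import MDS


def lsp_project(X: np.ndarray, n_controls: int = 10, k: int = 8) -> np.ndarray:
    n = len(X)
    controls = np.random.default_rng(0).choice(n, n_controls, replace=False)
    # Initial 2-D layout of the control points (classical MDS stands in here).
    Yc = MDS(n_components=2, random_state=0).fit_transform(X[controls])

    # Neighbourhood graph in the original mD space.
    nbrs = NearestNeighbors(n_neighbors=k + 1).fit(X)
    _, idx = nbrs.kneighbors(X)

    # Laplacian-style rows: each point should equal the mean of its neighbours ...
    A = np.zeros((n + n_controls, n))
    b = np.zeros((n + n_controls, 2))
    for i in range(n):
        A[i, i] = 1.0
        A[i, idx[i, 1:]] = -1.0 / k
    # ... plus rows pinning the control points to their 2-D positions.
    for r, c in enumerate(controls):
        A[n + r, c] = 1.0
        b[n + r] = Yc[r]
    Y, *_ = np.linalg.lstsq(A, b, rcond=None)
    return Y                                       # (n, 2) projected coordinates
```

Only the control points require pairwise distance computations for their initial layout; every other point is positioned by the sparse linear system, which is where the speed advantage comes from.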
We present a method for creating 2 1/2D models from line drawings of opaque solid objects. As input, we use a single drawing composed of strokes indicative of surface geometry, but not of texture, color or shading. We attempt to allow the artist to draw naturally, differing from many previous approaches. Our system allows both perspective and orthographic projection to be used and we make no a priori assumptions about the type of model to be produced (i.e. planar, curved, normalon). The frontal geometry of the input drawing is reconstructed by placing constraints at the contours and solving a 2D variational system for the smoothest piecewise smooth surface. An analysis of line labelling allows us to determine what constraints are possible and/or required for each input line. However, because line labelling produces a combinatorial explosion of valid output geometries, we allow the user to guide the constraint selection and optimization with a simple user interface that abstracts the technical details away from the user. The system produces candidate reconstructions using different constraint values, from which the user selects the one that most closely approximates the model represented by the drawing. These choices allow the system to determine the constraints and reconstruct the model. The system runs at interactive speeds. (c) 2007 Published by Elsevier Ltd.
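As a toy illustration of the variational step only, the sketch below recovers a smooth scalar (depth) field over a pixel grid from sparse Dirichlet constraints by solving a discrete Laplace system; line labelling, piecewise smoothness and the interactive constraint selection are all omitted, and the constraint values are assumed to be given.

```python
# Smoothest field subject to sparse hard constraints (discrete Laplace equation).
import numpy as np
import scipy.sparse as sp
import scipy.sparse.linalg as spla


def smooth_depth(shape, constraints):
    """constraints: dict {(row, col): depth} fixing the field at some pixels."""
    h, w = shape
    n = h * w
    index = lambda r, c: r * w + c
    rows, cols, vals = [], [], []
    rhs = np.zeros(n)
    for r in range(h):
        for c in range(w):
            i = index(r, c)
            if (r, c) in constraints:             # Dirichlet constraint row
                rows.append(i); cols.append(i); vals.append(1.0)
                rhs[i] = constraints[(r, c)]
                continue
            nbrs = [(r + dr, c + dc) for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1))
                    if 0 <= r + dr < h and 0 <= c + dc < w]
            rows.append(i); cols.append(i); vals.append(float(len(nbrs)))
            for rr, cc in nbrs:                   # discrete Laplacian = 0 elsewhere
                rows.append(i); cols.append(index(rr, cc)); vals.append(-1.0)
    A = sp.csr_matrix((vals, (rows, cols)), shape=(n, n))
    return spla.spsolve(A, rhs).reshape(h, w)
```

In the paper, the analogous system is assembled from the constraints that line labelling permits at each stroke, and the user's choices among candidate reconstructions decide which constraint values are used.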
ISBN: (Print) 9789889867140
Most biologists keep data in separate databases. These databases are not necessarily well-structured. Plant identification keys are among such data. They are data-rich descriptions containing plant identification terminology and may be used to identify various plant species. The way the data is kept often requires the species identification to be done using rules that are applied sequentially. Done manually, this is very time consuming. Information extraction (IE) is a process of selecting information such as names, terms, or phrases from natural language text documents. This information is then structured into a specified template for retrieval. This method is applied to plant identification keys kept by the biologists. Before the keys are extracted from the descriptions, they have to go through a number of processes. In this paper, we illustrate the pre-processing and processing methods with an example from a database, with emphasis on the approximate string matching algorithm used to extract the most relevant keys from the description.
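The sketch below illustrates only the approximate string matching step, scoring each known identification term against the words of a description with Levenshtein distance and keeping close matches; the term list, threshold and tokenisation are illustrative assumptions.

```python
# Approximate matching of identification terms against free-text descriptions.
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance via the standard rolling-row dynamic programme."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                 # deletion
                           cur[j - 1] + 1,              # insertion
                           prev[j - 1] + (ca != cb)))   # substitution
        prev = cur
    return prev[-1]


def extract_keys(description: str, terms: list[str], max_dist: int = 2) -> list[str]:
    """Return the terms whose closest word in the description is within max_dist edits."""
    words = description.lower().split()
    hits = []
    for term in terms:
        best = min(edit_distance(term.lower(), w) for w in words)
        if best <= max_dist:
            hits.append(term)
    return hits


# e.g. extract_keys("Leaves opposite, margin serate", ["serrate", "opposite"])
# tolerates the misspelling "serate" and returns both terms.
```

Approximate rather than exact matching matters here because the hand-maintained databases contain spelling variants and abbreviations of the same terminology.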
Many libraries, museums, and other organizations contain large collections of handwritten historical documents, for example, the papers of early presidents like George Washington at the Library of Congress. The first step in providing recognition/retrieval tools is to automatically segment handwritten pages into words. State of the art segmentation techniques like the gap metrics algorithm have been mostly developed and tested on highly constrained documents like bank checks and postal addresses. There has been little work on full handwritten pages and this work has usually involved testing on clean artificial documents created for the purpose of research. Historical manuscript images, on the other hand, contain a great deal of noise and are much more challenging. Here, a novel scale space algorithm for automatically segmenting handwritten (historical) documents into words is described. First, the page is cleaned to remove margins. This is followed by a gray-level projection profile algorithm for finding lines in images. Each line image is then filtered with an anisotropic Laplacian at several scales. This procedure produces blobs which correspond to portions of characters at small scales and to words at larger scales. Crucial to the algorithm is scale selection, that is, finding the optimum scale at which blobs correspond to words. This is done by finding the maximum over scale of the extent or area of the blobs. This scale maximum is estimated using three different approaches. The blobs recovered at the optimum scale are then bounded with a rectangular box to recover the words. A postprocessing filtering step is performed to eliminate boxes of unusual size which are unlikely to correspond to words. The approach is tested on a number of different data sets and it is shown that, on 100 sampled documents from the George Washington corpus of handwritten document images, a total error rate of 17 percent is observed. The technique outperforms a state-of-the-art gap metrics algorithm.
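A condensed sketch of the scale-selection idea, assuming an anisotropic Gaussian in place of the paper's anisotropic Laplacian and a crude intensity threshold for blob extraction: each line image is smoothed at several scales, the scale at which blob extent peaks is kept, and blob bounding boxes stand in for word boxes.

```python
# Scale-space word segmentation of a text-line image (simplified illustration).
import numpy as np
from scipy import ndimage


def word_boxes(line_img: np.ndarray, scales=(1, 2, 3, 4, 6, 8)):
    """line_img: grayscale line image with dark ink on a light background."""
    ink = line_img.max() - line_img.astype(float)           # ink = high values
    best_area, best_labels = -1.0, None
    for s in scales:
        # Stronger smoothing along the writing direction than across it.
        smooth = ndimage.gaussian_filter(ink, sigma=(0.5 * s, 2.0 * s))
        blobs = smooth > smooth.mean() + smooth.std()        # crude blob mask
        labels, n = ndimage.label(blobs)
        # Median blob area stands in for the paper's blob-extent measure.
        area = np.median(ndimage.sum(blobs, labels, range(1, n + 1))) if n else 0.0
        if area > best_area:
            best_area, best_labels = area, labels
    boxes = ndimage.find_objects(best_labels)
    # Post-filter boxes of implausible size (tiny specks), as in the paper.
    return [b for b in boxes
            if (b[0].stop - b[0].start) > 3 and (b[1].stop - b[1].start) > 3]
```

The key property is that at too small a scale the blobs fragment into character pieces and at too large a scale neighbouring words merge, so blob extent as a function of scale peaks near the word level.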
Static signatures originate as handwritten images on documents and by definition do not contain any dynamic information. This lack of information makes static signature verification systems significantly less reliable than their dynamic counterparts. This study involves extracting dynamic information from static images, specifically the pen trajectory that was followed while the signature was created. We assume that a dynamic version of the static image is available (typically obtained during an earlier registration process). We then derive a hidden Markov model from the static image and match it to the dynamic version of the image. This match results in the estimated pen trajectory of the static image.
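A highly simplified sketch of the matching step: each skeleton point of the static image acts as an HMM state with a Gaussian emission around its position, transitions connect nearby states, and Viterbi decoding over the dynamic samples yields an ordering of the static points. The state topology, emission features and parameters are assumptions made only to convey the structure.

```python
# Viterbi matching of a dynamic signature to states derived from a static skeleton.
import numpy as np


def viterbi_trajectory(skel_pts: np.ndarray, dyn: np.ndarray,
                       sigma: float = 3.0, hop: float = 10.0) -> np.ndarray:
    """skel_pts: (S, 2) static skeleton points; dyn: (T, 2) dynamic samples."""
    S, T = len(skel_pts), len(dyn)
    # Log transitions: uniform over states within `hop` pixels, negligible elsewhere.
    d = np.linalg.norm(skel_pts[:, None] - skel_pts[None, :], axis=-1)
    logA = np.where(d <= hop, 0.0, -1e3)
    logA -= np.logaddexp.reduce(logA, axis=1, keepdims=True)
    # Log emissions: isotropic Gaussian around each skeleton point.
    e = -np.linalg.norm(dyn[:, None] - skel_pts[None, :], axis=-1) ** 2 / (2 * sigma ** 2)
    # Viterbi recursion.
    delta = e[0] - np.log(S)
    back = np.zeros((T, S), dtype=int)
    for t in range(1, T):
        scores = delta[:, None] + logA            # (from-state, to-state)
        back[t] = scores.argmax(axis=0)
        delta = scores.max(axis=0) + e[t]
    # Trace back the most likely state (skeleton point) sequence.
    path = [int(delta.argmax())]
    for t in range(T - 1, 0, -1):
        path.append(int(back[t, path[-1]]))
    return skel_pts[np.array(path[::-1])]          # estimated pen trajectory
```

Because the registered dynamic signature supplies the temporal ordering, the decoded state sequence orders the static skeleton points, which is exactly the information a static image lacks on its own.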