This paper presents the recognition and beautification algorithms used in incremental graphic design applications. These applications offer multimodal interfaces integrating handwriting, gesture, and s...
This paper describes a method for extracting words, textlines and text blocks by analyzing the spatial configuration of the bounding boxes of connected components in a given document image. The basic idea is that connected...
In this paper, we present a mathematical model for evaluating codes and primitives in optical character recognition. The model is based on code efficiency, calculated from its average length and its transmitted informa...
We present a segmentation method guided by a generic layout description expressed in a new language. The proposed language makes it possible to describe a page as superposed layers that may be used to separate the main text body...
This paper deals with the extraction of words from printed documents that have interference and other strokes (enclosing curves, underlines, etc.). A new algorithm that combines the two processes of thinning and detec...
In this paper we consider the problem of clustering line segments into new ones. The clustering hierarchy answers the question of which original line segments are combined into larger ones. Such a clustering is...
This paper describes how document analysis techniques such as OCR, layout analysis, model-based recognition and interpretation can be fruitfully applied in the field of high-volume, high-accuracy document capturing with ...
Past research in shape analysis and OCR has often emphasized graph matching techniques. We propose to use matching of graph embeddings because this is what is actually of interest. In this way we obtain faster and sim...
Optical music recognition is a form of document analysis for which a priori knowledge is particularly important. Musical notation is governed by a substantial set of rules, but current systems fail to use them adequat...
Extracting structural information from paper documents supports daily document processing, e.g. by automatically finding index terms, document topics, etc. Knowledge about such components is modeled in a semantic ...