The document image segmentation is an important component in the documentimage understanding. kernel-based methods have demonstrated excellent performances in a variety of pattern recognition problems. This paper app...
详细信息
ISBN:
(纸本)0780393953
The document image segmentation is an important component in the documentimage understanding. kernel-based methods have demonstrated excellent performances in a variety of pattern recognition problems. This paper applies kernel-based methods and Gabor wavelet to the document image segmentation. The feature image are derived from Gabor filtered images. Taking the computational complexity into account, we subject the sampled feature image to spectral clustering algorithm (SCA). The clustering results serve as training samples to train a support vector machine (SVM). The initial segmentation is obtained by assigning class labels to pixels of the feature image with the trained SVM. A proper post-processing is used to improve the segmentation result. Several representative documentimages scanned from popular newspapers and journals are employed to verify the effectiveness of our algorithm.
In this paper, we present a text segmentation method using wavelet packet analysis and k-means clustering algorithm. This approach assumes that the text and non-text regions are considered as two different texture reg...
详细信息
ISBN:
(纸本)9788955191318
In this paper, we present a text segmentation method using wavelet packet analysis and k-means clustering algorithm. This approach assumes that the text and non-text regions are considered as two different texture regions. The text segmentation is achieved by using wavelet packet analysis as a feature analysis. The wavelet packet analysis is a method of wavelet decomposition that offers a richer range of possibilities for documentimage. From these multiscale features, we compute the local energy and intensify the features before adapting the k-means clustering algorithm based on the unsupervised learning rule. The results show that our text segmentation method is effective for documentimages scanned from newspapers and journals.
The requirement of detection and identification of tables from documentimages is crucial to any documentimage analysis and digital library system. In this paper we report a very simple but extremely powerful approac...
详细信息
The requirement of detection and identification of tables from documentimages is crucial to any documentimage analysis and digital library system. In this paper we report a very simple but extremely powerful approach to detect tables present in document pages. The algorithm relies on the observation that the tables have distinct columns which implies that gaps between the fields are substantially larger than the gaps between the words in text lines. This deceptively simple observation has led to the design of a simple but powerful table detection system with low computation cost. Moreover, mathematical foundation of the approach is also established including formation of a regular expression for ease of implementation.
In this paper, we introduce a new system to segment and label documentimages into text, halftoned images, and background using a modified fuzzy c-means (FCM) algorithm. Each pixel is assigned a feature vector, extrac...
详细信息
ISBN:
(纸本)0819461040
In this paper, we introduce a new system to segment and label documentimages into text, halftoned images, and background using a modified fuzzy c-means (FCM) algorithm. Each pixel is assigned a feature vector, extracted from edge information and gray level distribution. The feature pattern is then assigned to a specific region using the modified fuzzy c-means approach. In the process of minimizing the new objective function. the neighborhood effect acts as a regularizer and biases the solution towards piecewise-homogeneous labelings. Such a regularization is useful in se,menting scans corrupted by scanner noise.
The task of document image segmentation is to represent a digital image in a more interpretable form, recognising regions containing text, background and graphics. This paper presents a peculiar strategy for document ...
详细信息
ISBN:
(纸本)0889865280
The task of document image segmentation is to represent a digital image in a more interpretable form, recognising regions containing text, background and graphics. This paper presents a peculiar strategy for document image segmentation, where a neuro-fuzzy approach is involved. Firstly, image is segmented into text, graphics or background during a pixel level classification step. Successively, an analysis performed over the obtained regions is devoted to refine the initial segmentation results. A knowledge discovery process is applied to automatically derive from sample data the fuzzy rule bases, responsible of the inference scheme presiding over the classification of image pixels and regions. The proposed method proves to be accurate and robust to page skew and noise.
In this paper, we present a new system to segment and label the contents of scanned documents as either text or image, using a modified fuzzy c-means (FCM) algorithm. Each pixel is assigned a feature pattern extracted...
详细信息
ISBN:
(纸本)0819456462
In this paper, we present a new system to segment and label the contents of scanned documents as either text or image, using a modified fuzzy c-means (FCM) algorithm. Each pixel is assigned a feature pattern extracted from the gray level distribution and computed at different scales. The invariant feature pattern is then assigned to a specific region using fuzzy logic. Our algorithm is formulated by modifying the objective function of the standard FCM algorithm to allow the labeling of a pixel to be influenced by the labels in its immediate neighborhood. The neighborhood effect acts as a regularizer and biases the solution towards piecewise-homogeneous labelings. Such a regularization is useful in segmenting scans corrupted by scanner noise.
Automatic title and author location can be a crucial step in journal documentimage processing systems. This paper presents a Delaunay triangulation-based method for identification of title and author areas in a techn...
详细信息
Automatic title and author location can be a crucial step in journal documentimage processing systems. This paper presents a Delaunay triangulation-based method for identification of title and author areas in a technical documentimage. The positions and alignments of small text line regions are measured by different triangle groups and the character stroke widths are calculated from the constrained Delaunay triangulation. The rules defining spatial features and font attributes of the title and author region are applied to single line text regions to extract the title and author regions. Our experiment results show that the proposed method is effective. (C) 2003 Elsevier B.V. All fights reserved.
The requirement of identifying and segmenting the table of contents (TOC) and index pages in the development of digital library is obvious. Digital document library is created to provide a non-labour intensive, cheap ...
详细信息
ISBN:
(纸本)0769519482
The requirement of identifying and segmenting the table of contents (TOC) and index pages in the development of digital library is obvious. Digital document library is created to provide a non-labour intensive, cheap and flexible way of storing, representing and managing paper document in electronic form to facilitate indexing, viewing, printing and extracting the intended portions. Information from the TOC and index pages be extracted to use in document database for effective retrieval of the required pieces of information. In this paper we present fully auotmatic identification and segmentation of TOC and index pages from scanned document.
In this paper, an algorithm is developed for segmenting documentimages into four classes: background, photograph, text, acid graph. Features used for classification are based on the distribution patterns of wavelet c...
详细信息
In this paper, an algorithm is developed for segmenting documentimages into four classes: background, photograph, text, acid graph. Features used for classification are based on the distribution patterns of wavelet coefficients in high frequency bands. Two important attributes of the algorithm are its multiscale nature-it classifies an image at different resolutions adaptively, enabling accurate classification at class boundaries as well as fast classification overall-and its use of accumulated context information for improving classification accuracy.
A system for segmentation of documentimage and ordering text areas is described, and applied to complex printed page layouts of both Japanese and English. There is no need to make any assumptions about the shape of b...
详细信息
A system for segmentation of documentimage and ordering text areas is described, and applied to complex printed page layouts of both Japanese and English. There is no need to make any assumptions about the shape of blocks, hence the segmentation technique can handle not only skewed images without skew-correction but also documents where columns are not rectangular. In this technique, based on the bottom-up strategy, the connected components are extracted from the reduced image, and classified according to their local information. The connected components classified as characters are then merged into lines, and the lines are merged into areas. Extracted text areas are classified as body, caption, header or footer. A tree graph of the layout of the body texts is made, and the texts ordered by preorder traversal on the graph. We introduce the concept of an influence range of each node, a procedure for handling titles, thus obtaining good results on various documents. The total system is fast and compact.
暂无评论