This paper presents a new color document image binarization that is suitable for palm leaf manuscripts and color document images. The proposed method consists of two steps: a pre-processing procedure using low-pass Wi...
详细信息
ISBN:
(纸本)9781424442195
This paper presents a new color document image binarization that is suitable for palm leaf manuscripts and color document images. The proposed method consists of two steps: a pre-processing procedure using low-pass Wiener filter, and contrast adaptive binarization for segmentation of text from the background. Firstly, in the pre-processing stage, low-pass Wiener filter is used to eliminate noisy areas, smoothing of background texture as well as contrast enhancement between background and text areas. Finally, binarization is performed by using contrast adaptive binarization method in order to extract useful text information from low quality document images. The techniques are tested on a set of palm leaf manuscript images/color document images. The performance of the algorithm is demonstrated on by palm leaf manuscripts/color documents distorted with show-through effects, uneven background color and localized spot.
We propose a novel physically based method to simulate explosions and other compressible fluid phenomena. The method uses compressible Navier Stokes equations for modeling the explosion with a Semi-Lagrangian integrat...
详细信息
ISBN:
(纸本)9781424442195
We propose a novel physically based method to simulate explosions and other compressible fluid phenomena. The method uses compressible Navier Stokes equations for modeling the explosion with a Semi-Lagrangian integration method. The proposed integration method addresses the issues of stability and larger timesteps. This is achieved by modifying the Semi-Lagrangian method to reduce dissipation and increase accuracy, using improved interpolation and an error correction method. The proposed method allows the rendering of related phenomena like a fireball, dust and smoke clouds, and the simulation of solid interaction like rigid fracture and rigid body simulation. Our method is flexible enough to afford substantial artistic control over the behavior of the explosion.
Current denoising techniques use the classical orthonormal wavelets for decomposition of an image corrupted with additive white Gaussian noise, upon which various thresholding strategies are built. The use of availabl...
详细信息
ISBN:
(纸本)9781424442195
Current denoising techniques use the classical orthonormal wavelets for decomposition of an image corrupted with additive white Gaussian noise, upon which various thresholding strategies are built. The use of available biorthogonal wavelets in image denoising is less common because of their poor performance. hi this paper, we present a method to design image-matched biorthogonal wavelet bases and report on their potential for denoising. We have conducted experiments on various image datasets namely Natural, Satellite and Medical with the designed wavelets using two existing thresholding strategies. Test results front comparing the performance of matched and fixed biorthogonal wavelets show an average improvement of 35% in MSE for low SNR values (0 to 18db) in every dataset. This improvement was also seen in the PSNR and visual comparisons. This points to the importance of matching when using wavelet-based denoising.
This paper presents a new approach to achieve the performance improvement for the traditional palmprint authentication approaches. The cohort information is used in the matching stage but only when the matching scores...
详细信息
ISBN:
(纸本)9781424442195
This paper presents a new approach to achieve the performance improvement for the traditional palmprint authentication approaches. The cohort information is used in the matching stage but only when the matching scores are inadequate to generate reliable decisions. The cohort information can also be utilized to achieve the significant performance improvement for the combination of modalities and this is demonstrated from the experimental results in this paper. The rigorous palmprint authentication results presented in this paper are the best in the literature and confirm the utility of significant information that can be extracted from the imposter scores. The statistical estimation of confidence level for the palmprint matching requires an excellent match between the theoretical distribution and the real score distribution. The performance analysis presented in this paper, from over 29.96 million imposter matching scores, suggests that Beta-Binomial function can more accurately model the distribution of real palmprint matching scores.
We propose a new method to compress the geometry component of 3D animation sequence. It is based on the Linear Discriminant Analysis (LDA) of the animation geometry data. The redundancy across the animation frames has...
详细信息
ISBN:
(纸本)9781424442195
We propose a new method to compress the geometry component of 3D animation sequence. It is based on the Linear Discriminant Analysis (LDA) of the animation geometry data. The redundancy across the animation frames has been exploited by using the LDA in the temporal direction. Owing to the redundancy between the frames of a class, the covariance matrix of that class for the LDA computation may become singular. To overcome this drawback, we first transform the data into a new basis using the Principal Component Analysis (PCA) and then apply the LDA on a few principal components. The reconstruction is simple and involves two stages: firstly for the LDA and then for the PCA. The experimental results show that the proposed method has the advantage of better reconstruction error at high compression ratios.
A novel nonlinear cooperative approach to image denoising and restoration is presented. Samples from the image field with similar characteristics are first grouped into clusters by first performing image decomposition...
详细信息
ISBN:
(纸本)9781424442195
A novel nonlinear cooperative approach to image denoising and restoration is presented. Samples from the image field with similar characteristics are first grouped into clusters by first performing image decomposition based on the Mumford-Shah model using a total variational framework and performing fuzz), c-means clustering within each image partition. Samples within each cluster are then aggregated using an cooperative Bayesian estimation method based on information from all the samples to provide a nonlinear estimate of the original image. The proposed method exploits information redundancy within each cluster to denoise and restore the original image. Furthermore, the proposed cooperative Bayesian estimation method is capable of suppressing noise and reducing image degradation while preserving image detail by utilizing intra-cluster statistics. The experimental results using different types of images demonstrate that the proposed algorithm provides state-of-the-art image denoising performance in terms of both peak signal-to-noise ratio (PSNR) and subjective visual quality
State of art document segmentation algorithms employ adhoc solutions which use some document properties and iteratively segment the document image. These solutions need to be adapted frequently and sometimes fail to p...
详细信息
ISBN:
(纸本)9781424442195
State of art document segmentation algorithms employ adhoc solutions which use some document properties and iteratively segment the document image. These solutions need to be adapted frequently and sometimes fail to perform well for complex scripts. This calls for a generalized solution that achieves a one shot segmentation that is globally optimal. This paper describes one such solution based on the optimization problem of spectral partitioning which makes the decision of proper segmentation based on the Spectral properties of the pairwise similarity matrix. The solution described in the paper is shown to be general, global and closed form. The claims have been demonstrated on 142 page images from a Telugu book, in a script set in both poetry and prose layouts. This particular class of scripts has been proved to be challenging for the existing state of the art algorithms, where the proposed solution achieves significant results.
In this paper, we propose a novel framework for automated analysis of surveillance videos. By analysis, we imply summarizing and mining of the information in the video for learning usual patterns and discovering unusu...
详细信息
ISBN:
(纸本)9781424442195
In this paper, we propose a novel framework for automated analysis of surveillance videos. By analysis, we imply summarizing and mining of the information in the video for learning usual patterns and discovering unusual ones. We approach this video analysis problem by acknowledging that a video contains information at multiple levels and in multiple attributes. Each such component and co-occurrences of these component values play an important role in characterizing an event as usual or unusual. Therefore, we cluster the video data at multiple levels of abstraction and in multiple attributes and view these clusters as a summary of the information in the video. We apply cluster algebra to mine this summary from multiple perspectives and to adapt association learning for automated selection of components because of which the event is unusual. We also propose a novel incremental clustering algorithm.
The GPUs pack high computation power and a restricted architecture into easily available hardware today. They are now used as computation co-processors and come with programming models that treat them as standard para...
详细信息
ISBN:
(纸本)9781424442195
The GPUs pack high computation power and a restricted architecture into easily available hardware today. They are now used as computation co-processors and come with programming models that treat them as standard parallel architectures. We explore the problem of real time ray casting of large deformable models (over a million triangles) on large displays (a million pixels) on an off-the-shelf GPU in this paper Ray casting is an inherently, parallel and highly compute intensive operation. We build a GPU-efficient three-dimensional data structure for this purpose and a corresponding algorithm that uses it for fast ray casting. We also present fast methods to build the data structure on the SIMD GPUs, including a fast multi-split operation. We achieve real-time ray-casting of a million triangle model onto a million pixels on current Nvidia GPUs using the CUDA model. Results are presented on the data structure building and ray casting on a number of models. The ideas presented here are likely to extend to later models and architectures of the GPU as well as to other multi core architectures.
In shape recognition, a multiscale description provides more information about the object, increases discrimination power and immunity to noise. In this paper, we develop a new multiscale Fourier-based object descript...
详细信息
ISBN:
(纸本)9781424442195
In shape recognition, a multiscale description provides more information about the object, increases discrimination power and immunity to noise. In this paper, we develop a new multiscale Fourier-based object description in 2-D space using a low-pass Gaussian filter (LPGF) and a high-pass Gaussian filter (HPGF), separately. Using the LPGF, at different scales, represents the inner and central part of an object more than the boundary. On the other hand using the HPGF, at different scales, represents the boundary and exterior parts of an object more than the central part. Our algorithms are also organized to achieve size, translation and rotation invariance. Evaluation indicates that representing the boundary and exterior parts more than the central part using the HPGF performs better than the LPGF based multiscale representation, and in comparison to Zernike moments and elliptic Fourier descriptors with respect to increasing noise.
暂无评论