We propose a text scanner, which detects wide text strings in a sequence of scene images. For scene text detection, we use a multiple-CAMShift algorithm on a text probability image produced by a multi-layer perceptron...
详细信息
We propose a text scanner, which detects wide text strings in a sequence of scene images. For scene text detection, we use a multiple-CAMShift algorithm on a text probability image produced by a multi-layer perceptron...
详细信息
Two years after the first edition, a new Fingerprint Verification Competition (FVC2002) was organized by the authors, with the aim of determining the state-of-the-art in this challenging pattern recognition applicatio...
详细信息
This paper presents an unsupervised range image segmentation based on Kohonen neural network. At first, the derivative and partial derivative of each point are calculated and the normal in each points is gotten. With ...
详细信息
This paper presents an unsupervised range image segmentation based on Kohonen neural network. At first, the derivative and partial derivative of each point are calculated and the normal in each points is gotten. With the character vectors including normal and range value, self-organization map is introduced to cluster. The normal analysis is used to eliminate over-segmentation and the last result is gotten. This method avoid selecting original seeds and uses fewer samples, moreover computes rapidly. The experiment shows the better performance.
This paper proposes a genetic-based algorithm for surface reconstruction of three-dimension (3-D) objects from a group of contours representing its section plane lines. The algorithm can optimize the triangulation of ...
详细信息
This paper proposes a genetic-based algorithm for surface reconstruction of three-dimension (3-D) objects from a group of contours representing its section plane lines. The algorithm can optimize the triangulation of the surface of 3-D objects with a multi-objective optimization function to meet the needs of a wide range of applications. Further, a new crossover operator for triangulation and a new 3-D quadrilateral mutation operator are also introduced.
As a multi-thresholding technique for gray images, the maximum likelihood method under the assumption of mixture of normal distributions and the thresholding method considering the quantization error have already been...
详细信息
As a multi-thresholding technique for gray images, the maximum likelihood method under the assumption of mixture of normal distributions and the thresholding method considering the quantization error have already been proposed. In this report, separability measures are defined from the likelihood criteria which are used as evaluation functions in these methods. The proposed separability measures are invariant under affine transformations of gray level scale and are normalized to be within values from 0 to 1. Binarization for a gray image whose distribution is uniform is considered. By investigating properties of the separability measures in this case, it is shown that consideration of the quantization error is effective and the defined separability measures are valid.
In this paper,a parallel coordinative visual model—The revised Plate Parallel Retrieval Model(Wang 1994) [1]is presented based on the analysis of the global effect of Chinese Characters and the recent neurobiological...
详细信息
In this paper,a parallel coordinative visual model—The revised Plate Parallel Retrieval Model(Wang 1994) [1]is presented based on the analysis of the global effect of Chinese Characters and the recent neurobiological researches on the function of neuroglia in learning and *** theory assumes that visual neurons possess the function of memory and retrieval in addition to the commonly recognised function of signal *** supposes all the pixels on the array of a Chinese character are encoded,stored and relieved simultaneously and sychronously by each neuron separately.
We study the problem of representing images within a multimedia Database Management System (DBMS), in order to support fast retrieval operations without compromising storage efficiency. To achieve this goal, we propos...
We study the problem of representing images within a multimedia Database Management System (DBMS), in order to support fast retrieval operations without compromising storage efficiency. To achieve this goal, we propose new image coding techniques which combine a wavelet representation, embedded coding of the wavelet coefficients, and segmentation of image-domain regions in the wavelet domain. A bitstream is generated in which each image region is encoded independently of other regions, without having to explicitly store information describing the regions. Simulation results show that our proposed algorithms achieve coding performance which compares favorably, both perceptually and objectively, to that achieved using state-of-the-art image/video coding techniques while additionally providing region-based support.
We investigate the performance of selected texture models for the purpose of land use classification. The texture models are evaluated based on the resulting classification error rates. Three classes of texture models...
详细信息
暂无评论