Combining bottom-up and top-down attention influences, a novel region extraction model which based on object-accumulated visual attention mechanism is proposed in this paper. Compared with early research, the new appr...
详细信息
Combining bottom-up and top-down attention influences, a novel region extraction model which based on object-accumulated visual attention mechanism is proposed in this paper. Compared with early research, the new approach brings in prior information at the proper time, updates scan path dynamically, needs less computational resources and reduces the probability to direct the attention to a less-meaning area. The application to search an airport target in remote sensing image was provided, through which the novel mechanism that how visual attention chose the area was described. Compared with another two region extraction models, experimental results confirm the effectiveness of the approach proposed in this paper.
In order to preserve our cultural heritage and for automated document processing libraries and national archives have started digitizing historical documents. In the case of degraded manuscripts (e.g. by mold, humidit...
详细信息
ISBN:
(纸本)9781424421749
In order to preserve our cultural heritage and for automated document processing libraries and national archives have started digitizing historical documents. In the case of degraded manuscripts (e.g. by mold, humidity, bad storage conditions) the text or parts of it can disappear. The remaining parts of the text can be segmented and the ruling can be extrapolated with the a priori knowledge. Since the ruling defines the position of the text within a page, it can be used for layout analysis and as a basis for the enhancement of the readability. Furthermore, information about the scribe (hand) of the manuscript, its spatiotemporal origin can be gained by analyzing the ruling. This paper presents an algorithm for ruling estimation of Glagolitic texts based on text line extraction and is suitable for degraded manuscripts by extrapolating the baselines with the a priori knowledge of the ruling. The algorithm was tested on 30 pages of the Missale Sinaiticum and the evaluation was based on visual criteria.
This paper deals with the enhancement of the readability in historic texts written on parchment. Due to mold, air, humidity, water, etc. parchment and text are partially damaged and consequently hard to read. In order...
详细信息
This paper deals with the enhancement of the readability in historic texts written on parchment. Due to mold, air, humidity, water, etc. parchment and text are partially damaged and consequently hard to read. In order to enhance the readability of the text, the manuscript pages are imaged in different spectral bands ranging from 360 to 1000 nm. The readability enhancement is based on a spectral and spatial analysis of the multivariate image data by multivariate spatial correlation. The main advantage of the method is that especially the text regions are enhanced which is provided by generating a mask image. This mask is based on the automatic reconstruction of the ruling scheme of the text pages. The method is tested on two medieval Slavonic manuscripts written on parchment.
This paper presents a new feature extraction method for iris recognition. Since two dimensional complex wavelet transform (2D-CWT) does not only keep wavelet transformpsilas properties of multiresolution decomposition...
详细信息
ISBN:
(纸本)9781424421749
This paper presents a new feature extraction method for iris recognition. Since two dimensional complex wavelet transform (2D-CWT) does not only keep wavelet transformpsilas properties of multiresolution decomposition analysis and perfect reconstruction, but also adds its new merits: approximate shift invariance, good directional selectivity for 2-D image, and limited redundancy, which are useful for iris feature extraction. So, a set of high frequency 2D-CWT coefficients are selected as features for iris recognition. The phase information of the coefficients is used for feature encoding and Hamming distance is adopted for classification. Experimental results show that the proposed algorithm can get good recognition rate.
In this work, multi-view ear recognition problems are examined in detail. A new multi-view ear recognition approach based on B-Spline pose manifold construction in discriminative projection space which is formed by nu...
详细信息
In this work, multi-view ear recognition problems are examined in detail. A new multi-view ear recognition approach based on B-Spline pose manifold construction in discriminative projection space which is formed by null kernel discriminant analysis (NKDA) feature extraction is presented. Many experiments and comparisons are provided to show the effectiveness of our multi-view ear recognition approach.
Palmprint is one of the most unique and stable biometric characteristics. Although 2D palmprint recognition can achieve high accuracy, the 2D palmprint images can be easily counterfeited and much 3D depth information ...
详细信息
Palmprint is one of the most unique and stable biometric characteristics. Although 2D palmprint recognition can achieve high accuracy, the 2D palmprint images can be easily counterfeited and much 3D depth information is lost in the imaging process. This paper presents a new approach, 3D palmprint recognition, to exploit the 3D structural information of the palm surface. The structured-light imaging is used to acquire the 3D palmprint data, from which the features of Mean Curvature, Gauss Curvature and Surface Type (ST) are extracted. A fast feature matching and score level fusion strategy are then used to classify the input 3D palmprint data. With the established 3D palmprint database, a series of verification and identification experiments are conducted and the results show that 3D palmprint technique can achieve high recognition rate while having high anti-counterfeiting capability.
This paper presents an efficient face segmentation approach based on face attention model and seeded region merging.A face attention model that jointly exploits the information of skin color and eye's position is ...
详细信息
This paper presents an efficient face segmentation approach based on face attention model and seeded region merging.A face attention model that jointly exploits the information of skin color and eye's position is first constructed to obtain a facial saliency map,which indicates the position of possible faces and is used to determine seed *** a seeded region merging algorithm based on regional facial saliency is proposed to generate a sequence of regions,and the region with the highest regional facial saliency is selected to represent each *** results on a variety of images demonstrate the good segmentation performance of the proposed face segmentation algorithm.
Human key posture extraction from videos will benefit video storage, video retrieval, human action recognition, human behaviour understanding and so on. This paper presents an approach to select key postures from huma...
详细信息
ISBN:
(纸本)9781424422944
Human key posture extraction from videos will benefit video storage, video retrieval, human action recognition, human behaviour understanding and so on. This paper presents an approach to select key postures from human action sequences using 2D information. There are two steps in the proposed method. Information measurement which is a kind of global feature of a frame is used to roughly find key posture candidates. Then, a body skeleton feature which is a kind of local feature is applied to select final key postures from the candidates obtained in the first step. The experiments show that the proposed method is efficient.
Past decades, numerous spectral analysis based algorithms have been proposed for dimensionality reduction, which plays an important role in machine learning and artificial intelligence. However, most of these existing...
详细信息
Past decades, numerous spectral analysis based algorithms have been proposed for dimensionality reduction, which plays an important role in machine learning and artificial intelligence. However, most of these existing algorithms are developed intuitively and pragmatically, i.e., on the base of the experience and knowledge of experts for their own purposes. Therefore, it will be more informative to provide some a systematic framework for understanding the common properties and intrinsic differences in the algorithms. In this paper, we propose such a framework, i.e., ldquopatch alignmentrdquo, which consists of two stages: part optimization and whole alignment. With the proposed framework, various algorithms including the conventional linear algorithms and the manifold learning algorithms are reformulated into a unified form, which gives us some new understandings on these algorithms.
暂无评论