In face recognition, the dimensionality of raw data is very high, dimension reduction (Feature Extraction) should be applied before classification. There exist several feature extraction methods, commonly used are Pri...
详细信息
Blurred images are caused by many factors such as defocus, motion, and atmospheric turbulence. Due to the unknown various factors that cannot be distinguished in the blurred image, it is necessary to propose a unified...
详细信息
This paper presents a texture segmentation approach which is based on the Markov random field model (MRF) and feed forward neural *** texture is modeled by the second order Gauss MRF model, and the least square error ...
详细信息
This paper presents a texture segmentation approach which is based on the Markov random field model (MRF) and feed forward neural *** texture is modeled by the second order Gauss MRF model, and the least square error estimation is employed for the solution of model parameters. To perform texture segmentation, we introduced an improved BP algorithm to get faster learning speed. Experiment shows that better segmentation results can be obtained than the traditional Euclidean distance method.
This paper introduced a novel high performance algorithm and VLSI architectures for achieving bit plane coding (BPC) in word level sequential and parallel mode. The proposed BPC algorithm adopts the techniques of co...
详细信息
This paper introduced a novel high performance algorithm and VLSI architectures for achieving bit plane coding (BPC) in word level sequential and parallel mode. The proposed BPC algorithm adopts the techniques of coding pass prediction and parallel & pipeline to reduce the number of accessing memory and to increase the ability of concurrently processing of the system, where all the coefficient bits of a code block could be coded by only one scan. A new parallel bit plane architecture (PA) was proposed to achieve word-level sequential coding. Moreover, an efficient high-speed architecture (HA) was presented to achieve multi-word parallel coding. Compared to the state of the art, the proposed PA could reduce the hardware cost more efficiently, though the throughput retains one coefficient coded per clock. While the proposed HA could perform coding for 4 coefficients belonging to a stripe column at one intra-clock cycle, so that coding for an NxN code-block could be completed in approximate N2/4 intra-clock cycles. Theoretical analysis and experimental results demonstrate that the proposed designs have high throughput rate with good performance in terms of speedup to cost, which can be good alternatives for low power applications.
This paper addresses the application of hand gesture recognition in monocular image sequences using Active Appearance Model (AAM). For this work, the proposed algorithm is conposed of constructing AAMs and fitting the...
详细信息
This paper addresses the application of hand gesture recognition in monocular image sequences using Active Appearance Model (AAM). For this work, the proposed algorithm is conposed of constructing AAMs and fitting the models to the interest region. In training stage, according to the manual labeled feature points, the relative AAM is constructed and the corresponding average feature is obtained. In recognition stage, the interesting hand gesture region is firstly segmented by skin and movement ***, the models are fitted to the image that includes the hand gesture, and the relative features are ***, the classification is done by comparing the extracted features and average features. 30 different gestures of Chinese sign language are applied for testing the effectiveness of the method. The Experimental results are given indicating good performance of the algorithm.
Irregular pyramids are made of a stack of successively reduced graphs embedded in the plane. Each vertex of a reduced graph corresponds to a connected set of vertices in the level below. One connected set of vertices ...
详细信息
A classifier-based method to select and fuse grey level co-occurrence matrix (GLCM), Gaussian Markov random field (GMRF) and discrete wavelet transform (DWT) features to improve texture discrimination is presented. Fe...
详细信息
Automated tongue image segmentation in tongue diagnosis system of traditional Chinese medicine is difficult due to two factors: There are lots of pathological details on the surface of tongue, and the shapes of tongue...
详细信息
Chromosome karyotyping is a critical way to diagnose various hematological malignancies and genetic diseases,of which chromosome detection in raw metaphase cell images is the most critical and challenging *** this wor...
详细信息
Chromosome karyotyping is a critical way to diagnose various hematological malignancies and genetic diseases,of which chromosome detection in raw metaphase cell images is the most critical and challenging *** this work,focusing on the joint optimization of chromosome localization and classification,we propose ChromTR to accurately detect and classify 24 classes of chromosomes in raw metaphase cell *** incorporates semantic feature learning and class distribution learning into a unified DETR-based detection ***,we first propose a Semantic Feature Learning Network(SFLN)for semantic feature extraction and chromosome foreground region segmentation with object-wise ***,we construct a Semantic-Aware Transformer(SAT)with two parallel encoders and a Semantic-Aware decoder to integrate global visual and semantic *** provide a prediction with a precise chromosome number and category distribution,a Category Distribution Reasoning Module(CDRM)is built for foreground-background objects and chromosome class distribution *** evaluate ChromTR on 1404 newly collected R-band metaphase images and the public G-band dataset *** proposed ChromTR outperforms all previous chromosome detection methods with an average precision of 92.56%in R-band chromosome detection,surpassing the baseline method by 3.02%.In a clinical test,ChromTR is also confident in tackling normal and numerically abnormal *** extended to the chromosome enumeration task,ChromTR also demonstrates state-of-the-art performances on R-band and G-band two metaphase image *** these superior performances to other methods,our proposed method has been applied to assist clinical karyotype diagnosis.
In this paper, a face recognition method using local qualitative representations is proposed to solve the problem of face recognition in varying lighting. Based on the observation that the ordinal relationship between...
详细信息
ISBN:
(纸本)9780819469526
In this paper, a face recognition method using local qualitative representations is proposed to solve the problem of face recognition in varying lighting. Based on the observation that the ordinal relationship between the average brightness of image regions pair is invariant under lighting changes, Local Binary Mapping is defined as an illumination invariant for face recognition based on Local Binary pattern descriptor, which extracts the local variance features of an image. For the 'symbol' feature vector, hamming distance is used as similarity measurement. It has been proved that the proposed method can provide the accuracy of 100 percent for subset 2, 3, 4 and 98.89 percent for subset 5 of the Yale facial database B when all images in subset 1 are used as gallery.
暂无评论