Today, great focus has been placed on context-aware human-machine interaction, where systems are aware not only of the surrounding environment, but also about the mental/affective state of the user. Such knowledge can...
详细信息
ISBN:
(纸本)9781509037247
Today, great focus has been placed on context-aware human-machine interaction, where systems are aware not only of the surrounding environment, but also about the mental/affective state of the user. Such knowledge can allow for the interaction to become more human-like. To this end, automatic discrimination between laughter and speech has emerged as an interesting, yet challenging problem. Typically, audio- or video-based methods have been proposed in the literature;humans, however, are known to integrate both sensory modalities during conversation and/or interaction. As such, this paper explores the fusion of support vector machine classifiers trained on localbinary pattern (LBP) video features, as well as speech spectral and prosodic features as a way of improving laughter detection performance. Experimental results on the publicly-available MAHNOB Laughter database show that the proposed audio-visual fusion scheme can achieve a laughter detection accuracy of 93.3%, thus outperforming systems trained on audio or visual features alone.
Melanoma is a deadly form of skin lesion for which the mortality rate can be significantly reduced, if detected at an early stage. Clinical findings have shown that an early detection of melanoma can be done by an ins...
详细信息
ISBN:
(纸本)9783319248349;9783319248332
Melanoma is a deadly form of skin lesion for which the mortality rate can be significantly reduced, if detected at an early stage. Clinical findings have shown that an early detection of melanoma can be done by an inspection of visual characteristics of some specific regions (lesions) of the skin. This paper proposes a pattern recognition system that includes three vital stages to conform the analysis of skin lesions by the clinicians: segmentation, feature extraction, and classification. Segmentation is performed using active contours with creasness features. The feature extraction phase consists of a variant of localbinary pattern (LBP) in which joint histogram of LBP pattern along with the contrast of the patterns are used to extract scale adaptive patterns at each pixel. Classification was performed using support vector machines. Experimental results demonstrate the superiority of the proposed feature set over several other state-of-the-art texture feature extraction methods for melanomas detection. The results indicate the significance of contrast of the pattern along with LBP patterns.
In recent years, the binary coding of face image features, such as local binary patterns (LBP) and local ternary patterns (LTP) have become popular in face recognition systems. These local feature descriptors provide ...
详细信息
ISBN:
(纸本)9781479952779
In recent years, the binary coding of face image features, such as local binary patterns (LBP) and local ternary patterns (LTP) have become popular in face recognition systems. These local feature descriptors provide a simple and powerful means for texture description. In this paper, we present a novel approach, which uses these descriptors to represent face images, and a similarity feature-based selection and classification algorithm to improve recognition rate. The face image is first divided into small regions from which LBP and LTP histograms are extracted and concatenated into a single feature vector. The proposed algorithm is used to select the similarity features of training set and classify the face image. The experiments are conducted on the ORL Database of Faces and the Extended Yale Face Database B. The results clearly show the superiority of the proposed algorithm.
A novel local feature descriptor, namely Directional local binary patterns (DLBP), was proposed in this paper and applied for face recognition. The descriptor first extracts directional edge information, then codes th...
详细信息
ISBN:
(纸本)9783642254482
A novel local feature descriptor, namely Directional local binary patterns (DLBP), was proposed in this paper and applied for face recognition. The descriptor first extracts directional edge information, then codes these information using local binary patterns (LBP). When applied for face recognition, a face image is divided into a number of small sub-windows, DLBP histogram extracted from each sub-window are then concatenated to form a global description of the face. The proposed method was extensively evaluated on two publicly available databases. i.e. the FERET face database and the PolyU-NIRED near-infrared face database. Experimental results show advantages of DLBP over LBP and Directional binary Code (DBC).
This paper proposes the combination of different variants of local binary patterns for texture retrieval. Recent studies have shown that LBP features extracted at multiple resolutions give the best performance for a s...
详细信息
ISBN:
(纸本)9781479941667
This paper proposes the combination of different variants of local binary patterns for texture retrieval. Recent studies have shown that LBP features extracted at multiple resolutions give the best performance for a standard texture retrieval collection. Techniques have been proposed to create a multi-dimensional histogram of these features. In this study we use a simpler approach. We hypothesize that the different variants of LBP may actually extract slightly different but useful image information. If this were the case, then using them in combination will result in further improved performance, compared to using each one independently. We demonstrate this by using LBP, LBp(ri), and LBp(u2) features in combination. We take ideas from the GNU Image Finding Tool (GIFT), and use separate normalisation to ensure the different feature vector lengths do not bias the system towards one of the features. We performed experiments on the two standard texture collections. Our results demonstrate that combining the different LBP features do indeed result in improved performance. In fact, for one of the collections (Outex_TR_00000) collections, using our method give better performance than the current state-of-the-art.
Real-time fusion of Magnetic Resonance (MR) and Trans Rectal Ultra Sound (TRUS) images aid in the localization of malignant tissues in TRUS guided prostate biopsy. Registration performed on segmented contours of the p...
详细信息
ISBN:
(纸本)9780819485045
Real-time fusion of Magnetic Resonance (MR) and Trans Rectal Ultra Sound (TRUS) images aid in the localization of malignant tissues in TRUS guided prostate biopsy. Registration performed on segmented contours of the prostate reduces computational complexity and improves the multimodal registration accuracy. However, accurate and computationally efficient segmentation of the prostate in TRUS images could be challenging in the presence of heterogeneous intensity distribution inside the prostate gland, and other imaging artifacts like speckle noise, shadow regions and low Signal to Noise Ratio (SNR). In this work, we propose to enhance the texture features of the prostate region using local binary patterns (LBP) for the propagation of a shape and appearance based statistical model to segment the prostate in a multi-resolution framework. A parametric model of the propagating contour is derived from Principal Component Analysis (PCA) of the prior shape and texture information of the prostate from the training data. The estimated parameters are then modified with the prior knowledge of the optimization space to achieve an optimal segmentation. The proposed method achieves a mean Dice Similarity Coefficient (DSC) value of 0.94 +/- 0.01 and a mean segmentation time of 0.68 +/- 0.02 seconds when validated with 70 TRUS images of 7 datasets in a leave-one-patient-out validation framework. Our method performs computationally efficient and accurate prostate segmentation in the presence of intensity heterogeneities and imaging artifacts.
We investigate the discriminant power of two local and two global texture measures on virus images. The viruses are imaged using negative stain transmission electron microscopy. local binary patterns and a multi scale...
详细信息
ISBN:
(纸本)9783642250842
We investigate the discriminant power of two local and two global texture measures on virus images. The viruses are imaged using negative stain transmission electron microscopy. local binary patterns and a multi scale extension are compared to radial density profiles in the spatial domain and in the Fourier domain. To assess the discriminant potential of the texture measures a Random Forest classifier is used. Our analysis shows that the multi scale extension performs better than the standard local binary patterns and that radial density profiles in comparison is a rather poor virus texture discriminating measure. Furthermore, we show that the multi scale extension and the profiles in Fourier domain are both good texture measures and that they complement each other well, that is, they seem to detect different texture properties. Combining the two, hence, improves the discrimination between virus textures.
This paper presents a novel framework for objective measurement of facial paralysis in biomedial videos. The motion information in the horizontal and vertical directions and the appearance features on the apex frames ...
详细信息
ISBN:
(纸本)9781424418145
This paper presents a novel framework for objective measurement of facial paralysis in biomedial videos. The motion information in the horizontal and vertical directions and the appearance features on the apex frames are extracted based on the local binary patterns (LBP) on the temporal-spatial domain in each facial region. These features are temporally and spatially enhanced by the application of block schemes. A multi-resolution extension of uniform LBP is proposed to efficiently combine the micro-patterns and large-scale patterns into a feature vector, which increases the algorithmic robustness and reduces noise effects while still retaining computational simplicity. The symmetry of facial movements is measured by the Resistor-Average Distance (RAD) between LBP features extracted from the two sides of the face. Support Vector Machine (SVM) is applied to provide quantitative evaluation of facial paralysis based on the House-Brackmann (H-B) Scale. The proposed method is validated by experiments with 197 subject videos, which demonstrates its accuracy and efficiency.
local binary patterns (LBP) and its variants are widely used for texture classification. In this paper we propose a new variant of LBP descriptor called the extended Gaussian filtered local binary patterns (xGF-LBP) w...
详细信息
ISBN:
(纸本)9781479930227
local binary patterns (LBP) and its variants are widely used for texture classification. In this paper we propose a new variant of LBP descriptor called the extended Gaussian filtered local binary patterns (xGF-LBP) which is robust to illumination changes, noise and captures more informative edge-like features for classification. Experiments on a colonoscopy image dataset with 2100 images for binary ('normal' or 'abnormal') classification show that the proposed xGF-LBP descriptor significantly outperforms the standard LBP descriptor and its considered variants.
Facial expression recognition has widely been investigated in the literature. The need of accurate facial alignment has however limited the deployment of facial expression systems in real-world applications. In this p...
详细信息
ISBN:
(纸本)9783642388866;9783642388859
Facial expression recognition has widely been investigated in the literature. The need of accurate facial alignment has however limited the deployment of facial expression systems in real-world applications. In this paper, a novel feature extraction method is proposed. It is based on extracting local binary patterns (LBP) from image key points. The face region is first segmented into six facial components (left eye, right eye, left eyebrow, right eyebrow, nose, and mouth). Then, local binary patterns are extracted only from the edge points of each facial component. Finally, the localbinary pattern features are collected into a histogram and fed to an SVM classifier for facial expression recognition. Compared to the traditional LBP methodology extracting the features from all image pixels, our proposed approach extracts LBP features only from a set of points of face components, yielding in more compact and discriminative representations. Furthermore, our proposed approach does not require face alignment. Extensive experimental analysis on the commonly used JAFFE facial expression benchmark database showed very promising results, outperforming those of the traditional localbinary pattern approach.
暂无评论