Automatic facial expression recognition (FER) has gained enormous interest among the computervision researchers in recent years because of its potential deployment in many industrial, consumer, automobile, and societ...
详细信息
ISBN:
(纸本)9789813290884;9789813290877
Automatic facial expression recognition (FER) has gained enormous interest among the computervision researchers in recent years because of its potential deployment in many industrial, consumer, automobile, and societal applications. There are a number of techniques available in the literature for FER;among them, many appearance-based methods such as local binary pattern (LBP), local directional pattern (LDP), local ternary pattern (LTP), gradient local ternary pattern (GLTP), and improved local ternary pattern (IGLTP) have been shown to be very efficient and accurate. In this paper, we propose a new descriptor called local neighborhood difference binary pattern (LNDBP). This new descriptor is motivated by the recent success of local neighborhood difference pattern (LNDP) which has been proven to be very effective in image retrieval. The basic characteristic of LNDP as compared with the traditional LBP is that it generates binary patterns based on a mutual relationship of all neighboring pixels. Therefore, in order to use the benefit of both LNDP and LBP, we have proposed LNDBP descriptor. Moreover, since the extracted LNDBP features are of higher dimension, therefore a dimensionality reduction technique has been used to reduce the dimension of the LNDBP features. The reduced features are then classified using the kernel extreme learning machine (K-ELM) classifier. In order to, validate the performance of the proposed method, experiments have been conducted on two different FER datasets. The performance has been observed using well-known evaluation measures, such as accuracy, precision, recall, and F1-score. The proposed method has been compared with some of the state-of-the-art works available in the literature and found to be very effective and accurate.
Recently computervision and Natural language processing paradigm contains enormous research progress in their respective areas. Despite the progress in both areas, still it remains as a challenging task for machines ...
详细信息
ISBN:
(纸本)9781538678084
Recently computervision and Natural language processing paradigm contains enormous research progress in their respective areas. Despite the progress in both areas, still it remains as a challenging task for machines to extract image semantics and then communicate this extracted information with the desired users. These problems will be solved by Visual Question Answering (VQA) system by connecting both computervision and natural language processing paradigms. In VQA, system is presented with an image and textual question related to that image. The system will generate the answer by processing on both image and textual features. Answer generated by VQA is in one word, phrase or in sentence. Various datasets are available for training and evaluating VQA system which contains real or abstract images and question-answer pairs related to the semantics available in the image. VQA is being used in many areas such as for blind and visually impaired users, robotics, art gallery and many more areas. This paper discusses VQA techniques, VQA datasets and highlights the parametric evaluation of these techniques along with generic issues in VQA system.
image enhancement as an effective means to improve image quality, is still a basic topic in imageprocessing, which has attracted largely attentions from researches. Yet among enhancement algorithms, grayscale correct...
详细信息
ISBN:
(纸本)9781450365307
image enhancement as an effective means to improve image quality, is still a basic topic in imageprocessing, which has attracted largely attentions from researches. Yet among enhancement algorithms, grayscale correction, including gray transformation and histogram processing, is widely used because of its simple principle. Hence, this paper discuss commonly used algorithms in above two categories, and apply them for different scenarios, to obtain their enhance effect. Through analyze and compare their distinctions from theory to experiment, we found that these methods are still to be improved, for instance in real-time, and grayscale. This paper is a fundamental investigate on imageprocessing, which could provide a reference for further research on image enhancement.
At present, the study of non-contact heart rate measurement based on vision is mostly a theoretical method. A few of the relevant algorithms are only described by the MATLAB script language. The specific research and ...
详细信息
ISBN:
(纸本)9781538663967
At present, the study of non-contact heart rate measurement based on vision is mostly a theoretical method. A few of the relevant algorithms are only described by the MATLAB script language. The specific research and design are still basically blank. Contact-type heart rate measurement devices, while accurate, are expensive, inconvenient to carry, and have a limited range of applications. Existing non-contact heart rate devices require a dedicated light source for measurement. Therefore, aiming at the above issues, this paper studies the design and implementation of a non-contact embedded device based on vision, proposes a design scheme, and perfects the currently known non-contact heart rate detection theory based on actual test results. Combining the high performance, low cost, and low power consumption of the ARM processor, a new method for determining the heartbeat crest is proposed, and a low-cost vision-based embedded heart rate device is designed and implemented for the first time.
We have been witnessing lately a convergence among mathematical morphology and other nonlinear fields, such as curve evolution, PDE-based geometrical imageprocessing, and scale-spaces. An obvious benefit of such a co...
详细信息
We have been witnessing lately a convergence among mathematical morphology and other nonlinear fields, such as curve evolution, PDE-based geometrical imageprocessing, and scale-spaces. An obvious benefit of such a convergence is a cross-fertilization of concepts and techniques among these fields. The concept of adjunction however, so fundamental in mathematical morphology, is not yet shared by other disciplines. The aim of this paper is to show that other areas in imageprocessing can possibly benefit from the use of adjunctions. In particular, a strong relationship between pyramids and adjunctions is presented. We show how this relationship may help in analyzing existing pyramids, and construct new pyramids. Moreover, it will be explained that adjunctions based on a curve evolution scheme can provide idempotent shape filters. This idea is illustrated in this paper by means of a simple affine-invariant polygonal flow. Finally, the use of adjunctions in scale-space theory is also addressed.
image segmentation is a crucial step in imageprocessing having various applications in biomedical image analysis. Segmentation of the magnetic resonance images of the brain is one such key area in biomedical image an...
详细信息
ISBN:
(纸本)9783031585340;9783031585357
image segmentation is a crucial step in imageprocessing having various applications in biomedical image analysis. Segmentation of the magnetic resonance images of the brain is one such key area in biomedical image analysis that segments various tissues in the brain and detects tumor regions. In this paper, an unsupervised rough spatial ensemble kernelized fuzzy clustering segmentation algorithm is presented for automated segmentation of magnetic resonance images of the brain. The proposed algorithm is an integration of Rough Fuzzy C Means clustering and the kernel method with a novel ensemble kernel being a combination of spherical kernel, Gaussian, and Cauchy kernels, which improves the performance of the segmentation algorithm. The proposed algorithm performs better than the existing clustering algorithms across a wide range of magnetic resonance images of the brain along with visual indications obtained from the results.
A new method for measuring the dimensional accuracy of the cylindrical spun parts based on machine vision was proposed to overcome the artificial deviation and low efficiency of manual measurement. The image acquisiti...
详细信息
A new method for measuring the dimensional accuracy of the cylindrical spun parts based on machine vision was proposed to overcome the artificial deviation and low efficiency of manual measurement. The image acquisition system of machine vision was built up. The methods of imageprocessing and edge extraction of cylindrical spun parts were studied. The straightness and ovality of the cylindrical spun parts were obtained by the proposed new method. The results showed that the edge contour of the cylindrical spun parts extracted by Canny edge detector is better than Sobel and Prewitt edge detector. The dimensional accuracy of the cylindrical spun parts can be obtained accurately by the proposed measurement method based on machine vision. The relative errors of the straightness and ovality between the machine vision and the manual measurement are less than 10%.
In this paper we present a technique to measure the crater wear using imageprocessing and automatic focusing. The new contour detection algorithm, which can adapt in a noisy image, is suggested. It is suitable for el...
详细信息
In this paper we present a technique to measure the crater wear using imageprocessing and automatic focusing. The new contour detection algorithm, which can adapt in a noisy image, is suggested. It is suitable for eliminating high frequency noises without blurring and with lower processing time. An automatic focusing technique is applied to measure a crater wear depth with a one-dimensional search algorithm for finding the best focus. This method is implemented in the tool microscope driven by a servo motor. The results show that the contour and the depth of crater wear can be measured reliably.
image augmentation of automatic facial expression classification is proposed on the basis of a combination of a deep neural network and a support vector machine. A neural network pre-trained with a large-scale object ...
详细信息
ISBN:
(纸本)9781450365307
image augmentation of automatic facial expression classification is proposed on the basis of a combination of a deep neural network and a support vector machine. A neural network pre-trained with a large-scale object image database is used as a feature extractor for facial images. The accuracy of system performance is evaluated using the database "ATR Facial Expression image Database (DB99)." By using image augmentation, an average recognition rate of 97.92% was obtained, which was a 9.84 percentage point improvement compared with that without augmentation. The experimental results showed the effectiveness of our scheme.
In this paper, a method for distance measurement only using a single target image without any internal camera parameters is developed. The mapping relation between image row pixel values and the actual distances is es...
详细信息
ISBN:
(纸本)9781538663967
In this paper, a method for distance measurement only using a single target image without any internal camera parameters is developed. The mapping relation between image row pixel values and the actual distances is established by detecting and locating the corners on the reference target image. The distance information is extracted in real time by combining the moving target detection method based on Gaussian mixture model (GMM) and the shadow elimination in Hue-Saturation-Intensity (HSI) color space. The method has a simple calibration procedure with a single image, which is suitable for practical application. The experimental result shows that the algorithm is effective and satisfies the real-time and accurate requirements of distance detection.
暂无评论