Nowadays, video security systems are essential for supervision everywhere, for example video conference, WhatsApp, ATM, airport, railway station, and other crowded places. In multi-view video systems, various cameras ...
详细信息
Emotion detection has been achieved widely in facial and voice recognition separately with considerable success. The 6 emotional categories coming out of the classification include anger, fear, disgust, happiness and ...
详细信息
ISBN:
(纸本)9781450365291
Emotion detection has been achieved widely in facial and voice recognition separately with considerable success. The 6 emotional categories coming out of the classification include anger, fear, disgust, happiness and surprise. These can be infered from one's facial expressions both in the form of micro and macro expressions. In facial expressions the emotions are derived by feature extracting the facial expressions in different facial poses and classifying the expression feature vectors derived. Similarly automatic classification of a person's speech's affective state has also been used in signal processing to give insights into the nature of emotions. Speech being a critical tool for communication has been used to derive the emotional state of a human being. Different approaches have been successfully used to derive emotional states either in the form of facial expression recognition or speech emotional recognition being used. Less work has looked at fusing the two approaches to see if this improves emotional recognition accuracy. The study analyses the strengths of both and also limitations of either. The study reveals that emotional derivation based on facial expression recognition and acoustic information complement each other and a fusion of the two leads to better performance and results compared to the audio or acoustic recognition alone.
As of late, deep learning has gained remarkable growth in various fields, for example, computervision and natural language processing. Contrasted with conventional machine learning strategies, deep learning has a rob...
详细信息
ISBN:
(纸本)9781538684313
As of late, deep learning has gained remarkable growth in various fields, for example, computervision and natural language processing. Contrasted with conventional machine learning strategies, deep learning has a robust learning capacity and can improve utilization of datasets for feature extraction. In view of its practicability, deep learning turns out to be increasingly mainstream for many researchers to do research works. In this paper we mainly focus on the optimization of different parameters of convolutional neural network of deep learning for classifying 8000 labelled natural images of cat and dog. First the convolutional neural network is trained to learn features then ANN binary classifier is used for classification. Various level of optimization have been done to improve the performance level of the network and finally, we achieved the best classification accuracy of 88.31%.
Object detection from repository of images is challenging task in the area of computervision andimageprocessing in this work we present object classification and detection using cifar-10 data set with intended clas...
详细信息
In this paper, we developed a hand gesture recognition technology in a human-computer interaction system with several hand gestures. The proposed system can help teacher to control the multi-screen touchable teaching ...
详细信息
ISBN:
(纸本)9781450364607
In this paper, we developed a hand gesture recognition technology in a human-computer interaction system with several hand gestures. The proposed system can help teacher to control the multi-screen touchable teaching tools, such as sweeping right or left to access the previous or next slide, with a fist to call the eraser tool to rub out the wrong content. To verify its efficiency and other qualities, we conducted a quasi-experiment in our program site in east part of China which analyzed the pre- and post-test scores in Math class of each experimental groups. Moreover, the error recognition rate is reduced by increasing the relevant features and threshold training for the teaching application. Experimental results demonstrate that the proposed system achieves the high accuracy and real-time performance.
Estimation of phase from the complex interference field has become an emerging area of research for last few decades. The phase values obtained by using arctan function are limited to the interval (-π, π]. Such phas...
详细信息
Watermarking has been supplicate as a tool for the preservation of cognitive property rights of multimedia contents. Because of their digital nature, multimedia documents can be mimeographed, reformed, metamorphosed, ...
详细信息
ISBN:
(纸本)9781538611418
Watermarking has been supplicate as a tool for the preservation of cognitive property rights of multimedia contents. Because of their digital nature, multimedia documents can be mimeographed, reformed, metamorphosed, and promulgated very effortlessly. In this ambience, it is indispensable to burgeon a system for copyright protection, preservation against mimeograph, and validation of contents. In this case, a watermark data is embedded into the actual multimedia content in such a way that it is long-lasting to the content itself. Later on, such watermark can be extracted to manifest ownership to pursue the distribution of the patent toil through the network. In this developed watermarking, predominantly concentrated on Discrete Wavelet Transform (DWT) and Singular Value Decomposition (SVD). Apart from this, it also prevents the copyright of virtual views which is generated by using a technique called Depth-image-Based-Rendering. In this approach, secret data is inserted in the actual center vision using two mathematical transform, DWT and SVD. Then, multiple vision are automatically produced from the watermarked middle view and their corresponding profundity map at the content provider side. Finally watermark will be able to removed from all the three views in a sightless vogue without using the actual content of all the three views. In this proposed watermarking strategy is much better strong to image compression, noise addition and geometric distortion.
The principle model and calibration method of measurement are very important to the measurement system. In the vision measurement system, the auxiliary light source, such as laser, interference light, sine grating and...
详细信息
In this paper, we propose a novel automatic license plate recognition (ALPR) method based on convolutional neural network to achieve a better performance in detecting and recognizing license plate (LP) with relatively...
详细信息
ISBN:
(纸本)9781450366137
In this paper, we propose a novel automatic license plate recognition (ALPR) method based on convolutional neural network to achieve a better performance in detecting and recognizing license plate (LP) with relatively large angle of inclination. Most existing methods only perform well on dataset where LPs are presented in almost upright position with little or no tilted angle. While, in practice, the LP images collected by roadside cameras or hand-held image capturing devices can be fairly slanted, which causes great difficulties on recognition tasks. To solve this problem, we design an angle correction module and integrate it into a holistic ALPR model with a spatial transformer network embedded inside. The whole model can be trained end-to- end by back-propagation. A large and comprehensive rotated LP dataset Rlpd is collected and introduced in our work for model training and testing. Through extensive experiments, this approach is proved to have a better performance on tilted license plate dataset in terms of accuracy and computational cost than other state-of-the-art methods.
This paper mainly gives an introduction to the machine vision-based TFT-LCD point defect detection control system. As the use of otsu algorithm for segmentation is not good enough for TFT-LCD image in the condition of...
详细信息
This paper mainly gives an introduction to the machine vision-based TFT-LCD point defect detection control system. As the use of otsu algorithm for segmentation is not good enough for TFT-LCD image in the condition of weak contrast ratio of the object and background, it is proposed to improve the threshold and extract function by Wible function, and the optimized otsu algorithm has better segmentation effect than the traditional otsu algorithm in the condition of weak contrast ratio of the object and background. After the testing by transplanting the Matlab imageprocessing program to the hardware platform of TFT-LCD defect detection control system, the experimental results show that the detection control system can quickly and accurately process three types of defects, and the shortest detection time can be shortened to 4.5 s.
暂无评论