This paper proposes techniques for facedetection using Haar-like features as weak classifiers and gives the implementation details for an FPGA development board. We analyze and discuss the relation between the system...
详细信息
This paper proposes techniques for facedetection using Haar-like features as weak classifiers and gives the implementation details for an FPGA development board. We analyze and discuss the relation between the system computation cost and selection of the image scaling factor. Based on the empirical results of our previous work, we give a new method to select the stop threshold for the image reduction process, which reduces the total computation by half. We present and implement an improved integral image pipeline calculation design. We also provide a color image output mode to let our system enjoy more human-oriented design. Test results show that the system achieves real-time facedetection speed (100fps) and a high facedetection rate (87.2%) for an SVGA (600 x 800) video source. The low power consumption (3.5 W) is another advantage over previous work. (C) 2012 Elsevier B.V. All rights reserved.
Although machine learning and artificial intelligence have been widely applied, noise and interference are still major disturbances to degrade the quality of image transmission and processing efficiency in multimedia ...
详细信息
ISBN:
(纸本)9781665449083
Although machine learning and artificial intelligence have been widely applied, noise and interference are still major disturbances to degrade the quality of image transmission and processing efficiency in multimedia data transmission. Based on the face detection system utilizing the machine learning algorithms and artificial intelligence, this paper examines, analyzes and compares the performances of the AdaBoost machine learning algorithm and Convolutional Neural Network (CNN)-based algorithm in processing face information, which is disturbed by channel noise and fading effect encountered in the transmission of face image. The face detection system used is based on HAAR feature extraction. The extracted HAAR features are subjected to classification training and learning of the cascade classifier. Then the facedetection is performed on the picture information outside the database. Results of computer simulation show that for the image data affected by the fading and AWGN, the face recognition system still marked the positions of the eyes and mouth with high accuracy. However for fading and higher AWGN, using a machine learning algorithm with a convolutional neural network is better than the ADABOOST algorithm. It can be concluded that the machine learning algorithm can effectively reduce the adverse effect of multimedia data transmission without increasing the SNR and use of higher level of modulation scheme.
This paper proposes techniques for facedetection and gives the implementation details for an FPGA development board. We analyze and discuss the relation between the system computation cost and selection of the image ...
详细信息
ISBN:
(纸本)9780769543017
This paper proposes techniques for facedetection and gives the implementation details for an FPGA development board. We analyze and discuss the relation between the system computation cost and selection of the image scaling factor. We give a new method to select the stop threshold for the image reduction process, which reduces the total computation by half. We also provide a color image output mode to let our system enjoy more human-oriented design. Test results show that the system achieves real-time facedetection speed (100fps) and a high facedetection rate (87.2%) for an SVGA (600 x 800) video source. The low power consumption (3.5W) is another advantage over previous work.
The importance of correctly identifying a student in a distance learning environment comes from the necessity to avoid fraud and the fact that the student plays the central role in this kind of learning process claims...
详细信息
The importance of correctly identifying a student in a distance learning environment comes from the necessity to avoid fraud and the fact that the student plays the central role in this kind of learning process claims that the student should be correctly assessed, identified, and authenticated. The Learning Management systems available nowadays provides only the user authentication engine based on username/password, which increases the susceptibility to fraud, when the authentication certainty level about the actual user, given for this mechanism, is practically void. In this article we proposed the safe and fast approach for authentication system for distance learning. The certification methods of face recognition were adopted in proposed system. The high recognition rate of the implemented system in four cities of Iran shows that the distance education systems are more secure and reliable than the traditional authentication system, and as a result, promote the development of distance education, in future.(c) 2011 Wiley Periodicals, Inc. Comput Appl Eng Educ 22:593-603, 2014;View this article online at;DOI
Obtaining a real-time implementation for a face detection system is the first step towards human-machine interaction. This paper presents an architecture, implementable on an FPGA, for accelerating the Haar-based face...
详细信息
ISBN:
(纸本)9781479964994
Obtaining a real-time implementation for a face detection system is the first step towards human-machine interaction. This paper presents an architecture, implementable on an FPGA, for accelerating the Haar-based facedetection algorithm through use of multiple dedicated processing units by utilizing the inherent parallelism in the algorithm. The architecture is designed to be scalable and the facedetection load has been distributed among the processing units so as to reduce the idle time. The design has been synthesized for the Xilinx Virtex-5 board. Use of a single processing unit gives an improvement in the facedetection frame rate of 5.45 times over an Intel i5, 2.4 GHz processor. The frame rate is further doubled by scaling the architecture to include four processing units running in parallel.
Attending to multiple speakers in a video teleconferencing setting is a complex task. From a visual point of view, multiple speakers can occur at different locations and present radically different appearances. From a...
详细信息
Attending to multiple speakers in a video teleconferencing setting is a complex task. From a visual point of view, multiple speakers can occur at different locations and present radically different appearances. From an audio point of view, multiple speakers may be speaking at the same time, and background noise may make difficult to localize sound sources without some a priori estimate the sound source locations. This article presents a novel sensor and corresponding sensing algorithms to address the task of attending, simultaneously, to multiple speakers for video teleconferencing. A panoramic visual sensor is used to capture a 360degrees view of the speakers in the environment and from this view potential speakers are identified via a color histogram approach. A directional audio system based on beamforming is then used to confirm potential speakers and attend to them. Experimental evaluation of the sensor and its algorithms are presented including sample performance of the entire system in a teleconferencing setting. (C) 2003 Wiley Periodicals, Inc.
暂无评论