Age detection is a fundamental task in computer vision with numerous applications, from targeted advertising to security systems. This paper proposes a robust approach for age estimation based on local binary patterns...
详细信息
Age detection is a fundamental task in computer vision with numerous applications, from targeted advertising to security systems. This paper proposes a robust approach for age estimation based on local binary patterns to extract features associated with face images. The goal of accurately predicting people's ages from facial images is to overcome challenges such as changes in lighting conditions, poses, and facial expressions. The proposed method uses a combination of feature extraction, feature selection, and machine learning algorithms, which we named Hybrid method. At first, facial landmarks are detected to determine the key points of the face and enable the extraction of the corresponding facial features. These features are then fed into a feature selection algorithm to identify the most distinctive ones, reducing dimensionality and increasing model efficiency. To evaluate the proposed approach, extensive experiments are conducted on benchmark datasets, including different age groups and ethnicities. The results show the effectiveness of the proposed method in achieving high accuracy and robustness in age estimation. As shown in the calculation results, the detection rate and accuracy of Hybrid method age estimation calculations are better than competing methods. For Hybrid method, the mean absolute error is 4.94 years, with a standard deviation of 4.74 years. From the point of view of average absolute error, this age estimation method is superior to other methods that have been presented to date. The proposed method for estimating the age of people has a final sensitivity of 97.2%, an accuracy of 96.8%, and a precision of 99.1%. In addition, it is stated in the specifications of the implementation system that the program can be executed in about 3.5 s, which is a suitable speed for estimating the age of people based on their face photographs.
Pattern recognition is a prominent area of research in computer vision, where different methods have been proposed in the last 50 years. This work presents the development of a Python API to identify the result of two...
详细信息
ISBN:
(纸本)9783030991708;9783030991692
Pattern recognition is a prominent area of research in computer vision, where different methods have been proposed in the last 50 years. This work presents the development of a Python API to identify the result of two six-sided dice used in the game called "Craps" as a no-controlled environment to help visually impaired people. The software is structured in four stages. The first one is capturing images through a device with a digital camera connected to the web via IP address. The second stage corresponds to the captured imageprocessing;it is necessary to establish a standard image size and resize and equalize the digitized image. The third stage seeks to segment the object of study by artificial vision techniques to identify the result of the dice after being thrown. Finally, the fourth stage is to interpret the result and play it through a speaker. The expected possible result is a system that integrates the four stages mentioned above through an intuitive and accessible low-cost Python API, mainly aimed at visually impaired people.
Recent advancements and breakthroughs in deep learning have accelerated the rapid development in the field of computer vision. Having recorded a huge success in 2D object perception and detection, a lot of progress ha...
详细信息
The face is a critical perspective in predicting human feelings and moods. More frequently than not human senti-ments are extricated with the utilization of the camera. Various applications are being made based on the...
详细信息
The proceedings contain 173 papers. The topics discussed include: restricted area sign detector using YOLO v5;research on distance teaching course interactive system based on computer algorithm research data;APT detec...
ISBN:
(纸本)9798350323313
The proceedings contain 173 papers. The topics discussed include: restricted area sign detector using YOLO v5;research on distance teaching course interactive system based on computer algorithm research data;APT detection and attack scenario reconstruction based on big data analysis;new imageprocessing: VGG image style transfer with gram matrix style features;trajectory measurement and positioning of underwater vehicle based on monocular stereo vision;the importance of multi feature extraction and fusion for prediction of protein subcellular localization;design and implementation of FPGA-based four-dimensional ultra chaotic system;flocking towards a robust mobile network topology;real time speech recognition method for online complaints from power grid customers based on improved residual network;optimization of parking space detection system based on ZigBee wireless sensor network;and a wire drawing defect detection approach for FDM 3D printing based on machinevision technology.
This research paper presents cutting-edge technologies and methodologies to enhance precision agriculture and support sustainable farming practices. The study incorporates Satellite imageprocessing for land classific...
详细信息
The employment of AI and machine learning methods for mushroom classification has several potential uses and applications in the area of AI and related disciplines. It is possible to learn more about biodiversity and ...
详细信息
In recent years, there has been a notable surge in the utilization of industrial imageprocessingapplications across various sectors, including automotive, medical, and space industries. These applications rely on sp...
详细信息
In recent years, there has been a notable surge in the utilization of industrial imageprocessingapplications across various sectors, including automotive, medical, and space industries. These applications rely on specialized camera systems and advanced imageprocessing techniques to accurately measure working products with precise tolerances. This research presents a novel fast algorithm for measuring the diameter of a ring, employing a subpixel counting method. The algorithm classifies image pixels into two categories: full pixels and transition pixels. Full pixels reside entirely within the inner region of the workpiece, while transition pixels represent gray pixels that reside at the boundary between the workpiece and its background. To ensure accurate determination of the object area, the proposed method incorporates normalization to account for the contribution of transition pixels alongside full pixels. Subsequently, the circle area equation is employed to calculate the diameter. Moreover, a robust threshold selection method is introduced to effectively distinguish pixels with gray intensities. The experimental setup consists of an industrial camera equipped with telecentric lenses and appropriate illumination. The results demonstrate that the proposed algorithm achieves a 3-10 % improvement in accuracy compared to existing approaches. In terms of measuring sensitivity, the operational sensitivity of the proposed methodology is quantified as 1/20th of the pixel size, exhibiting an average uncertainty of 1 mu m. Furthermore, the proposed method surpasses existing works by at least 12.5 % to 35 % in terms of benchmarking computing time.
Surveillance is a major stream of research in the field of Unmanned Aerial Vehicles (UAV), which focuses on the observation of a person, group of people, buildings, infrastructure, etc. With the integration of real ti...
详细信息
Robust, fast, and low-power hardware platforms are desirable for the implementation of real-time machinevision. Here the authors develop a computing-in-sensor network using ferroelectric photo sensors with remanent-p...
详细信息
Robust, fast, and low-power hardware platforms are desirable for the implementation of real-time machinevision. Here the authors develop a computing-in-sensor network using ferroelectric photo sensors with remanent-polarization-controlled photo responsivities. Nowadays the development of machinevision is oriented toward real-time applications such as autonomous driving. This demands a hardware solution with low latency, high energy efficiency, and good reliability. Here, we demonstrate a robust and self-powered in-sensor computing paradigm with a ferroelectric photosensor network (FE-PS-NET). The FE-PS-NET, constituted by ferroelectric photosensors (FE-PSs) with tunable photoresponsivities, is capable of simultaneously capturing and processingimages. In each FE-PS, self-powered photovoltaic responses, modulated by remanent polarization of an epitaxial ferroelectric Pb(Zr0.2Ti0.8)O-3 layer, show not only multiple nonvolatile levels but also sign reversibility, enabling the representation of a signed weight in a single device and hence reducing the hardware overhead for network construction. With multiple FE-PSs wired together, the FE-PS-NET acts on its own as an artificial neural network. In situ multiply-accumulate operation between an input image and a stored photoresponsivity matrix is demonstrated in the FE-PS-NET. Moreover, the FE-PS-NET is faultlessly competent for real-time imageprocessing functionalities, including binary classification between 'X' and 'T' patterns with 100% accuracy and edge detection for an arrow sign with an F-Measure of 1 (under 365 nm ultraviolet light). This study highlights the great potential of ferroelectric photovoltaics as the hardware basis of real-time machinevision.
暂无评论