To date, the problem of automating work with images taken using satellite systems has become relevant. This task concerns a wide range of human activities, including urban planning, transport logistics, ecology and en...
详细信息
According to a recent info trends study, in 2021, mobile and camera device users will have taken more than 1.5 trillion images, a sharp increase from the data from 2016. These image data will be used in a variety...
详细信息
In the modern-day era of technology, a paradigm shift has been witnessed in the areas involving applications of Artificial Intelligence (AI), machine Learning (ML), and Deep Learning (DL). Specifically, Deep Neural Ne...
详细信息
In the modern-day era of technology, a paradigm shift has been witnessed in the areas involving applications of Artificial Intelligence (AI), machine Learning (ML), and Deep Learning (DL). Specifically, Deep Neural Networks (DNNs) have emerged as a popular field of interest in most AI applications such as computer vision, image and video processing, robotics, etc. In the context of developed digital technologies and the availability of authentic data and data handling infrastructure, DNNs have been a credible choice for solving more complex real-life problems. The performance and accuracy of a DNN is a way better than human intelligence in certain situations. However, it is noteworthy that the DNN is computationally too cumbersome in terms of the resources and time to handle these computations. Furthermore, general-purpose architectures like CPUs have issues in handling such computationally intensive algorithms. Therefore, a lot of interest and efforts have been invested by the research fraternity in specialized hardware architectures such as Graphics processing Unit (GPU), Field Programmable Gate Array (FPGA), Application Specific Integrated Circuit (ASIC), and Coarse Grained Reconfigurable Array (CGRA) in the context of effective implementation of computationally intensive algorithms. This paper brings forward the various research works on the development and deployment of DNNs using the aforementioned specialized hardware architectures and embedded AI accelerators. The review discusses the detailed description of the specialized hardware-based accelerators used in the training and/or inference of DNN. A comparative study based on factors like power, area, and throughput, is also made on the various accelerators discussed. Finally, future research and development directions, such as future trends in DNN implementation on specialized hardware accelerators, are discussed. This review article is intended to guide hardware architects to accelerate and improve the effe
Facial expression recognition mimics human coding abilities and delivers non-verbal human–robot communication cues. machine learning and deep learning techniques enable real-world computer visionapplications. Deep l...
详细信息
Sign languages play an important role to bridge the communication gap with hearing-impaired people. A lot of research is carried out to provide efficient, portable, and economical, tools, techniques, and products to m...
详细信息
Edge detection is one of the most important and fundamental problems in the field of computer vision and imageprocessing. Edge contours extracted from images are widely used as critical cues for various image underst...
详细信息
Edge detection is one of the most important and fundamental problems in the field of computer vision and imageprocessing. Edge contours extracted from images are widely used as critical cues for various image understanding tasks such as image segmentation, object detection, image retrieval, and corner detection. The purpose of this paper is to review the latest developments on image edge detection. Firstly, the definition and properties of edges are introduced. Secondly, the existing edge detection methods are classified and introduced in detail. Thirdly, the existing widely used datasets and evaluation criteria for edge detection methods are summarized. Finally, future research directions for edge detection are elaborated. (C) 2022 Elsevier B.v. All rights reserved.
Close-range photogrammetry is widely used to measure the surface shape of various objects and its deformations. The classic approach for this is to use a stereo pair of images, which are captured from different angles...
详细信息
various mobile and edge devices have significantly different processing capabilities, making it challenging to develop a single universal architecture of a neural network to extract facial embeddings. In this paper, w...
详细信息
various mobile and edge devices have significantly different processing capabilities, making it challenging to develop a single universal architecture of a neural network to extract facial embeddings. In this paper, we study the automated machine learning techniques to design a neural network with the best performance on a concrete device. The novel procedure is proposed to choose the better subnetwork of the Supernet based on a genetic algorithm with a surrogate binary classifier to compare the expected accuracy of two subnetworks. The latter uses only encoding of a candidate subnetwork and does not require directly estimating its accuracy on a validation set. As a result, the most computationally efficient and accurate model in TensorFlow Lite format is obtained in less than 10 minutes for a specific device and latency constraint. An Android demo application has been developed to demonstrate the potential of designed neural networks. It is experimentally shown that the proposed approach is universal: it can extract deep embeddings for tasks such as face verification and facial expression recognition and for various types of devices, including smartphones and Raspberry Pi single-board mini-computers. Our models process one facial image in real-time and achieve much higher accuracy when compared to the best-known lightweight networks.
The development of international trade has facilitated the global distribution of food. Ensuring the safety of food products is a crucial process that spans from production to sale. Mismanagement of this process can p...
详细信息
The development of international trade has facilitated the global distribution of food. Ensuring the safety of food products is a crucial process that spans from production to sale. Mismanagement of this process can pose significant public health risks. The issue of food adulteration is increasingly prevalent, necessitating the development of fast and reliable methods for its detection. Deep learning, as an effective machine learning algorithm, has emerged as a new field in the food industry, offering rapid and accurate results in the identification of food adulteration. In this study, a digital image and deep learning-based method was developed to detect spinach adulteration in pistachios. A unique dataset with 6 classes was created in a laboratory environment for testing the method. The adulteration rates for each class were determined, and images were analyzed in various color spaces, including Red Green Blue (RGB), HSv (Hue Saturation value), Y,u and v (YUv), and L, a, and b (LAB). Subsequently, Convolutional Neural Network (CNN) architectures, namely ResNet-50, vGGNet-19, and DenseNet201, were employed for classification. The accuracy of all color spaces and architectural combinations exceeded 90%. Notably, the vGGNet-19 architecture achieved a 100% success rate in classifying the LAB color space. Moreover, the YUv/ResNet-50 and HSv/vGGNet-19 combinations demonstrated over 98% success in detecting peanut adulteration. The utilization of deep learning-based architectures enables swift and effortless analysis of complex food samples, eliminating the challenges associated with analyzing large quantities of food and effectively preventing food adulteration.
Age detection is a fundamental task in computer vision with numerous applications, from targeted advertising to security systems. This paper proposes a robust approach for age estimation based on local binary patterns...
详细信息
Age detection is a fundamental task in computer vision with numerous applications, from targeted advertising to security systems. This paper proposes a robust approach for age estimation based on local binary patterns to extract features associated with face images. The goal of accurately predicting people's ages from facial images is to overcome challenges such as changes in lighting conditions, poses, and facial expressions. The proposed method uses a combination of feature extraction, feature selection, and machine learning algorithms, which we named Hybrid method. At first, facial landmarks are detected to determine the key points of the face and enable the extraction of the corresponding facial features. These features are then fed into a feature selection algorithm to identify the most distinctive ones, reducing dimensionality and increasing model efficiency. To evaluate the proposed approach, extensive experiments are conducted on benchmark datasets, including different age groups and ethnicities. The results show the effectiveness of the proposed method in achieving high accuracy and robustness in age estimation. As shown in the calculation results, the detection rate and accuracy of Hybrid method age estimation calculations are better than competing methods. For Hybrid method, the mean absolute error is 4.94 years, with a standard deviation of 4.74 years. From the point of view of average absolute error, this age estimation method is superior to other methods that have been presented to date. The proposed method for estimating the age of people has a final sensitivity of 97.2%, an accuracy of 96.8%, and a precision of 99.1%. In addition, it is stated in the specifications of the implementation system that the program can be executed in about 3.5 s, which is a suitable speed for estimating the age of people based on their face photographs.
暂无评论