The effectiveness of the FCM image segmentation algorithm can be increased by introducing local spatial information as FCM is very sensitive to noise. Also, during transmission, the image details are lost because nois...
详细信息
Genetic algorithms have been widely used in intelligent test paper generation systems. However, traditional genetic algorithms cannot ensure that the difficulty of test questions is normally distributed, and are prone...
详细信息
Machine learning algorithms are fundamentally driven by the data provided by humans;consequently, the decisions made by those algorithms are not free from human bias. This is particularly evident in the case of facial...
详细信息
ISBN:
(纸本)9789898704498
Machine learning algorithms are fundamentally driven by the data provided by humans;consequently, the decisions made by those algorithms are not free from human bias. This is particularly evident in the case of facial analysis systems that employ machine learning algorithms. Recent studies have shown that the decisions made by many of the commercially available facial analysis systems are prejudiced against certain groups of race, ethnicity, age, gender and culture. Further studies have identified that the underlying reason for such biased decisions is that the open source material available for facial image databases which are used in commerce and academia to train the algorithms has meager diversity in these categories. To compound this issue, facial analysis technology is promoted by influential companies and artificial intelligence service providers without affirming the fairness and accuracy of the decisions given by these systems. To minimize bias and ensure representation of the Middle Eastern population in the imminent growth of this technology, we propose the development of two Arab face databases along with an algorithmic audit involving seven commercially available facial analysis systems. Of the databases, the first, Arab-LEANA, will include 300 Arab subjects' face images with variation in lighting, expression, accessory, nationality and age (LEANA). The second, Arab Public Figures Faces (APFF), will contain images and videos of 300 Arab public figures captured "in the wild". Faces for APFF will be selected manually from the internet since manual selection of faces will result in a high degree of variability in scale, pose, expression, illumination, age, occlusion and make-up. These databases will provide the worldwide community of face recognition researchers with a large-scale, diverse collection of Arab face images for training and evaluating algorithms toward developing a more representative, and therefore more robust, capacity for facial analysis. T
Considering the challenges associated with robots in optoelectronic imaging applications, typically require real-time and accurate recognition and localization of targets, especially in complex environments. Due to th...
详细信息
An integrated computerized locking mechanism is vital for upholding the safety of train movements within railway networks. As computer technology progresses swiftly, conventional locking systems are encountering a mul...
详细信息
Originating from the pistachio tree (Pistacia vera) and prized for their nutritional content and adaptability in the kitchen, pistachios have great financial worth in the agricultural field. Ensuring the quality and s...
详细信息
ISBN:
(纸本)9798331540661;9798331540678
Originating from the pistachio tree (Pistacia vera) and prized for their nutritional content and adaptability in the kitchen, pistachios have great financial worth in the agricultural field. Ensuring the quality and safety of pistachio nuts is absolutely vital and calls for effective industrial post-harvest methods. Modern technology, especially imageprocessing and computer vision algorithms, are augmenting conventional approaches of pistachio categorization and separation. This work intends to build an enhanced classification model for pistachio identification using deep learning, most especially the VGG16 Convolutional Neural Network (CNN). After gathering a dataset of 2,148 high-resolution photos of pistachios, the approach uses resizing and data augmentation methods including rotation, flipping, and zooming to improve the generalization and prevent overfitting of the model. Pretrained on imageNet, the VGG16 model is fine-tuned to fit the particular job of pistachio recognition by means of feature extraction capacity. For feature extraction, the model architecture comprises convolutional layers;for classification, it features fully connected layers. With performance criteria including accuracy, precision, recall, and F1-score watched to guarantee efficient learning, optimization methods including gradient descent and backpropagation are utilized during training. On a test set, the model shows an extraordinary accuracy of 0.98, so highlighting the potential of deep learning models-especially VGG16-in automating and improving quality control systems in agricultural uses. With possible uses in related agricultural research, this highperformance classification model not only meets the demand for effective separation of pistachio species but also considerably increases their economic value.
This paper introduces MicroDeblur, an on-device image motion deblur solution for resource-constrained microcontroller-based vision systems. Although motion blurs caused by the movement or shake of the device (camera) ...
详细信息
ISBN:
(纸本)9798400701184
This paper introduces MicroDeblur, an on-device image motion deblur solution for resource-constrained microcontroller-based vision systems. Although motion blurs caused by the movement or shake of the device (camera) are pervasive in embedded, IoT, and mobile devices, it has been considered a hard nut to crack for many microcontrollers with extremely-limited resources (e.g., hundreds of KB of RAM). To tackle this problem, we combine the DNN (deep neural network) motion deblur method with the classical motion deblur approach and take the best of both worlds, i.e., 1) powerful pattern recognition ability of DNNs and 2) simplicity and stability of matrix-based classical algorithms. To deblur an image, MicroDeblur takes three steps: 1) blur kernel estimation, 2) blur image transformation, and 3) iterative clear image restoration. We propose 1) depth-independent convolution that efficiently estimates the blur kernel (pattern) and 2) Toeplitz-based motion blur modeling that enhances the time and space complexity of the deblurring process by O(n) and O(n(3)), respectively, compared to the existing methods. To the best of our knowledge, MicroDeblur is the first self-sufficient blind deconvolution solution for a stand-alone microcontroller that does not rely on extra hardware or external systems. We implement MicroDeblur on an ARM Cortex-M4F, achieving a competitive quality of deblurred images using 187x and 429x smaller memory and energy, respectively, compared to high-end GPU-based solutions.
In computer vision and imageprocessing the surveillance of video, virtual reality, robotic perception, and image compression are just a few of the uses for image segmentation. There are a plethora of segmentation alg...
详细信息
Since the invention of computers, imageprocessing methods have been utilized in a variety of applications, where their significance has grown. An important topic and the main emphasis of imageprocessing methods are ...
详细信息
This research explores the development and implementation of a software application that utilizes live human speech to generate dynamic visual media, aiming to revolutionize smart passenger entertainment. In order to ...
详细信息
暂无评论