The elderly have a sensitive period of life in terms of physical and mental health and require close assistance or a caregiver. Medical assistance has the ability to recognize the emotional states of older adults thro...
详细信息
ISBN:
(纸本)9783031298561;9783031298578
The elderly have a sensitive period of life in terms of physical and mental health and require close assistance or a caregiver. Medical assistance has the ability to recognize the emotional states of older adults through facial expressions and take care of them in real time. This paper provides a comprehensive Facial Emotion Recognition (FER) review, especially for the elderly. Several studies have been conducted on the facial emotion recognition of young and middle-aged adults. Very few studies have focused on automatic emotion recognition for the elderly. Aging comes with a decline in the ability to recognize emotions and impacts emotion perception in humans. Furthermore, older people are suffering from cognitive impairment worldwide, which leads to abnormal emotional patterns. This paper is a literature review of FER techniques in computervision;FER approaches, and FER databases, and discusses the main challenge of facial expression recognition across age and lifespan.
Federated learning, a different direction of distributed optimization, is very much important when there are restrictions of data sharing due to privacy and communication overhead. In federated learning, instead of sh...
详细信息
The task of crime prediction is to predict the charges of a case based on the given description of the case, which has become a research hot spot. Most of the existing methods use neural networks and machine learning ...
详细信息
Aiming at the problem that the single target tracking effect of low illumination video is poor, a low illumination video single target tracking method based on computervision is proposed. The computervision processi...
Aiming at the problem that the single target tracking effect of low illumination video is poor, a low illumination video single target tracking method based on computervision is proposed. The computervisionprocessing algorithm is used to enhance the video image, and the sharpening components in the images at all levels are controlled by parameters to suppress the high-frequency noise of low illumination video. In order to better expand the dynamic range of the overall gray level of the enhanced image, this paper adopts an improved computervision method to adjust the contrast of the video image, so as to achieve the requirements of low illumination video but target tracking. Finally, from the simulation results, it can be seen that the low illumination video single target tracking method based on computervision has better integrity and accuracy, and the short-term occlusion of moving targets can achieve a better tracking effect.
Lung cancer has emerged as a significant cause for concern among individuals worldwide. Consequently, several nations provide financial resources and extend invitations to researchers and medical professionals to coll...
详细信息
ISBN:
(数字)9798331543891
ISBN:
(纸本)9798331543907
Lung cancer has emerged as a significant cause for concern among individuals worldwide. Consequently, several nations provide financial resources and extend invitations to researchers and medical professionals to collaborate in addressing this issue. Lung cancer is a significant contributor to mortality subsequent to heart illness. The timely identification of lung cancer in its first stages has the potential to enhance patient survival rates. The use of computervision is of utmost importance in the timely identification of lung cancer and the implementation of early therapeutic interventions, hence enhancing the likelihood of patient survival. The proposed deep learning model incorporates Convolutional Neural Network (CNN) and transfer learning techniques, along with other imageprocessing methods like image compression and picture enhancement. These approaches are used to analyze medical images and make correct predictions. Our study demonstrates a CNN model accuracy of 91.29% and a transfer learning accuracy of 95.9%.
In recent years, weakly supervised semantic segmentation using image-level labels as supervision has received significant attention in the field of computervision. Most existing methods have addressed the challenges ...
In recent years, weakly supervised semantic segmentation using image-level labels as supervision has received significant attention in the field of computervision. Most existing methods have addressed the challenges arising from the lack of spatial information in these labels by focusing on facilitating supervised learning through the generation of pseudolabels from class activation maps (CAMs). Due to the localized pattern detection of Convolutional Neural Networks (CNNs), CAMs often emphasize only the most discriminative parts of an object, making it challenging to accurately distinguish foreground objects from each other and the background. Recent studies have shown that vision Transformer (ViT) features, due to their global view, are more effective in capturing the scene layout than CNNs. However, the use of hierarchical ViTs has not been extensively explored in this field. This work explores the use of Swin Transformer by proposing "SWTformer" to enhance the accuracy of the initial seed CAMs by bringing local and global views together. SWTformer-V1 generates class probabilities and CAMs using only the patch tokens as features. SWTformer-V2 incorporates a multi-scale feature fusion mechanism to extract additional information and utilizes a background-aware mechanism to generate more accurate localization maps with improved cross-object discrimination. Based on experiments on the PascalVOC 2012 dataset, SWTformer-V1 achieves a 0.98% mAP higher localization accuracy, outperforming state-of-the-art models. It also yields comparable performance by 0.82% mIoU on average higher than other methods in generating initial localization maps, depending only on the classification network. SWTformer-V2 further improves the accuracy of the generated seed CAMs by 5.32% mIoU, further proving the effectiveness of the local-to-global view provided by the Swin transformer. Code available at: https://***/RozhanAhmadi/SWTformer
It is crucial for a farmer to determine the ripening of the fruit since the harvested crop can be sold at a higher price if it is ripe. In line with this, fruits are considered one of the country39;s most exported g...
详细信息
ISBN:
(纸本)9781665463553
It is crucial for a farmer to determine the ripening of the fruit since the harvested crop can be sold at a higher price if it is ripe. In line with this, fruits are considered one of the country's most exported goods (3rd as of 2020). The researchers proposed a system using Support Vector Machine (SVM) to determine the fruit maturity of Banana, Mango, and Calamansi. Said fruits are classified into three categories for fruit maturity, namely: unripe, ripe, and overripe. 140 fruits were gathered and used starting when it was unripe until it was overripe. This was done in a 12-hour interval, every 9 am and 9 pm every day for two weeks. The data gathered consists of 1729 images of bananas (Cavendish), 711 images of mangos, and 589 images of calamansi. The model was written in Python, and the integrated development environment (IDE) used was a jupyter notebook. HSV Conversion, Hyperparameter tuning, and Feature Extraction were used to increase the accuracy of the model. Three models were used, namely, Model 1, the model after fitting the dataset with background and Hyperparameter Tuning, Model 2, the model after fitting the dataset with removed background and features identified, and Model 3, the model after fitting the dataset with removed background, identified features and underwent hyperparameter tuning.
Single-photon camera is a novel camera type that utilizes image sensor with photon-counting capability. Recently, the potential of such sensors to achieve high spatial resolutions (e.g., 10^9pixels/chip) and frame rat...
Single-photon camera is a novel camera type that utilizes image sensor with photon-counting capability. Recently, the potential of such sensors to achieve high spatial resolutions (e.g., 10^9pixels/chip) and frame rates (e.g., 10^6frames/sec) is well-established. However, there is a significant difference in the output data between single-photon cameras and traditional CMOS sensor cameras. Therefore, conventional image reconstruction algorithms cannot be utilized. Vison Transformer has impressive performance on imageprocessing tasks, and this paper will design a plug-and-play algorithm for reconstructing images from single-photon cameras and integrate imageprocessing algorithms based on deep neural networks like ViT. The results demonstrate our methods can achieve improvement both in PSNR and SSIM.
With the wide application of computervision technology in sports field, how to analyze sports video efficiently and accurately has become an important research direction. Fuzzy clustering algorithm as a nonlinear dat...
With the wide application of computervision technology in sports field, how to analyze sports video efficiently and accurately has become an important research direction. Fuzzy clustering algorithm as a nonlinear data mining method can deal with uncertainty and noise provides a new idea for the analysis of sports video images. This paper mainly discusses the application of fuzzy clustering algorithm in sports video imageprocessing, and analyzes its influence on sports video image
In this paper, a new vehicle distance detection technology based on imageprocessing is proposed to solve the traditional ultrasonic vehicle distance detection errors, many blind areas and other problems. The binocula...
In this paper, a new vehicle distance detection technology based on imageprocessing is proposed to solve the traditional ultrasonic vehicle distance detection errors, many blind areas and other problems. The binocular vision sensing system was used to collect real-time images in front of vehicles. The vehicle in front of the collected image is identified through template matching, and the parallax principle is used to calculate the vehicle distance after confirming the presence of the vehicle. Experimental results show that this method can detect the distance between different angles and different positions with high accuracy.
暂无评论