Computer vision and Biometrics benefit from the recent advances in Pattern Recognition and Artificial Intelligence, which tends to make model-based face recognition more efficient. Also, deep learning combined with da...
详细信息
Computer vision and Biometrics benefit from the recent advances in Pattern Recognition and Artificial Intelligence, which tends to make model-based face recognition more efficient. Also, deep learning combined with data augmentation tends to enrich the training sets used for learning tasks. Nevertheless, face recognition still is challenging, especially because of imaging issues that occur in practice, such as changes in lighting, appearance, head posture and facial expression. In order to increase the reliability of face recognition, we propose a novel supervised appearance-based face recognition method which creates a low-dimensional orthogonal subspace that enforces the face class separability. The proposed approach uses data augmentation to mitigate the problem of training sample scarcity. Unlike most face recognition approaches, the proposed approach is capable of handling efficiently grayscale and color face images, as well as low and high-resolution face images. Moreover, proposed supervised method presents better class structure preservation than typical unsupervised approaches, and also provides better data preservation than typical supervised approaches as it obtains an orthogonal discriminating subspace that is not affected by the singularity problem that is common in such cases. Furthermore, a soft margins Support Vector machine classifier is learnt in the low-dimensional subspace and tends to be robust to noise and outliers commonly found in practical face recognition. To validate the proposed method, an extensive set of face identification experiments was conducted on three challenging public face databases, comparing the proposed method with methods representative of the state-of-the-art. The proposed method tends to present higher recognition rates in all databases. In addition, the experiments suggest that data augmentation also plays an essential role in the appearance-based face recognition, and that the CIELAB color space (L*a*b) is generally mor
Graph Neural Networks (GNNs) are neural models that use message transmission between graph nodes to represent the dependency of graphs. Variants of Graph Neural Networks (GNNs), such as graph recurrent networks (GRN),...
详细信息
Graph Neural Networks (GNNs) are neural models that use message transmission between graph nodes to represent the dependency of graphs. Variants of Graph Neural Networks (GNNs), such as graph recurrent networks (GRN), graph attention networks (GAT), and graph convolutional networks (GCN), have shown remarkable results on a variety of deep learning tasks in recent years. In this study, we offer a generic design pipeline for GNN models, go over the variations of each part, classify the applications in an organized manner, and suggest four outstanding research issues. Dealing with graph data, which provides extensive connection information among pieces, is necessary for many learning tasks. A model that learns from graph inputs is required for modelling physics systems, learning molecular fingerprints, predicting protein interfaces, and identifying illnesses. Reasoning on extracted structures (such as the dependency trees of sentences and the scene graphs of photos) is an important research issue that also requires graph reasoning models in other domains, such as learning from non-structural data like texts and images. Graph Neural Networks (GNNs) are primarily designed for dealing with graph-structured data, where relationships between entities are modeled as edges in a graph. While GNNs are not traditionally applied to image classification problems, researchers have explored ways to leverage graph-based structures to enhance the performance of Convolutional Neural Networks (CNNs) in certain scenario. GNN have been increasingly applied to Natural Language processing (NLP) tasks, leveraging their ability to model structured data and capture relationships between elements in a graph. GNN are also applied for traffic related problems particularly in modeling and optimizing traffic flow, analyzing transportation networks, and addressing congestion issues. GNN can be used for traffic flow prediction, dynamic routing & navigation, Anomaly detection, public transport network
High Dynamic Range (HDR) imaging has become a significant technological advancement in visual data processing, allowing for the capture of a wider dynamic range of luminance levels in images. This paper explores vario...
详细信息
ISBN:
(数字)9798331529505
ISBN:
(纸本)9798331529512
High Dynamic Range (HDR) imaging has become a significant technological advancement in visual data processing, allowing for the capture of a wider dynamic range of luminance levels in images. This paper explores various HDR processing techniques and their potential applications in automation and machinevision. By using methods such as multiple image fusion, image registration, and tone mapping, the paper demonstrates how HDR processing can enhance visual data in automated systems, improving accuracy in environments requiring complex lighting conditions. This work applies HDR algorithms to real-world scenarios, showcasing their potential in industrial automation and robotics, where accurate visual data plays a crucial role.
The Internet of Things, artificial intelligence, machine learning, and big data are just a few of the cutting-edge technologies that are being integrated into manufacturing processes as part of the "Industry 4.0&...
详细信息
The Internet of Things, artificial intelligence, machine learning, and big data are just a few of the cutting-edge technologies that are being integrated into manufacturing processes as part of the "Industry 4.0" revolution. Computer vision is an essential component of Industry 4.0 regarding sustainability, developed as a disruptive technology that extracts and interprets visual information from digital photos or videos using imageprocessing techniques and advanced models. In the context of Industry 4.0, this article offers an overview of computer vision, including its associated prospects, difficulties, and applications. A particular emphasis is placed on sustainability. It explores computer visionapplications in robotics and automation, safety and security, process optimization, augmented reality, robotics and inspection, object identification and tracking, predictive maintenance, and quality control and inspection. The study also identifies the critical approaches used to overcome the difficulties in implementing computer vision solutions. Incorporating computer vision into Industry 4.0 holds promise for unleashing unprecedented levels of efficiency, novelty, and competitiveness in the industrial sector. The manufacturing and industrial sectors may use Industry 4.0's prospects and adopt sustainable practices by utilizing computer vision and overcoming its inherent limits. This will help to create an eco-conscious and efficient future.
Depth estimation and 3D object detection are critical for autonomous systems to gain context of their surroundings. In recent times, compute capacity has improved tremendously, enabling computer vision and AI on the e...
详细信息
With the development of communication technology, people will be exposed to more and more graphics and images in the process of life andwork. Like using digital devices such as camera, scanner and camera to obtain ima...
详细信息
ISBN:
(纸本)9783031243660;9783031243677
With the development of communication technology, people will be exposed to more and more graphics and images in the process of life andwork. Like using digital devices such as camera, scanner and camera to obtain images, but these instruments and equipment can only obtain two-dimensional image information of objects, which is completely insufficient. Inmany fields, three-dimensional information of objects is necessary. In this paper, the 3D printing design of ceramic products is simulated based on 3D image reproduction technology. The satisfaction of users with the ceramic visual effect and hand-held comfort produced by 3D image reproduction simulation technology is investigated by means of questionnaire, and the computer vision technology and stereo matching technology are compared. The results show that more than 85% of users are very satisfied with the ceramic visual effect and hand-held comfort of three-dimensional image reproduction simulation technology, and less than 5% of users are not satisfied;The satisfaction of ceramic visual effect produced by computer vision technology and stereo matching technology is less than 60%, and the hand-held comfort is less than 70%.
machinevision system plays vital roles in the industrial application in order to maintain quality and control the process. machinevision technology has numerous applications in various industries like automotive ind...
详细信息
Actions speak more than words. In the context of the above statement, the importance of gestures and using them to control a system has become popular. The hand gesture recognition system for opening applications in W...
详细信息
The recognition of facial emotions has received growing focus in recent years due to its importance and the significant role it plays in shaping the way humans interact with computers. This can be achieved using deep ...
详细信息
Body movements are an essential part of non-verbal communication as they help to express and interpret human emotions. The potential of Body Emotion Recognition (BER) is immense, as it can provide insights into user p...
详细信息
ISBN:
(纸本)9783031667428;9783031667435
Body movements are an essential part of non-verbal communication as they help to express and interpret human emotions. The potential of Body Emotion Recognition (BER) is immense, as it can provide insights into user preferences, automate real-time exchanges and enable machines to respond to human emotions. BER finds applications in customer service, healthcare, entertainment, emotion-aware robots, and other areas. While face expression-based techniques are extensively researched, detecting emotions from body movements in the realworld presents several challenges, including variations in body posture, occlusions, and background. Recent research has established the efficacy of transformer deep-learning models beyond the language domain to solve video and image-related problems. A key component of transformers is the self-attention mechanism, which captures relationships among features across different spatial locations, allowing contextual information extraction. In this study, we aim to understand the role of body movements in emotion expression and to explore the use of transformer networks for body emotion recognition. Our method proposes a novel linear projection function of the visual transformer, which enables the transformation of 2D joint coordinates into a conventional matrix representation. Using an original method of contextual information learning, the developed approach enables a more accurate recognition of emotions by establishing unique correlations between individual's body motions over time. Our results demonstrated that the self-attention mechanism was able to achieve high accuracy in predicting emotions from body movements, surpassing the performance of other recent deep-learning methods. In addition, the impact of dataset size and frame rate on classification performance is analyzed.
暂无评论