the research is based on the data set BDDI00K, which can well reflect the complex traffic scenarios in the real situation, and build YOLOv4 based on the uncertainty regression of the prediction box to establish the re...
详细信息
ISBN:
(纸本)9798350377040;9798350377033
the research is based on the data set BDDI00K, which can well reflect the complex traffic scenarios in the real situation, and build YOLOv4 based on the uncertainty regression of the prediction box to establish the reliability relationship mapping between the prediction box and the real box, so that the model can better detect complex traffic scenarios. In order to overcome the shortcomings of traditional detection methods and meet the actual driving environment, a traffic sign and pedestrian detection method based on YOLOv4 is proposed to provide data support for intelligent analysis and decision-making of safe operation of automatic driving, and provide reliable guarantee for eliminating hidden dangers of safe operation of automatic driving.
Emotion recognition is widely applied in medicine, education, and human-computer interaction, withthree main approaches: non-physiological, physiological, and hybrid signals. Hybrid methods show promise while non-phy...
详细信息
ISBN:
(纸本)9798350351491;9798350351484
Emotion recognition is widely applied in medicine, education, and human-computer interaction, withthree main approaches: non-physiological, physiological, and hybrid signals. Hybrid methods show promise while non-physiological signals are easily manipulable. Our study proposes a hybrid approach that combines EEG and facial expression data using decision-level fusion. Validating our approach using the DEAP database, we focused on binary classifications for alertness and valence. Our model classifies emotions into four categories: HVLA, HVHA, LVLA, and LVHA. From the dataset, we extracted 17 features from 32 EEG channels across 5 frequency bands for each subject. We applied SVM withthe RBF kernel and achieved an accuracy of 54.49%. For facial expression classification, we preprocessed frames from the tests of each subject and used CNN to obtain a validation accuracy of 68.36%. In the fusion step, we combined the predicted probabilities of the four labels from the two unimodal classifiers using weighted averaging to calculate the average predicted probabilities for the final emotion classification. Our thorough approach and strong results make a meaningful contribution to the field of emotion computing and emotion recognition.
In the digital age, the majority of paper documents are converted into electronic formats for preservation purposes. However, the conversion process can result in significant degradation of the document images, which ...
详细信息
Parkinson's Disease (PD) presents significant challenges in timely diagnosis and management, driving the need for advanced automatic recognition systems. this comparative study explores various methodologies for P...
详细信息
ISBN:
(纸本)9798350351491;9798350351484
Parkinson's Disease (PD) presents significant challenges in timely diagnosis and management, driving the need for advanced automatic recognition systems. this comparative study explores various methodologies for PD detection, leveraging machine learning and signal processing techniques across diverse modalities. Key phases, including data collection, preprocessing, feature extraction, selection, and classification, are scrutinized. through comprehensive analysis of methodologies, datasets, and performance metrics, the study reveals promising accuracies in PD classification. However, challenges such as dataset availability and class imbalance persist. Future research avenues include multi-modal fusion, real-time monitoring, and telemedicine integration, aiming to enhance diagnostic accuracy and accessibility. Embracing these perspectives can accelerate the translation of research findings into clinical practice, improving outcomes for individuals with Parkinson's disease.
Withthe development of computer technology, graphics and image processing technology has become a key component of visual communication system. Graphics and image processing technology as a common means of computer t...
详细信息
underwater object detection and recognition are critical technologies in marine science and engineering, enabling advancements in environmental monitoring, resource exploration, and autonomous underwater vehicles. the...
详细信息
the proceedings contain 118 papers. the topics discussed include: driver action recognition in low-light conditions: a multi-view fusion framework;fuzzy logic obstacle avoidance controller for a mobile robot using lid...
ISBN:
(纸本)9798350351484
the proceedings contain 118 papers. the topics discussed include: driver action recognition in low-light conditions: a multi-view fusion framework;fuzzy logic obstacle avoidance controller for a mobile robot using lidar;modeling continuous emotions in text data using IEMOCAP database;detection and classification of neurodegenerative diseases by automatic speech analysis;a comparative study of derivative-based and wavelet-based approaches for epilepsy EEG analysis;a novel nonlinear intelligent control approach for optimizing solar PV systems in medical applications under dynamic conditions;access control based on roles and groups: case study building a secure smart hospital using BPMS;credit card fraud detection using synthetic minority oversampling technique and deep learning technique;and in-memory computing architecture for efficient hardware security.
High-dimensional imageanalysis, such as Hyperspectral Imaging (HSI) data, poses unique challenges due to their high dimensionality and non-Euclidean structures, making their analysis and classification complex. In th...
详细信息
ISBN:
(纸本)9798350351491;9798350351484
High-dimensional imageanalysis, such as Hyperspectral Imaging (HSI) data, poses unique challenges due to their high dimensionality and non-Euclidean structures, making their analysis and classification complex. In this study, we explore the use of both graph deep learning (GDL) and multi-view graph representation learning for HSI classification. Furthermore, we present our proposed approach of multi-view Graph Convolutional Networks (GCNs) and how it leverages multiple views of the data by combining spectral and spatial features to improve classification accuracy. We discuss then specific challenges encountered when training our model on large HSIs, including managing large-scale graph data. We also discuss promising opportunities to overcome these challenges. By highlighting the challenges and opportunities associated with GDL and multi-view GCN usage for HSI classification, this study aims to shed light on recent developments and future prospects in this rapidly evolving field.
this work explores an innovative approach to image processing that provides high efficiency and accuracy in computer vision tasks. In this work, step-by-step learning of quantum machine learning models is considered, ...
详细信息
Geometric deep learning (GDL) has emerged as a powerful paradigm for analyzing complex data represented in non-Euclidean domains. In the field of neuroimaging, 3D meshes have become a prevalent representation for capt...
详细信息
ISBN:
(纸本)9798350351491;9798350351484
Geometric deep learning (GDL) has emerged as a powerful paradigm for analyzing complex data represented in non-Euclidean domains. In the field of neuroimaging, 3D meshes have become a prevalent representation for capturing the intricate structures of the brain. this survey paper provides a comprehensive overview of the recent advancements and techniques in using GDL to analyze brain 3D meshes. We systematically review the state-of-the-art methodologies employed in tasks such as segmentation, and classification of brain meshes. Additionally, we discuss the challenges and opportunities in this rapidly evolving field, including data scarcity, interpretability, and scalability. By synthesizing insights from diverse research efforts, this survey aims to guide researchers and practitioners toward a deeper understanding of the application of GDL in neuroimaging and pave the way for future breakthroughs in brain analysis.
暂无评论