Due to The lack of comparison studies and practical applications of RNN, LSTM, and hybrid RNN-LSTM models for intrusion detection systems, especially when managing class imbalances in complex network datasets, represe...
详细信息
This study analyzes and predicts air pollution in Asia, focusing on PM 2.5 levels from 2018 to 2023 across five regions: Central, East, South, Southeast, and West Asia. South Asia emerged as the most polluted region, ...
详细信息
Multi-modal emotion recognition (MER) is crucial for improving human-computer interaction. Convolutional neural networks (CNNs) are the mainstream for MER tasks, but they require large databases, extensive memory, and...
详细信息
ISBN:
(数字)9798350368741
ISBN:
(纸本)9798350368758
Multi-modal emotion recognition (MER) is crucial for improving human-computer interaction. Convolutional neural networks (CNNs) are the mainstream for MER tasks, but they require large databases, extensive memory, and significant energy, limiting their practical use. This paper proposes a novel MER system that leverages wavelet scattering transform (WST) to address these challenges, achieving improved performance with lower computational consumption. Moreover, the system benefits from the noise robustness provided by WST. By integrating WST as a non-trainable initial layer in a CNN model and employing an encoder module, our system effectively captures time-frequency, local and high-level features from both speech and video. We enhance feature integration and representation with cross-modal attention (CMA) and a squeeze-and-excitation (SE) block. The results demonstrate that our system performs consistently across varying noise levels and duration thresholds. Ablation studies reveal that the combination of MFCC, Mel spectrogram, and raw waveform features yields the highest accuracy, with Mel spectrogram being the most influential. Experimental results on the IEMOCAP and RAVDESS databases achieve emotion recognition accuracy of 83.2% and 97.8%, respectively, showcasing improved performance and robustness compared to state-of-the-art models, while using fewer trainable parameters.
Head pose estimation (HPE) is a critical task for numerous applications ranging from human-computer interaction, healthcare, and robotics, to surveillance. Most existing methods employ Euler angles as a representation...
详细信息
Background and objective: Epilepsy is among the most prevalent illnesses of the central nervous system. This condition results in frequent, uncontrolled seizures that happen suddenly and are caused by a variety of tri...
详细信息
Background and objective: Epilepsy is among the most prevalent illnesses of the central nervous system. This condition results in frequent, uncontrolled seizures that happen suddenly and are caused by a variety of trigger factors, including brain injury, physiological, genetic, etc. Involuntary spasms or distraction during seizures can cause severe bodily harm or even death for epileptics. In this paper, an effective method for accurately classifying Electroencephalogram (EEG) data for the early identification of epileptic seizures is ***: The suggested process essentially hybridizes several statistical data, discrete wavelet transformations (DWT), machine learning algorithms, and feature selection techniques independently. Through the use of DWT, the automated multi-resolution signal processing approach decomposes EEG signals into detail and approximation coefficients after splitting them into detailed parts with varying window sizes to guarantee an accurate classification performance. Statistical latent features are extracted from these coefficients that describe the nonlinear and dynamical patterns in the signals. Feature selection techniques were used to reduce the dimension of the feature matrix while highlighting the important elements. Different classifier structures were developed to classify input matrices. For all classifiers, the optimal hyperparameters were found using grid search techniques. Performance metrics for classification were calculated to assess the model's ***: In the analysis, to compare the proposed procedure with the other approaches in terms of detecting the epileptic behaviors correctly, a benchmark data set from the University of Bonn database was used. The results showed that the proposed approach can estimate more robust models concerning performance metrics and information criteria in classifying EEG signals. Also, the most important frequency bands were detected to distinguish EEG ***: Th
We study online federated learning over a wireless network, where the central server updates an online global model sequence to minimize the time-varying loss of multiple local devices over time. The server updates th...
详细信息
A popular biometric identification method, renowned for its dependability and individuality in personal identification, is fingerprint recognition. This article presents an efficient fingerprint identification system ...
详细信息
ISBN:
(数字)9798331515683
ISBN:
(纸本)9798331515690
A popular biometric identification method, renowned for its dependability and individuality in personal identification, is fingerprint recognition. This article presents an efficient fingerprint identification system that significantly reduces processing times for 100,000 fingerprints from 8 minutes to less than 5 seconds by utilizing multithreading, ANSI 378 templates, and Source AFIS. The system overcomes the time-consuming image-to-template conversion procedure and increases overall storage efficiency by utilizing parallel processing and preformatted templates (ANSI/ISO). The fingerprint matching system incorporates Directional Field, CNN models for identifying fingerprint types, and Finger Code methods to classify fingerprints, improving precision and efficiency.
Stress is a significant health concern, impacting both physical and mental well-being. Prolonged exposure to stress can lead to numerous physical health issues, including cardiovascular diseases, and a weakened immune...
详细信息
Stress is a significant health concern, impacting both physical and mental well-being. Prolonged exposure to stress can lead to numerous physical health issues, including cardiovascular diseases, and a weakened immune system. This study presents a novel methodology for classifying perceived mental stress using electroencephalography (EEG) signals. By utilizing the publicly available Leipzig Study for Mind-Body-Emotion Interactions dataset, we analyze EEG data collected from 53 participants over a 7-minute resting-state duration. Our approach involves transforming EEG signals into spectrograms using the Short-Time Fourier Transform (STFT), resulting in a time-frequency representation of the input signals. We employ transfer learning to fine-tune three pre-trained deep neural networks i.e., ResNet50, EfficientNetB0, and DenseNet121 for classifying stress into two and three levels. Our findings demonstrate that the ResNet50 model achieves superior classification accuracies of 95.80% and 86.02% for two and three-level stress classification, respectively, outperforming existing state-of-the-art methods. This study is the first to utilize STFT-generated spectrograms and transfer learning for perceived stress classification, highlighting the efficacy of deep learning techniques in quantifying perceived mental stress through non-invasive EEG recordings. Our results indicate that the proposed method can significantly enhance the accuracy of stress classification frameworks, offering potential improvements in mental health assessment and intervention strategies.
Background: The immutability of smart contracts on blockchain platforms like Ethereum promotes security and trustworthiness but presents challenges for updates, bug fixes, or adding new features post-deployment. These...
详细信息
Privacy-preserving and secure data sharing are critical for medical image analysis while maintaining accuracy and minimizing computational overhead are also crucial. Applying existing deep neural networks (DNNs) to en...
详细信息
ISBN:
(数字)9798331508050
ISBN:
(纸本)9798331508067
Privacy-preserving and secure data sharing are critical for medical image analysis while maintaining accuracy and minimizing computational overhead are also crucial. Applying existing deep neural networks (DNNs) to encrypted medical data is not always easy and often compromises performance and security. To address these limitations, this research introduces a secure framework consisting of a learnable encryption method based on block-pixel operation to encrypt the data and subsequently integrate it with the Vision Transformer (ViT). The proposed framework ensures data privacy and security by creating unique scrambling patterns per key, providing robust performance against leading bit attacks and minimum difference attacks.
暂无评论