Due to the advancement of neural networks and the increasing demand for accurate and real-time Speech Emotion Recognition (SER) in human-computer interactions, it is necessary to compare existing methods and databases...
详细信息
Due to the advancement of neural networks and the increasing demand for accurate and real-time Speech Emotion Recognition (SER) in human-computer interactions, it is necessary to compare existing methods and databases in SER in order to arrive at feasible solutions and a complete understanding of this open-ended problem in SER. To detect and recognize the emotions expressed in speech, various techniques have been used in the literature, including well-established speech analysis and classification techniques. These techniques, including speech analysis and classification, have been used to extract emotions from signals. In this study, we propose a novel method for analyzing signals called Wavelet-Scaled Spectrogram which combines the frequency and scale spectrum of a signal using wavelet transform. This method is effective in analyzing signals at different scales and frequency content. In order to train models for speech emotion identification, a large number of handcrafted features and intermediary depictions i.e., frequency-time plot that have traditionally been utilized in data compilation, collection, and analysis. The development of end-to-end models which extract characteristics and learn directly from raw speech signals to improve speech recognition has recently been studied by researchers following the emergence of deep learning. After training and evaluation on the famous speech databases EmoDB, RAVDESS and IEMOCAP, the proposed model is evaluated on various speakers in both speaker-independent and speaker-dependent modes and on a variety of different voices. When advanced preprocessing techniques or data augmentation are omitted from the proposed architecture, the results demonstrate that it can produce products comparable to those produced by the current state of the art. Three concurrent CNN pipelines and a series of modified local features learning blocks (LFLBs) achieved the highest classification accuracy outperforming some advance state-of-the-approa
Centralized baseband processing (CBP) is required to achieve the full potential of massive multiple-input multiple-output (MIMO) systems. However, due to the large number of antennas, CBP suffers from two major issues...
详细信息
As healthcare services have become increasingly digitized, Electronic Health Records (EHRs) have become widely adopted, providing seamless data exchange among providers. Conventional EHRs, however, are extremely vulne...
详细信息
The risk of gas leaks has grown significantly as a life threatening issue in industrial activities, cooking, and heating. This system integrates automatic reaction mechanisms, real-time monitoring capabilities, and ad...
详细信息
The need for a personalized user experience brought recommendation systems to the forefront of digital innovation. However, traditional approaches tend to often forget human emotions, which represent a critical driver...
详细信息
This paper analyzes whether Android apps may outsource computational activities to cloud servers. Due to the complexity of mobile apps, shifting computing operations to the cloud has become popular to improve performa...
详细信息
In the contemporary world, humanoid robots are likely to play a key role in various fields, including health care, domestic service, hospitality, business, and military and security activities. The robots are employed...
详细信息
Emotion recognition from EEG (electroencephalogram) signals is crucial in mental health diagnostics and human-computer interaction but is often hindered by high dimensionality, noise, and complex temporal dependencies...
详细信息
Emotion recognition from EEG (electroencephalogram) signals is crucial in mental health diagnostics and human-computer interaction but is often hindered by high dimensionality, noise, and complex temporal dependencies in the data. This paper presents a novel approach that integrates transformer models, attention mechanisms, and transfer learning to enhance emotion recognition accuracy from EEG signals. The proposed methodology consists of two phases: Attention Enhanced Base Model Development (AE-BMD) and Cross-Dataset Fine Tuning Adaptation (CD-FTA). In the AE-BMD phase, the base model is developed and trained on the SEED-IV dataset (15 participants, 62 EEG channels), achieving an accuracy of 84%, with an average precision of 84.75%, recall of 84% and F1-score of 84%. This phase employs optimized feature extraction from key EEG frequency bands (Delta, Theta, Alpha, Beta, Gamma) using techniques such as MFCC, GFCC, power spectral density, and Hjorth parameters. A transformer encoder with integrated spectral and temporal attention mechanisms captures intricate patterns and long-range dependencies within the EEG signals. In the CD-FTA phase, the model undergoes fine-tuning on the SEED-V dataset (20 participants, 62 channels) leading to an improved accuracy of 90%, with an average precision of 90.6%, recall of 90.6%, and F1-score of 90.6%. The model’s generalization is further validated on the MPED dataset (23 participants, 62 channels, seven emotion classes), achieving 79%, with an average precision of 79.3%, recall of 79.3% and F1-score of 79.1% across diverse emotional states. This cross-dataset adaptation leverages transfer learning to enhance the model’s generalization across different emotional states and EEG datasets. Experimental results show that the proposed approach outperforms traditional methods, achieving superior accuracy and robustness in emotion recognition tasks. This work advances emotion recognition systems by addressing challenges in EEG signal proc
The digital era has brought a surge in the amount of data generated, increasing the need for data security across individuals, organizations, and governments. Protecting sensitive information from unauthorized access ...
详细信息
Secure file encryption has become increasingly important in various disciplines, including finance, healthcare, and government, where protecting sensitive data is paramount. The proposed framework aims to meet this re...
详细信息
暂无评论