Driver fatigue detection is increasingly recognized as critical for enhancing road safety. This study introduces a method for detecting driver fatigue using the SEED-VIG dataset, a well-established benchmark in EEG-ba...
详细信息
This paper aims to improve Llama 2’s performance by using personalized and modified datasets. Despite the impressive capabilities of large language models (LLMs) such as Llama 2, their effectiveness may be limited in...
详细信息
Emotion recognition based on physiological signals has become a crucial area of research in affective computing and human-computer interaction, with applications in smart homes, workplaces, educational institutions, h...
详细信息
ISBN:
(纸本)9791188428137
Emotion recognition based on physiological signals has become a crucial area of research in affective computing and human-computer interaction, with applications in smart homes, workplaces, educational institutions, healthcare, and entertainment. In this study, a real-time emotion recognition system utilizing fog computing architecture was developed by considering the challenges of latency, total response time, resource usage, and security in IoT environments. The random forest machine learning model was trained with time-based statistical features by using the DREAMER dataset. Even though the model achieved an accuracy of 84.21% with 104 features, to meet real-time performance requirements, the system was optimized to calculate 24 features, maintaining a commendable accuracy of 79.70%. Extensive experiments demonstrated the superior performance of fog computing compared to edge and cloud computing in terms of latency, queuing delay, jitter, and most importantly total response time. The results highlight the proposed system's ability to support multiple users simultaneously. Copyright 2025 Global IT Research Institute (GIRI). All rights reserved.
In this paper, we propose a framework to address the problem of guiding a person within a semi-structured environment in a socially acceptable manner that prioritises safety and comfort. We propose an algorithm based ...
详细信息
Conventional subspace-based direction-of-arrival (DOA) estimation algorithms require optimal environments to achieve satisfactory estimation accuracy. With the advancement of sparse signal recovery theory, sparse opti...
详细信息
Three-dimensional (3D) point cloud, as an emerging visual media format, is increasingly favored by consumers as it can provide more realistic visual information than two-dimensional (2D) data. Similar to 2D plane imag...
详细信息
In this paper, we present the first implementation of a social robot acting as a companion for individuals eating alone. Our system can engage in multimodal interactions with the user during meals. It conducts convers...
详细信息
With the rapid development of intelligent transportation systems and growing emphasis on driver safety, real-time detection of driver drowsiness has become a critical area of research. This study presents a robust and...
详细信息
With the rapid development of intelligent transportation systems and growing emphasis on driver safety, real-time detection of driver drowsiness has become a critical area of research. This study presents a robust and scalable driver drowsiness detection framework that integrates a Swin Transformer-based deep learning model with a diffusion model for image denoising. While conventional convolutional neural networks (CNNs) are effective in standard vision tasks, they often suffer performance degradation in real-world driving scenarios due to noise, poor lighting, motion blur, and adversarial attacks. To address these challenges, the proposed model focuses on eye-state detection, specifically, prolonged eye closure, as a primary indicator of driver disengagement and fatigue. Our system introduces a novel preprocessing stage using a denoising diffusion model built on a U-Net encoder-decoder architecture, effectively mitigating the impact of Gaussian noise and adversarial perturbations. Additionally, we incorporate adversarial training with Fast Gradient Sign Method (FGSM) and Projected Gradient Descent (PGD) attacks, demonstrating significant improvements in classification accuracy and resilience. Evaluations are conducted on two benchmark datasets, Eye-Blink and Closed Eyes in the Wild (CEW), under both clean and noisy conditions. Comparative experiments show that the proposed system outperforms several state-of-the-art models, including ViT, ResNet50V2, InceptionV3, MobileNet, DenseNet169, and VGG19, in terms of accuracy (up to 99.82%), PSNR (up to 41.61 dB), and SSIM (up to 0.984), while maintaining competitive inference times suitable for practical deployment. Moreover, a detailed sensitivity analysis of data augmentation strategies reveals that techniques such as rotation and horizontal flip substantially enhance the model’s generalization across variable visual inputs. The system also demonstrates improved robustness under real-world black-box scenarios and adver
Corrosion poses a significant challenge in industries due to material degradation and high maintenance costs, making effective inhibitors essential. Recent studies suggest expired pharmaceuticals as alternative corros...
详细信息
Because of environmental concerns, remanufacturing has become an integral process for many production companies. Most published papers dealing with manufacturing and remanufacturing systems (MRSs) overlook some indust...
详细信息
暂无评论