Nowadays, high energy amount is being wasted by computing servers and personal electronic devices, which produce a high amount of carbon dioxide. Thus, it is required to decrease energy usage and pollution. Many appli...
详细信息
作者:
Gote, Pradnyawant M.Kumar, PraveenVerma, PrateekYesankar, PrajyotPawar, AdeshSaratkar, Saniya
Faculty of Engineering and Technology Department of Computer Science & Design Maharashtra Wardha442001 India
Faculty of Engineering and Technology Department of Computer Science & Medical Engineering Maharashtra Wardha442001 India
Faculty of Engineering and Technology Department of Artificial Intelligence & Machine Learning Maharashtra Wardha442001 India
Faculty of Engineering and Technology Department of Artificial Intelligence & Data Science Maharashtra Wardha442001 India
The swift progression of wireless communication technologies-specifically from 5G to 6G is an approach that could be the most significant revolutionary leap towards changing connectivity and data transmission forever....
详细信息
With the rapid development of intelligent transportation systems and growing emphasis on driver safety, real-time detection of driver drowsiness has become a critical area of research. This study presents a robust and...
详细信息
With the rapid development of intelligent transportation systems and growing emphasis on driver safety, real-time detection of driver drowsiness has become a critical area of research. This study presents a robust and scalable driver drowsiness detection framework that integrates a Swin Transformer-based deep learning model with a diffusion model for image denoising. While conventional convolutional neural networks (CNNs) are effective in standard vision tasks, they often suffer performance degradation in real-world driving scenarios due to noise, poor lighting, motion blur, and adversarial attacks. To address these challenges, the proposed model focuses on eye-state detection, specifically, prolonged eye closure, as a primary indicator of driver disengagement and fatigue. Our system introduces a novel preprocessing stage using a denoising diffusion model built on a U-Net encoder-decoder architecture, effectively mitigating the impact of Gaussian noise and adversarial perturbations. Additionally, we incorporate adversarial training with Fast Gradient Sign Method (FGSM) and Projected Gradient Descent (PGD) attacks, demonstrating significant improvements in classification accuracy and resilience. Evaluations are conducted on two benchmark datasets, Eye-Blink and Closed Eyes in the Wild (CEW), under both clean and noisy conditions. Comparative experiments show that the proposed system outperforms several state-of-the-art models, including ViT, ResNet50V2, InceptionV3, MobileNet, DenseNet169, and VGG19, in terms of accuracy (up to 99.82%), PSNR (up to 41.61 dB), and SSIM (up to 0.984), while maintaining competitive inference times suitable for practical deployment. Moreover, a detailed sensitivity analysis of data augmentation strategies reveals that techniques such as rotation and horizontal flip substantially enhance the model’s generalization across variable visual inputs. The system also demonstrates improved robustness under real-world black-box scenarios and adver
Navigating the world with visual impairments presents unique challenges, often limiting independence and safety. This research introduces SafeStride, a novel algorithm designed to empower visually impaired individuals...
详细信息
This paper examines the escalating ransomware threats faced by government-managed educational institutions, focusing on their vulnerabilities, case studies, and mitigation strategies. With the adoption of Bring Your O...
详细信息
Millimeter-wave (mmWave) communication systems utilize narrow beamforming to ensure adequate signal power. However, beam alignment requires significant training overhead, especially in high-mobility scenarios. Previou...
详细信息
American Sign Language (ASL) recognition aims to recognize hand gestures, and it is a crucial solution to communicating between the deaf community and hearing people. However, existing sign language recognition algori...
详细信息
Machine translation (MT) for low-resource languages continues to face significant challenges because of limited digital resources and parallel corpora, despite remarkable developments in neural machine translation (NM...
详细信息
Chest x-ray studies can be automatically detected and their locations located using artificial intelligence (AI) in healthcare. To detect the location of findings, additional annotation in the form of bounding boxes i...
详细信息
Sound event detection and classification present significant challenges, particularly in noisy environments with multiple overlapping sources. This paper introduces an innovative architecture for multiple sound event ...
详细信息
暂无评论