Navigating the world with visual impairments presents unique challenges, often limiting independence and safety. This research introduces SafeStride, a novel algorithm designed to empower visually impaired individuals...
详细信息
With the rapid development of emerging services such as cellular vehicle-to-everything and immersive video service, network connections have further evolved from tangible physical connections to intangible virtual con...
详细信息
The Internet of Things (IoT) has turned the healthcare domain in a better direction due to more interconnected and smarter systems that enhance care delivery. Certainly, the IoT-accessible applications in these fields...
详细信息
With the rapid development of intelligent transportation systems and growing emphasis on driver safety, real-time detection of driver drowsiness has become a critical area of research. This study presents a robust and...
详细信息
With the rapid development of intelligent transportation systems and growing emphasis on driver safety, real-time detection of driver drowsiness has become a critical area of research. This study presents a robust and scalable driver drowsiness detection framework that integrates a Swin Transformer-based deep learning model with a diffusion model for image denoising. While conventional convolutional neural networks (CNNs) are effective in standard vision tasks, they often suffer performance degradation in real-world driving scenarios due to noise, poor lighting, motion blur, and adversarial attacks. To address these challenges, the proposed model focuses on eye-state detection, specifically, prolonged eye closure, as a primary indicator of driver disengagement and fatigue. Our system introduces a novel preprocessing stage using a denoising diffusion model built on a U-Net encoder-decoder architecture, effectively mitigating the impact of Gaussian noise and adversarial perturbations. Additionally, we incorporate adversarial training with Fast Gradient Sign Method (FGSM) and Projected Gradient Descent (PGD) attacks, demonstrating significant improvements in classification accuracy and resilience. Evaluations are conducted on two benchmark datasets, Eye-Blink and Closed Eyes in the Wild (CEW), under both clean and noisy conditions. Comparative experiments show that the proposed system outperforms several state-of-the-art models, including ViT, ResNet50V2, InceptionV3, MobileNet, DenseNet169, and VGG19, in terms of accuracy (up to 99.82%), PSNR (up to 41.61 dB), and SSIM (up to 0.984), while maintaining competitive inference times suitable for practical deployment. Moreover, a detailed sensitivity analysis of data augmentation strategies reveals that techniques such as rotation and horizontal flip substantially enhance the model’s generalization across variable visual inputs. The system also demonstrates improved robustness under real-world black-box scenarios and adver
Speech emotion recognition is a difficult task that is gaining attention in a variety of domains, including psychology, human–computer interaction, and speech processing. To recognize speech emotions, machine learnin...
详细信息
With the breakthrough of convolutional neural networks, deep hashing methods have demonstrated remarkable performance in large-scale image retrieval tasks. However, existing deep supervised hashing methods, which rely...
详细信息
We study the task of automated house design,which aims to automatically generate 3D houses from user ***,in the automatic system,it is non-trivial due to the intrinsic complexity of house designing:1)the understanding...
详细信息
We study the task of automated house design,which aims to automatically generate 3D houses from user ***,in the automatic system,it is non-trivial due to the intrinsic complexity of house designing:1)the understanding of user requirements,where the users can hardly provide high-quality requirements without any professional knowledge;2)the design of house plan,which mainly focuses on how to capture the effective information from user *** address the above issues,we propose an automatic house design framework,called auto-3D-house design(A3HD).Unlike the previous works that consider the user requirements in an unstructured way(e.g.,natural language),we carefully design a structured list that divides the requirements into three parts(i.e.,layout,outline,and style),which focus on the attributes of rooms,the outline of the building,and the style of decoration,*** the processing of architects,we construct a bubble diagram(i.e.,graph)that covers the rooms′attributes and relations under the constraint of *** addition,we take each outline as a combination of points and orders,ensuring that it can represent the outlines with arbitrary ***,we propose a graph feature generation module(GFGM)to capture layout features from the bubble diagrams and an outline feature generation module(OFGM)for outline ***,we render 3D houses according to the given style requirements in a rule-based *** on two benchmark datasets(i.e.,RPLAN and T3HM)demonstrate the effectiveness of our A3HD in terms of both quantitative and qualitative evaluation metrics.
Over the past decades, integration of wireless sensor networks (WSNs) and computer vision (CV) technology has shown promising results in mitigating crop losses caused by wild animal attacks. Studies have demonstrated ...
详细信息
Over the past decades, integration of wireless sensor networks (WSNs) and computer vision (CV) technology has shown promising results in mitigating crop losses caused by wild animal attacks. Studies have demonstrated the effectiveness of these technologies in providing real-time monitoring and early detection of animal intrusions into agricultural fields. By deploying WSNs equipped with motion sensors and cameras, farmers can receive instant alerts when wild animals enter their fields, allowing for timely intervention to prevent crop damage. Furthermore, advancements in CV algorithms possess made possible to automatically detect and classify the animal species, facilitating targeted response strategies. For example, sophisticated image processing techniques can differentiate between harmless birds and destructive mammals, allowing farmers to focus their efforts on deterring the most damaging species. Field trials and pilot projects implementing WSN-CV systems have reported significant reductions in crop losses attributed to wild animal raids. By leveraging data collected through sensor networks and analyzed using computer vision algorithms, farmers can make informed decisions regarding pest and insect management strategies. This data-driven approach has led to more efficient utilization of resources, such as targeted application of insecticides and pesticides, resulting in both economic and environmental benefits. Moreover, the integration of WSN-CV technology has enabled the development of innovative deterrent systems that leverage artificial intelligence and automation. These systems can deploy non-lethal methods, such as sound or light-based repellents, to deter wild animals without causing harm to the environment or wildlife populations. Overall, the combination of wireless sensor networks and computer vision technology provides the promising resolution to the long-standing issue of wild animal-related losses in agriculture. By harnessing the power of data and a
This study delves into the significant role played by Quantum Dot Semiconductor Optical Amplifiers (QD-SOAs) in meeting the ever-growing bandwidth demands. QD-SOAs offer a unique blend of cost-effectiveness, integrati...
详细信息
Puzzle mats made of cushioned material are widely used in environments like homes and playrooms to prevent injuries from infants and toddlers falling. The mats feature puzzle-like edges, allowing users to freely adjus...
暂无评论