To ensure effective and safer utilization of Lithium-Ion Batteries (LIB), accurate State Estimation particularly State of Health (SOH) is extremely essential, especially in automotive applications. Accurate informatio...
详细信息
Few-shot learning (FSL) is the process of rapid generalization from abundant base samples to inadequate novel samples. Despite extensive research in recent years, FSL is still not yet able to generate satisfactory sol...
详细信息
Epileptic seizures with the risk of sudden unexpected death in epilepsy affect the quality of life. Nearly, one-fourth of the individuals suffer from seizures that cannot be treated with medications. Due to the high-l...
详细信息
Artificial intelligence (AI) along with deep learning techniques has become an integral part of almost all aspects of life. One of the domains significantly impacted by this technological revolution is healthcare. Dee...
详细信息
The adoption of automation in software testing presents challenges that can hinder its effectiveness and scalability. This study systematically investigates these challenges using a multi-phase research approach. Firs...
详细信息
The brain-computer interface (BCI) and eye-tracking technologies can potentially improve the learning environment in education. Cognitive BCIs can give a deep knowledge of brain functioning, enabling the creation of m...
详细信息
Human Activity Recognition(HAR)plays an important role in life care and health monitoring since it involves examining various activities of patients at homes,hospitals,or ***,the proposed system integrates Human-Human...
详细信息
Human Activity Recognition(HAR)plays an important role in life care and health monitoring since it involves examining various activities of patients at homes,hospitals,or ***,the proposed system integrates Human-Human Interaction(HHI)and Human-Object Interaction(HOI)recognition to provide in-depth monitoring of the daily routine of *** propose a robust system comprising both RGB(red,green,blue)and depth *** particular,humans in HHI datasets are segmented via connected components analysis and skin detection while the human and object in HOI datasets are segmented via saliency *** track the movement of humans,we proposed orientation and thermal features.A codebook is generated using Linde-Buzo-Gray(LBG)algorithm for vector ***,the quantized vectors generated from image sequences of HOI are given to Artificial Neural Network(ANN)while the quantized vectors generated from image sequences of HHI are given to K-ary tree hashing for *** are two publicly available datasets used for experimentation on HHI recognition:Stony Brook University(SBU)Kinect interaction and the University of Lincoln’s(UoL)3D social activity ***,two publicly available datasets are used for experimentation on HOI recognition:Nanyang Technological University(NTU)RGB-D and Sun Yat-Sen University(SYSU)3D HOI *** results proved the validity of the proposed system.
Cancer of the breast is one of the primary causes of mortality of women across the globe. Breast abnormalities may often be diagnosed and classified with the use of ultrasound imaging. To better pinpoint health issues...
详细信息
Quantum Computing is continuously evolving and expanding. As time goes by, more and more Quantum computer implementations become available, each of them with their own features. In such a scenario, it can be difficult...
详细信息
In the field of autonomous vehicles(AVs),accurately discerning commander intent and executing linguistic commands within a visual context presents a significant *** paper introduces a sophisticated encoder-decoder fra...
详细信息
In the field of autonomous vehicles(AVs),accurately discerning commander intent and executing linguistic commands within a visual context presents a significant *** paper introduces a sophisticated encoder-decoder framework,developed to address visual grounding in *** Context-Aware Visual Grounding(CAVG)model is an advanced system that integrates five core encoders—Text,Emotion,Image,Context,and Cross-Modal—with a multimodal *** integration enables the CAVG model to adeptly capture contextual semantics and to learn human emotional features,augmented by state-of-the-art Large Language Models(LLMs)including *** architecture of CAVG is reinforced by the implementation of multi-head cross-modal attention mechanisms and a Region-Specific Dynamic(RSD)layer for attention *** architectural design enables the model to efficiently process and interpret a range of cross-modal inputs,yielding a comprehensive understanding of the correlation between verbal commands and corresponding visual *** evaluations on the Talk2Car dataset,a real-world benchmark,demonstrate that CAVG establishes new standards in prediction accuracy and operational ***,the model exhibits exceptional performance even with limited training data,ranging from 50%to 75%of the full *** feature highlights its effectiveness and potential for deployment in practical AV ***,CAVG has shown remarkable robustness and adaptability in challenging scenarios,including long-text command interpretation,low-light conditions,ambiguous command contexts,inclement weather conditions,and densely populated urban environments.
暂无评论