In the realm of autonomous agents, ensuring safety and reliability in complex and dynamic environments remains a paramount challenge. Safe reinforcement learning addresses these concerns by introducing safety constrai...
详细信息
ISBN:
(数字)9798350377705
ISBN:
(纸本)9798350377712
In the realm of autonomous agents, ensuring safety and reliability in complex and dynamic environments remains a paramount challenge. Safe reinforcement learning addresses these concerns by introducing safety constraints, but still faces challenges in navigating intricate environments such as complex driving situations. To overcome these challenges, we present the safe constraint reward (Safe CoR) framework, a novel method that utilizes two types of expert demonstrations—reward expert demonstrations focusing on performance optimization and safe expert demonstrations prioritizing safety. By exploiting a constraint reward (CoR), our framework guides the agent to balance performance goals of reward sum with safety constraints. We test the proposed framework in diverse environments, including the safety gym, metadrive, and the real-world Jackal platform. Our proposed framework improves algorithm performance by 39% and reduces constraint violations by 88% on the real-world Jackal platform, highlighting its effectiveness. Through this innovative approach, we expect significant advancements in real-world performance, leading to transformative effects in the realm of safe and reliable autonomous agents.
Recent advancements in text-to-image models, such as Stable Diffusion, show significant demographic biases. Existing debiasing techniques rely heavily on additional training, which imposes high computational costs and...
详细信息
While deep learning-based Alzheimer's disease (AD) diagnosis has recently made significant advancements, particularly in predicting the conversion of mild cognitive impairment (MCI) to AD based on MRI images, ther...
This paper presents our system for the BioASQ10b Phase B task. For ideal answers, we used the fine-tuned BioBERT model on the MNLI dataset to construct sentence embeddings and combined it with BERTScore to select sent...
详细信息
The world has suffered from COVID-19 (SARS-CoV-2) for the last two years, causing much damage and change in people’s daily lives. Thus, automated detection of COVID-19 utilizing deep learning on chest computed tomogr...
详细信息
A membership inference attack (MIA) identifies if an instance was included in the victim model's train dataset. Without an appropriate defense mechanism, MIA can result in serious privacy breaches. Although severa...
详细信息
Successful detection of Out-of-Distribution (OoD) data is becoming increasingly important to ensure safe deployment of neural networks. One of the main challenges in OoD detection is that neural networks output overco...
详细信息
This study focuses on the optimization of antireflection coatings (ARCs) to enhance the performance of silicon heterojunction (SHJ) solar cells. SHJ solar cells face a significant challenge in achieving their theoreti...
详细信息
Nuanced dialects are a linguistic variant that pose several challenges for NLP models and techniques. One of the main challenges is the limited amount of datasets to enable extensive research and experimentation. We p...
详细信息
暂无评论