Safe reinforcement learning (SRL) aims to realize a safe learning process for deep reinforcement learning (DRL) algorithms by incorporating safety constraints. However, the efficacy of SRL approaches often relies on a...
详细信息
Safe reinforcement learning (SRL) aims to realize a safe learning process for deep reinforcement learning (DRL) algorithms by incorporating safety constraints. However, the efficacy of SRL approaches often relies on accurate function approximations, which are notably challenging to achieve in the early learning stages due to data insufficiency. To address this issue, we introduce, in this work, a novel generalizable safety enhancer (GenSafe) that can overcome the challenge of data insufficiency and enhance the performance of SRL approaches. Leveraging model order reduction techniques, we first propose an innovative method to construct a reduced order Markov decision process (ROMDP) as a low-dimensional approximator of the original safety constraints. Then, by solving the reformulated ROMDP-based constraints, GenSafe refines the actions of the agent to increase the possibility of constraint satisfaction. Essentially, GenSafe acts as an additional safety layer for SRL algorithms. We evaluate GenSafe on multiple SRL approaches and benchmark problems. The results demonstrate its capability to improve safety performance, especially in the early learning phases, while maintaining satisfactory task performance. Our proposed GenSafe not only offers a novel measure to augment existing SRL methods but also shows broad compatibility with various SRL algorithms, making it applicable to a wide range of systems and SRL problems.
As the basic parts of workshop processing, the life of the tool will affect the processing efficiency and processing quality, and the life of the tool will be affected by a variety of uncertain factors in the processi...
详细信息
The "Smart Roads: Lighting the Way to Safety and Efficiency"project introduces an innovative solution to improve the efficiency and sustainability of outdoor lighting systems. This research focuses on develo...
详细信息
Battery energy storage systems (BESS) play an essential role in modern grids by supporting renewable power systems, improving grid power quality through voltage and frequency regulation, and supporting electric vehicl...
详细信息
In recent works, using voice transformation functions (VTF) in optimal shifting of formants has improved near-end speech intelligibility. Though these VTFs are promising, they are computationally expensive to optimize...
详细信息
Modern power systems are experiencing a rapid movement from fossil-based generations to renewable energy resources (RERs) due to concerns about the environment and the dependence on fossil fuel sources. However, the r...
详细信息
The search for optimal finite-length binary block codes is a long-standing open problem for memoryless binary symmetric channels (BSCs) with the maximum likelihood decoding. A recent work studied the optimal codes amo...
详细信息
Image fusion plays a significant role in computer vision since numerous applications benefit from the fusion results. The existing image fusion methods are incapable of perceiving the most discriminative regions under...
详细信息
Purpose - In Business Process Management (BPM), accurate prediction of the next activities is vital for operational efficiency and decision-making. Current Artificial Intelligence (AI)/Machine Learning (ML) models str...
详细信息
With the emergence of the COVID-19 pandemic,the World Health Organization(WHO)has urged scientists and industrialists to exploremodern information and communication technology(ICT)as a means to reduce or even eliminat...
详细信息
With the emergence of the COVID-19 pandemic,the World Health Organization(WHO)has urged scientists and industrialists to exploremodern information and communication technology(ICT)as a means to reduce or even eliminate *** World Health Organization recently reported that the virus may infect the organism through any organ in the living body,such as the respiratory,the immunity,the nervous,the digestive,or the cardiovascular *** the abovementioned goal,we envision an implanted nanosystem embedded in the intra living-body *** main function of the nanosystem is either to perform diagnosis and mitigation of infectious diseases or to implement a targeted drug delivery system(i.e.,delivery of the therapeutic drug to the diseased tissue or targeted cell).The communication among the nanomachines is accomplished via communication-based molecular *** control/interconnection of the nanosystem is accomplished through the utilization of Internet of bio-nano things(IoBNT).The proposed nanosystem is designed to employ a coded relay nanomachine disciplined by the decode and forward(DF)principle to ensure reliable drug delivery to the targeted ***,both the sensitivity of the drug dose and the phenomenon of drug molecules loss before delivery to the target cell site in long-distance due to the molecules diffusion process are taken into *** this paper,a coded relay NM with conventional coding techniques such as RS and Turbo codes is selected to achieve minimum bit error rate(BER)performance and high signal-to-noise ratio(SNR),while the detection process is based on maximum likelihood(ML)probability and minimum error probability(MEP).The performance analysis of the proposed scheme is evaluated in terms of channel capacity and bit error rate by varying system parameters such as relay position,number of released molecules,relay and receiver *** results are validated through simulation and demonstrate that the proposed scheme can
暂无评论