This article covers the design, implementation, mathematical modelling, and control of a multivariable, underactuated, low-cost, three-degrees-of-freedom experimental helicopter system (namely a 3-DOF helicopter). The...
详细信息
In coming up with solutions to real-world problems, humans implicitly adhere to constraints that are too numerous and complex to be specified completely. However, reinforcement learning (RL) agents need these constrai...
详细信息
In coming up with solutions to real-world problems, humans implicitly adhere to constraints that are too numerous and complex to be specified completely. However, reinforcement learning (RL) agents need these constraints to learn the correct optimal policy in these settings. The field of Inverse Constraint Reinforcement Learning (ICRL) deals with this problem and provides algorithms that aim to estimate the constraints from expert demonstrations collected offline. Practitioners prefer to know a measure of confidence in the estimated constraints, before deciding to use these constraints, which allows them to only use the constraints that satisfy a desired level of confidence. However, prior works do not allow users to provide the desired level of confidence for the inferred constraints. This work provides a principled ICRL method that can take a confidence level with a set of expert demonstrations and outputs a constraint that is at least as constraining as the true underlying constraint with the desired level of confidence. Further, unlike previous methods, this method allows a user to know if the number of expert trajectories is insufficient to learn a constraint with a desired level of confidence, and therefore collect more expert trajectories as required to simultaneously learn constraints with the desired level of confidence and a policy that achieves the desired level of performance. Copyright 2024 by the author(s)
Multi-view semi-supervised classification primarily aims to enhance classification accuracy when dealing with limited labeled samples. Although existing methods have shown impressive performance, significant challenge...
详细信息
This paper studies the fixed-time consensus tracking problem of nonlinear multi-agent systems, where communication links are subjected to denial-of-service (DoS) attacks. The DoS attacks make the communication network...
详细信息
Despite the superior performance of large language models to generate natural language texts, it is hard to generate texts with correct logic according to a given task, due to the difficulties for neural models to cap...
Relationships were discussed in this work between discharge current and electrode moving speed, ionization coefficient, field strength, gas pressure, temperature, humidity and other factors. Gas flow distribution arou...
详细信息
In our study, we investigate how the brain maps environmental spaces into understandable maps through hippocampal place cells and entorhinal cortex grid cells. We uncover that the hippocampus and entorhinal cortex are...
详细信息
In the process of drawing architectural drawings with AutoCAD software, enterprises will produce a large number of CAD drawings in DWG format. The tables of these CAD drawings contain rich textual information. These t...
详细信息
As a key component of the Photonic network on chip (PNoC). The electro-optic modulator converts the electrical signal of the processor into optical signal, which is transmitted in the PNoC. The performance of the PNoC...
详细信息
Mamba, a state-space model with selective mechanisms and hardware-aware architecture, has demonstrated outstanding performance in long sequence modeling tasks, particularly garnering widespread exploration and applica...
暂无评论