In this paper, we present two simple and novel methods for automatic personalization of target blood glucose concentration values for individuals with Type 2 Diabetes (T2D). The methods can be integrated with any insu...
In this paper, we present two simple and novel methods for automatic personalization of target blood glucose concentration values for individuals with Type 2 Diabetes (T2D). The methods can be integrated with any insulin dosing algorithm, or used to provide an individualized reference BG concentration value for medical professionals to consider when determining long-acting insulin doses and other oral medications. The proposed methods were tested in three different simulation models, with different long-acting insulin dosing strategies, and were found to reduce instances of hypoglycemia.
In the electronic engineering undergraduate program, Peruvian universities offer different theoretical courses of Automatic control that do not present enough practical approach to the algorithms that are studied duri...
详细信息
In this paper, we consider the problem of safety assessment for Markov decision processes without explicit knowledge of the model. We aim to learn probabilistic safety specifications associated with a given policy wit...
详细信息
ISBN:
(数字)9783907144107
ISBN:
(纸本)9798331540920
In this paper, we consider the problem of safety assessment for Markov decision processes without explicit knowledge of the model. We aim to learn probabilistic safety specifications associated with a given policy without compromising the safety of the process. To accomplish our goal, we characterize a subset of the state-space namely proxy set, which contains the states that are near in a probabilistic sense to the forbidden set consisting of all unsafe states. We compute the safety function using the single-step temporal difference method. To thi
s
end, we relate the safety function computation to that of the value function estimation using temporal difference learning. Since the given control policy could be unsafe, we use a safe baseline sub-policy to generate data for learning. We then use an off-policy temporal difference learning method with importance sampling to learn the safety function corresponding to the given policy. Finally, we demonstrate our results using a numerical example.
In this paper, we present an online reinforcement learning algorithm for constrained Markov decision processes with a safety constraint. Despite the necessary attention of the scientific community, considering stochas...
详细信息
ISBN:
(数字)9798350316339
ISBN:
(纸本)9798350316346
In this paper, we present an online reinforcement learning algorithm for constrained Markov decision processes with a safety constraint. Despite the necessary attention of the scientific community, considering stochastic stopping time, the problem of learning optimal policy without violating safety constraints during the learning phase is yet to be addressed. To this end, we propose an algorithm based on linear programming that does not require a process model. We show that the learned policy is safe with high confidence. We also propose a method to compute a safe baseline policy, which is central in developing algorithms that do not violate the safety constraints. Finally, we provide simulation results to show the efficacy of the proposed algorithm. Further, we demonstrate that efficient exploration can be achieved by defining a subset of the state-space called proxy set.
This paper introduces a safe force/position tracking control strategy designed for Free-Floating Mobile Manipulator Systems (MMSs) engaging in compliant contact with planar surfaces. The strategy uniquely integrates t...
详细信息
ISBN:
(数字)9783907144107
ISBN:
(纸本)9798331540920
This paper introduces a safe force/position tracking control strategy designed for Free-Floating Mobile Manipulator Systems (MMSs) engaging in compliant contact with planar surfaces. The strategy uniquely integrates the control Barrier Function (CBF) to manage operational limitations and safety concerns. It effectively addresses safety-critical aspects in the kinematic as well as dynamic level, such as manipulator joint limits, system velocity constraints, and inherent system dynamic uncertainties. The proposed strategy remains robust to the uncertainties of the MMS dynamic model, external disturbances, or variations in the contact stiffness model. The proposed control method has low computational demand ensures easy implementation on onboard computing systems, endorsing real-time operations. Simulation results verify the strategy's efficacy, reflecting enhanced system performance and safety.
This paper introduces a safe force/position tracking control strategy designed for Free-Floating Mobile Manipulator Systems (MMSs) engaging in compliant contact with planar surfaces. The strategy uniquely integrates t...
详细信息
In variational quantum algorithms (VQAs), the most common objective is to find the minimum energy eigenstate of a given energy Hamiltonian. In this paper, we consider the general problem of finding a sufficient contro...
详细信息
In variational quantum algorithms (VQAs), the most common objective is to find the minimum energy eigenstate of a given energy Hamiltonian. In this paper, we consider the general problem of finding a sufficient control Hamiltonian structure that, under a given feedback control law, ensures convergence to the minimum energy eigenstate of a given energy function. By including quantum non-demolition (QND) measurements in the loop, convergence to a pure state can be ensured from an arbitrary mixed initial state. Based on existing results on strict control Lyapunov functions, we formulate a semidefinite optimization problem, whose solution defines a non-unique control Hamiltonian, which is sufficient to ensure almost sure convergence to the minimum energy eigenstate under the given feedback law and the action of QND measurements. A numerical example is provided to showcase the proposed methodology.
In this paper, we present an online reinforcement learning algorithm for constrained Markov decision processes with a safety constraint. Despite the necessary attention of the scientific community, considering stochas...
详细信息
The stochastic reach-avoid problem termed p-safety is further examined in the context of space debris and short-term orbital encounters. We define the collision probability problem, and reformulate it as a strong p-sa...
详细信息
ISBN:
(数字)9781665467612
ISBN:
(纸本)9781665467629
The stochastic reach-avoid problem termed p-safety is further examined in the context of space debris and short-term orbital encounters. We define the collision probability problem, and reformulate it as a strong p-safety problem, which offers a computable solution. Enabling computation comes at the cost of a more restrictive formulation which requires several relaxation schemes. To this end, Bernstein forms are employed as polynomial approximation of the nonlinear dynamics, and sum-of-squares as bases to attain certificates of positivity. Finally, a stochastic version of the unperturbed planetary equations is used to model the dynamics.
Recently, feedback-based quantum algorithms have been introduced to calculate the ground states of Hamiltonians, inspired by quantum Lyapunov control theory. This paper aims to generalize these algorithms to the probl...
详细信息
暂无评论