In previous work it was shown that adaptive-Critic-type Approximate dynamicprogramming could be applied in a higher-level way to create autonomous agents capable of using experience to discern context and select opti...
详细信息
In this paper we propose to integrate the recursive Levenberg-Marquardt method into the adaptivedynamicprogramming (ADP) design for improved learning and adaptive control performance. Our key motivation is to consid...
详细信息
This paper proposes a supervised adaptivedynamicprogramming (SADP) algorithm for the full range adaptive cruise control (ACC) system. The full range ACC system considers both the ACC situation in highway system and ...
详细信息
This paper discusses how the principles of adaptivedynamicprogramming (ADP) can be applied to the control of a quadrotor helicopter platform flying in an uncontrolled environment and subjected to various disturbance...
详细信息
An intelligent optimal control scheme for unknown nonlinear discrete-time systems with discount factor in the cost function is proposed in this paper. An iterative adaptivedynamicprogramming (ADP) algorithm via glob...
详细信息
Incomplete or imprecise models of control systems make it difficult to find an appropriate structure and parameter set for a corresponding control policy. These problems are addressed by reinforcementlearning algorit...
详细信息
This paper explores a novel application for reinforcementlearning (RL) techniques to sequential mastery testing. In such systems, the goal is to classify each examined person, using the minimal number of test items, ...
详细信息
Approximate policy iteration (API) has been shown to be a class of reinforcementlearning methods with stability and sample efficiency. However, sample collection is still an open problem which is critical to the perf...
详细信息
A common technique for dealing with the curse of dimensionality in approximate dynamicprogramming is to use a parametric value function approximation, where the value of being in a state is assumed to be a linear com...
详细信息
reinforcementlearning offers a very general framework for learning controllers, but its effectiveness is closely tied to the controller parameterization used. Especially when learning feedback controllers for weakly ...
详细信息
暂无评论