This article delves into a data-driven approach for solving the adaptive optimal output regulation problem in continuous-time linear systems with uncertain dynamics. Employing a class of parametric Lyapunov equations ...
详细信息
In this paper, a policy iteration-based Q-learning algorithm is proposed to solve infinite horizon linear nonzero-sum quadratic differential games with completely unknown dynamics. The Q-learning algorithm, which empl...
详细信息
In this paper, a policy iteration-based Q-learning algorithm is proposed to solve infinite horizon linear nonzero-sum quadratic differential games with completely unknown dynamics. The Q-learning algorithm, which employs off-policy reinforcement learning(RL), can learn the Nash equilibrium and the corresponding value functions online, using the data sets generated by behavior policies. First, we prove equivalence between the proposed off-policy Q-learning algorithm and an offline PI algorithm by selecting specific initially admissible polices that can be learned online. Then, the convergence of the off-policy Qlearning algorithm is proved under a mild rank condition that can be easily met by injecting appropriate probing noises into behavior policies. The generated data sets can be repeatedly used during the learning process, which is computationally effective. The simulation results demonstrate the effectiveness of the proposed Q-learning algorithm.
In this paper, influences of actuator nonlinearities and their suppression methods are studied. Firstly, the influences of the dead zone and backlash characteristics of the actuator on the control performance of stati...
详细信息
Oriented towards the requirements for reliable positioning and navigation of aircraft under the condition of rejection of navigation satellite, we proposed cross-domain guide positioning methods based on multi-layer n...
详细信息
This study examines a multi-player pursuit-evasion game, more specifically, a three-player lifeline game in a planar environment, where a single evader is tasked with reaching a lifeline prior to capture. A decomposit...
详细信息
This study examines a multi-player pursuit-evasion game, more specifically, a three-player lifeline game in a planar environment, where a single evader is tasked with reaching a lifeline prior to capture. A decomposition method based on an explicit policy is proposed to address the game qualitatively from two main aspects:(1) the evader's position distribution to guarantee winning the game(i.e., the escape zone),which is based on the premise of knowing the pursuers' positions initially, and(2) evasion strategies in the escape zone. First, this study decomposes the three-player lifeline game into two two-player sub-games and obtains an analytic expression of the escape zone by constructing a barrier, which is an integration of the solutions of two sub-games. This study then explicitly partitions the escape zone into several regions and derives an evasion strategy for each region. In particular, this study provides a resultant force method for the evader to balance the active goal of reaching the lifeline and the passive goal of avoiding capture. Finally,some examples from a lifeline game involving more than one pursuer are used to verify the effectiveness and scalability of the evasion strategies.
Swarm situational awareness technology for uncrewed aerial vehicles (UAVs) is a core technology for mastering environmental information and gaining decision-making advantages. However, evaluation indicators and valida...
详细信息
The multi-target association algorithm of unmanned aerial vehicles (UAV) clusters in unknown environments is used to solve the problem of multiple UAVs, which can correctly determine whether the data from different pe...
详细信息
In this paper, a class of linear quadratic differential game guidance scheme is presented to study the pursuit and evasion conflict scenario that an evader establishes a static obstacle and performs the evasion maneuv...
详细信息
Considering the azimuth discrepancy between the main and sub-inertial navigation systems (INS) for a platform-carried unmanned aerial vehicles(UAV) in case of tilted launching at a large pitch angle, an initial alignm...
详细信息
This paper focuses on the problem of ultra-low-altitude flight obstacle distance estimation for monocular vision-assisted aircraft. First, the basic error model of distance estimation using sequence images is derived ...
详细信息
暂无评论