This paper is concerned with a reinforcement learning-based adaptive tracking control technique to tolerate faults for a class of unknown multiple-inputmultiple-output nonlinear discrete-timesystems with less learni...
详细信息
This paper is concerned with a reinforcement learning-based adaptive tracking control technique to tolerate faults for a class of unknown multiple-inputmultiple-output nonlinear discrete-timesystems with less learning parameters. Not only abrupt faults are considered, but also incipient faults are taken into account. Based on the approximation ability of neural networks, action network and critic network are proposed to approximate the optimal signal and to generate the novel cost function, respectively. The remarkable feature of the proposed method is that it can reduce the cost in the procedure of tolerating fault and can decrease the number of learning parameters and thus reduce the computational burden. Stability analysis is given to ensure the uniform boundedness of adaptive control signals and tracking errors. Finally, three simulations are used to show the effectiveness of the present strategy. Note to Practitioners-As the practical engineering systems are becoming more and more complex, it is more likely to suffer from faults which can lead to unpredictable behaviors and serious damages. Therefore, adaptive fault tolerant control technique plays an important role in modern engineering systems. As a matter of fact, it is always desired to minimize the maintenance cost even when some faults occur in the systems, which can reduce the energy consumption in the industrial production process. One way is to integrate the adaptive reinforcement learning algorithm into the fault tolerant controller. Furthermore, a major restriction of the learning algorithm is that a large number of parameters should be tuned online, which directly increase the computational burden. In order to tackle this difficulty, by adjusting the estimated values of the network weight vectors instead of their weights, the number of adaptive learning parameters is decreased, and the computational burden is reduced dramatically.
暂无评论