This paper studies a novel distributed optimization problem that aims to minimize the sum of the non-convex objective functionals of the multi-agent network under privacy protection, which means that the local objecti...
详细信息
This paper studies a novel distributed optimization problem that aims to minimize the sum of the non-convex objective functionals of the multi-agent network under privacy protection, which means that the local objective of each agent is unknown to others. The above problem involves complexity simultaneously in the time and space aspects. Yet existing works about distributed optimization mainly consider privacy protection in the space aspect where the decision variable is a vector with finite dimensions. In contrast, when the time aspect is considered in this paper, the decision variable is a continuous function concerning time. Hence, the minimization of the overall functional belongs to the calculus of variations. Traditional works usually aim to seek the optimal decision function. Due to privacy protection and non-convexity, the Euler-Lagrange equation of the proposed problem is a complicated partial differential ***, we seek the optimal decision derivative function rather than the decision function. This manner can be regarded as seeking the control input for an optimal control problem, for which we propose a centralized reinforcement learning(RL) framework. In the space aspect, we further present a distributed reinforcement learning framework to deal with the impact of privacy protection. Finally, rigorous theoretical analysis and simulation validate the effectiveness of our framework.
In this paper,a novel adaptive Fault-Tolerant control(FTC)strategy is proposed for non-minimum phase Hypersonic Vehicles(HSVs)that are affected by actuator faults and parameter *** strategy is based on the output rede...
详细信息
In this paper,a novel adaptive Fault-Tolerant control(FTC)strategy is proposed for non-minimum phase Hypersonic Vehicles(HSVs)that are affected by actuator faults and parameter *** strategy is based on the output redefinition method and Adaptive Dynamic Programming(ADP).The intelligent FTC scheme consists of two main parts:a basic fault-tolerant and stable controller and an ADP-based supplementary *** the basic FTC part,an output redefinition approach is designed to make zero-dynamics stable with respect to the new ***,Ideal Internal Dynamic(IID)is obtained using an optimal bounded inversion approach,and a tracking controller is designed for the new output to realize output tracking of the nonminimum phase HSV *** the ADP-based compensation control part,an ActionDependent Heuristic Dynamic Programming(ADHDP)adopting an actor-critic learning structure is utilized to further optimize the tracking performance of the HSV control ***,simulation results are provided to verify the effectiveness and efficiency of the proposed FTC algorithm.
Strongly stealthy cyber-attacks in networked vehicle queue (V2V) network environments often manifest themselves as small changes to data, which makes it difficult for existing methods to effectively detect such attack...
详细信息
In this paper,the authors consider the inverse problem for the Moore-Gibson-Thompson equation with a memory term and variable diffusivity,which introduce a sort of delay in the dynamics,producing nonlocal effects in *...
详细信息
In this paper,the authors consider the inverse problem for the Moore-Gibson-Thompson equation with a memory term and variable diffusivity,which introduce a sort of delay in the dynamics,producing nonlocal effects in *** H¨older stability of simultaneously determining the spatially varying viscosity coefficient and the source term is obtained by means of the key pointwise Carleman estimate for the Moore-Gibson-Thompson *** the sake of generality in mathematical tools,the analysis of this paper is discussed within the framework of Riemannian geometry.
This study presents a novel application of computer vision technology to enhance safety in the automotive service industry, with a specific focus on the real-time detection of Personal Protective Equipment (PPE) usage...
详细信息
This article presents developments by its authors: the IndustrialKit library package and the RCWorkspace application based on it. The library allows developers of third-party applications to use the functionality of p...
详细信息
The traditional energy hub based model has difficulties in clearly describing the state transition and transition conditions of the energy unit in the integrated energy system(IES).Therefore,this study proposes a stat...
详细信息
The traditional energy hub based model has difficulties in clearly describing the state transition and transition conditions of the energy unit in the integrated energy system(IES).Therefore,this study proposes a state transition modeling method for an IES based on a cyber-physical system(CPS)to optimize the state transition of energy unit in the *** method uses the physical,integration,and optimization layers as a three-layer modeling *** physical layer is used to describe the physical models of energy units in the *** the integration layer,the information flow is integrated into the physical model of energy unit in the IES to establish the state transition model,and the transition conditions between different states of the energy unit are *** optimization layer aims to minimize the operating cost of the IES and enables the operating state of energy units to be transferred to the target *** simulations show that,compared with the traditional modeling method,the state transition modeling method based on CPS achieves the observability of the operating state of the energy unit and its state transition in the dispatching cycle,which obtains an optimal state of the energy unit and further reduces the system operating costs.
In this study, a data-driven learning algorithm was developed to estimate the optimal distributed cooperative control policy, which solves the cooperative optimal output regulation problem for linear discretetime mult...
详细信息
In this study, a data-driven learning algorithm was developed to estimate the optimal distributed cooperative control policy, which solves the cooperative optimal output regulation problem for linear discretetime multi-agent systems. Notably, the dynamics of all the agent systems and exo-system is completely unknown. By combining adaptive dynamic programming with an internal model, a model-free off-policy learning method is proposed to estimate the optimal control gain and the distributed adaptive internal model by only accessing the measurable data of multi-agent systems. Moreover, different from the traditional cooperative adaptive controller design method, a distributed internal model is approximated online. Convergence and stability analyses show that the estimate controller generated by the proposed data-driven learning algorithm converges to the optimal distributed controller. Finally, simulation results verify the effectiveness of the proposed method.
In this technical note, the design of a Lyapunov-based trajectory tracking controller is introduced for an underactuated quadrotor unmanned aerial vehicle (QUAV). First, the mathematical dynamics of QUAV are introduce...
详细信息
This paper investigates a finite-time tracking problem for the uncertainty nonlinear systems in nonstrict-feedback form,in which the output signal is restricted in a *** on the barrier Lyapunov function and dynamic su...
详细信息
This paper investigates a finite-time tracking problem for the uncertainty nonlinear systems in nonstrict-feedback form,in which the output signal is restricted in a *** on the barrier Lyapunov function and dynamic surface control scheme,a novel adaptive neural controller is proposed by using the finite-time Lyapunov *** the aforementioned literature on finite time tracking control,the violation of system output constraint is avoided by combining the barrier Lyapunov function method with finite-time *** structural characteristics of neural network is introduced to expand the adaptive neural finite-time backstepping method to the uncertainty nonlinear systems in the non-strict ***,the dynamic surface control is introduced to cope with the problem of“explosion of complexity”inherent in conventional backstepping *** is shown that the designed controller can achieve finite-time tracking control and all the variables in the closed-loop system are bounded with output constraint guaranteed form stability analysis and simulation results.
暂无评论