This work discusses the time-varying formation(TVF) tracking control problem of high-order multi-agent systems(MASs) with multiple leaders and multiplicative measurement noise. With the help of Lyapunov function tools...
详细信息
This work discusses the time-varying formation(TVF) tracking control problem of high-order multi-agent systems(MASs) with multiple leaders and multiplicative measurement noise. With the help of Lyapunov function tools and stochastic analysis methods, the TVF tracking protocol with multiple leaders and multiplicative noise is developed based on the relative state measurements, where followers are driven to realize the target TVF while tracking the convex combination formed by multiple leaders. Then, the TVF tracking problem is converted into the mean square asymptotic stability problem of a stochastic differential equation(SDE); sufficient conditions related to the control gains are given by stabilizing the corresponding stochastic system. Moreover, a TVF tracking algorithm is presented to outline the steps of protocol ***, the theoretical results are illustrated in terms of simulation examples.
Dear Editor,In this letter, a constrained networked predictive control strategy is proposed for the optimal control problem of complex nonlinear highorder fully actuated (HOFA) systems with noises. The method can effe...
Dear Editor,In this letter, a constrained networked predictive control strategy is proposed for the optimal control problem of complex nonlinear highorder fully actuated (HOFA) systems with noises. The method can effectively deal with nonlinearities, constraints, and noises in the system, optimize the performance metric, and present an upper bound on the stable output of the system.
The rapid development of deep learning provides great convenience for production and ***,the massive labels required for training models limits further ***-shot learning which can obtain a high-performance model by le...
详细信息
The rapid development of deep learning provides great convenience for production and ***,the massive labels required for training models limits further ***-shot learning which can obtain a high-performance model by learning few samples in new tasks,providing a solution for many scenarios that lack *** paper summarizes few-shot learning algorithms in recent years and proposes a ***,we introduce the few-shot learning task and its ***,according to different implementation strategies,few-shot learning methods in recent years are divided into five categories,including data augmentation-based methods,metric learning-based methods,parameter optimization-based methods,external memory-based methods,and other ***,We investigate the application of few-shot learning methods and summarize them from three directions,including computer vision,human-machine language interaction,and robot ***,we analyze the existing few-shot learning methods by comparing evaluation results on mini Image Net,and summarize the whole paper.
This paper proposes an event-triggered stochastic model predictive control for discrete-time linear time-invariant(LTI) systems under additive stochastic disturbances. It first constructs a probabilistic invariant set...
详细信息
This paper proposes an event-triggered stochastic model predictive control for discrete-time linear time-invariant(LTI) systems under additive stochastic disturbances. It first constructs a probabilistic invariant set and a probabilistic reachable set based on the priori knowledge of system *** with enhanced robust tubes, the chance constraints are then formulated into a deterministic form. To alleviate the online computational burden, a novel event-triggered stochastic model predictive control is developed, where the triggering condition is designed based on the past and future optimal trajectory tracking errors in order to achieve a good trade-off between system resource utilization and control performance. Two triggering parameters σ and γ are used to adjust the frequency of solving the optimization problem. The probabilistic feasibility and stability of the system under the event-triggered mechanism are also examined. Finally, numerical studies on the control of a heating, ventilation, and air conditioning(HVAC) system confirm the efficacy of the proposed control.
The hybrid photovoltaic(PV)-battery energy storage system(BESS)plant(HPP)can gain revenue by performing energy arbitrage in low-carbon power ***,multiple operational uncertainties challenge the profitability and relia...
详细信息
The hybrid photovoltaic(PV)-battery energy storage system(BESS)plant(HPP)can gain revenue by performing energy arbitrage in low-carbon power ***,multiple operational uncertainties challenge the profitability and reliability of HPP in the day-ahead *** paper proposes two coherent models to address these ***,a knowledge-driven penalty-based bidding(PBB)model for HPP is established,considering forecast errors of PV generation,market prices,and under-generation ***,a data-driven dynamic error quantification(DEQ)model is used to capture the variational pattern of the distribution of forecast *** role of the DEQ model is to guide the knowledgedriven bidding ***,the DEQ model aims at the statistical optimum,but the knowledge-driven PBB model aims at the operational *** two models have independent optimizations based on misaligned *** address this,the knowledge-data-complementary learning(KDCL)framework is proposed to align data-driven performance with knowledge-driven objectives,thereby enhancing the overall performance of the bidding strategy.A tailored algorithm is proposed to solve the bidding *** proposed bidding strategy is validated by using data from the National Renewable Energy Laboratory(NREL)and the New York Independent System Operator(NYISO).
This paper focuses on the optimal output synchronization control problem of heterogeneous multiagent systems(HMASs) subject to nonidentical communication delays by a reinforcement learning *** with existing studies as...
详细信息
This paper focuses on the optimal output synchronization control problem of heterogeneous multiagent systems(HMASs) subject to nonidentical communication delays by a reinforcement learning *** with existing studies assuming that the precise model of the leader is globally or distributively accessible to all or some of the followers, the leader's precise dynamical model is entirely inaccessible to all the followers in this paper. A data-based learning algorithm is first proposed to reconstruct the leader's unknown system matrix online. A distributed predictor subject to communication delays is further devised to estimate the leader's state, where interaction delays are allowed to be nonidentical. Then, a learning-based local controller, together with a discounted performance function, is projected to reach the optimal output synchronization. Bellman equations and game algebraic Riccati equations are constructed to learn the optimal solution by developing a model-based reinforcement learning(RL) algorithm online without solving regulator equations, which is followed by a model-free off-policy RL algorithm to relax the requirement of all agents' dynamics faced by the model-based RL algorithm. The optimal tracking control of HMASs subject to unknown leader dynamics and communication delays is shown to be solvable under the proposed RL algorithms. Finally, the effectiveness of theoretical analysis is verified by numerical simulations.
This paper considers the value iteration algorithms of stochastic zero-sum linear quadratic games with unkown ***-policy and off-policy learning algorithms are developed to solve the stochastic zero-sum games,where th...
详细信息
This paper considers the value iteration algorithms of stochastic zero-sum linear quadratic games with unkown ***-policy and off-policy learning algorithms are developed to solve the stochastic zero-sum games,where the system dynamics is not *** analyzing the value function iterations,the convergence of the model-based algorithm is *** equivalence of several types of value iteration algorithms is *** effectiveness of model-free algorithms is demonstrated by a numerical example.
作者:
Liu, YangZhang, Jiaming
Seventh Research Division The Center for Information and Control School of Automation Science and Electrical Engineering Beijing100191 China
This article explores the distributed containment control problem for uncertain nonlinear multi-agent systems subject to external disturbances and input quantization, where the constant control gains and the upper bou...
详细信息
This paper studies the moving path following(MPF)problem for fixed-wing unmanned aerial vehicle(UAV)under output constraints and wind *** vehicle is required to converge to a reference path moving with respect to the ...
详细信息
This paper studies the moving path following(MPF)problem for fixed-wing unmanned aerial vehicle(UAV)under output constraints and wind *** vehicle is required to converge to a reference path moving with respect to the inertial frame,while the path following error is not expected to violate the predefined *** from existing moving path following guidance laws,the proposed method removes complex geometric transformation by formulating the moving path following problem into a second-order time-varying control problem.A nominal moving path following guidance law is designed with disturbances and their derivatives estimated by high-order disturbance *** guarantee that the path following error will not exceed the prescribed bounds,a robust control barrier function is developed and incorporated into controller design with quadratic program based *** proposed method does not require the initial position of the UAV to be within predefined *** the safety margin concept makes error-constraint be respected even if in a noisy *** proposed guidance law is validated through numerical simulations of shipboard landing and hardware-in-theloop(HIL)experiments.
We consider the multiobjective optimization problem for the resource allocation of the multiagent network, where each agent contains multiple conflicting local objective functions. The goal is to find compromise solut...
详细信息
We consider the multiobjective optimization problem for the resource allocation of the multiagent network, where each agent contains multiple conflicting local objective functions. The goal is to find compromise solutions minimizing all local objective functions subject to resource constraints as much as possible, i.e., the Pareto optimums. To this end, we first reformulate the multiobjective optimization problem into one single-objective distributed optimization problem by using the weighted Lppreference index,where the weighting factors of all local objective functions are obtained from the optimization procedure so that the optimizer of the latter is the desired Pareto optimum of the former. Next, we propose novel predefined-time algorithms to solve the reformulated problem by time-based generators. We show that the reformulated problem is solved within a predefined time if the local objective functions are strongly convex and smooth. Moreover, the settling time can be arbitrarily preset since it does not depend on the initial values and designed parameters. Finally, numerical simulations are presented to illustrate the effectiveness of the proposed algorithms.
暂无评论