This paper considers the value iteration algorithms of stochastic zero-sum linear quadratic games with unkown ***-policy and off-policy learning algorithms are developed to solve the stochastic zero-sum games,where th...
详细信息
This paper considers the value iteration algorithms of stochastic zero-sum linear quadratic games with unkown ***-policy and off-policy learning algorithms are developed to solve the stochastic zero-sum games,where the system dynamics is not *** analyzing the value function iterations,the convergence of the model-based algorithm is *** equivalence of several types of value iteration algorithms is *** effectiveness of model-free algorithms is demonstrated by a numerical example.
Opacity is a central concept in the issue of privacy security and has been studied extensively in fields such as finite automata, probabilistic automata, and stochastic automata. Here, we investigate the problem of va...
详细信息
Opacity is a central concept in the issue of privacy security and has been studied extensively in fields such as finite automata, probabilistic automata, and stochastic automata. Here, we investigate the problem of validating multi-step opaque properties through unambiguous weighted machines from the perspective of cyber-physical systems. First, the notion of multi-step state-based opacity for unambiguous weighted machines is presented and defined. It includes two variants of delays with a finite K and infinite steps. Subsequently, the weighted state estimate with K(infinite)-step delay is established by abstracting the possible state set that the system could have through these weighted observations. Meanwhile, to keep the observable weighted sequence consistent between the bidirectional observers, the unobserved weights of the reverse weighted machine are assumed to be reserved. Subsequently, the existence conditions are developed, and the corresponding algorithms, termed the weighted bidirectional observer, are generalized to verify these properties. Finally, several numerical examples are illustrated to demonstrate the effectiveness of the proposed method. Taken together, the current approach will be conducive to a deep understanding of the security and privacy of cyber-physical systems.
In this paper, a command filter-based adaptive fuzzy predefined-time event-triggered tracking control problem is investigated for uncertain nonlinear systems with time-varying full-state constraints. By designing a sl...
详细信息
In this paper, a command filter-based adaptive fuzzy predefined-time event-triggered tracking control problem is investigated for uncertain nonlinear systems with time-varying full-state constraints. By designing a sliding mode differentiator, the inherent computational complexity problem within the predefined-time backstepping framework is solved. Different from the existing command filter-based finite-time and fixed-time control strategies that the convergence time of the filtering error is adjusted through the system initial value or numerous parameters, a novel command filtering error compensation method is presented,which tunes one control parameter to make the filtering error converge in the predefined time, thereby reducing the complexity of design and analysis of processing the filtering error. Then, an improved event-triggered mechanism(ETM) that builds upon the switching threshold strategy, in which an inverse cotangent function is designed to replace the residual term of the ETM,is proposed to gradually release the controller's dependence on the residual term with increasing time. Furthermore, a tan-type nonlinear mapping technique is applied to tackle the time-varying full-state constraints problem. By the predefined-time stability theory, all signals in the uncertain nonlinear systems exhibit predefined-time stability. Finally, the feasibility of the proposed algorithm is substantiated through two simulation results.
Coalition formation(CF) refers to reasonably organizing robots and/or humans to form coalitions that can satisfy mission requirements, attracting more and more attention in many fields such as multirobot collaboration...
详细信息
Coalition formation(CF) refers to reasonably organizing robots and/or humans to form coalitions that can satisfy mission requirements, attracting more and more attention in many fields such as multirobot collaboration and human-robot collaboration. However, the analysis on CF problems remains *** provide a valuable study reference for researchers interested in CF, this paper proposed a capabilitycentric analysis of the CF problem. The key problem elements of CF are firstly extracted by referencing the concepts of the 5W1H method. That is, objects(who) form coalitions(what) to accomplish missions(why) by aggregating capabilities(how) in a specific environment(where-when). Then, a multi-view analysis of these elements and their correlation in terms of capabilities is proposed through various logic diagrams, structure charts, etc. Finally, to facilitate a deeper understanding of capability-centric CF, a general mathematical model is constructed, demonstrating how the different concepts discussed in this analysis contribute to the overall model.
Angular random walk (ARW), rate random walk (RRW), and bias instability (BI) are the main noise types in inertial measurement units (IMUs) and thus determine the navigation performance of IMUs. BI is the flicker noise...
详细信息
Dear Editor,This letter deals with the robustness problem of gait recognition method against maximum number of clothing *** selecting four kinds of time-varying silhouette features,gait dynamics underlying different i...
详细信息
Dear Editor,This letter deals with the robustness problem of gait recognition method against maximum number of clothing *** selecting four kinds of time-varying silhouette features,gait dynamics underlying different individuals’gait features is effectively modeled by radial basis function(RBF)neural networks through deterministic *** kind of dynamics information has little sensitivity to the variance between gait patterns under different clothing *** order to eliminate the effect of clothing differences,the training patterns under different clothing conditions further constitute a uniform training dataset,containing all kinds of gait dynamics under different clothing conditions.A rapid recognition scheme is presented on published gait *** experiments demonstrate the efficacy of the proposed method.
This paper focuses on the optimal output synchronization control problem of heterogeneous multiagent systems(HMASs) subject to nonidentical communication delays by a reinforcement learning *** with existing studies as...
详细信息
This paper focuses on the optimal output synchronization control problem of heterogeneous multiagent systems(HMASs) subject to nonidentical communication delays by a reinforcement learning *** with existing studies assuming that the precise model of the leader is globally or distributively accessible to all or some of the followers, the leader's precise dynamical model is entirely inaccessible to all the followers in this paper. A data-based learning algorithm is first proposed to reconstruct the leader's unknown system matrix online. A distributed predictor subject to communication delays is further devised to estimate the leader's state, where interaction delays are allowed to be nonidentical. Then, a learning-based local controller, together with a discounted performance function, is projected to reach the optimal output synchronization. Bellman equations and game algebraic Riccati equations are constructed to learn the optimal solution by developing a model-based reinforcement learning(RL) algorithm online without solving regulator equations, which is followed by a model-free off-policy RL algorithm to relax the requirement of all agents' dynamics faced by the model-based RL algorithm. The optimal tracking control of HMASs subject to unknown leader dynamics and communication delays is shown to be solvable under the proposed RL algorithms. Finally, the effectiveness of theoretical analysis is verified by numerical simulations.
Dear Editor,In this letter, a constrained networked predictive control strategy is proposed for the optimal control problem of complex nonlinear highorder fully actuated (HOFA) systems with noises. The method can effe...
Dear Editor,In this letter, a constrained networked predictive control strategy is proposed for the optimal control problem of complex nonlinear highorder fully actuated (HOFA) systems with noises. The method can effectively deal with nonlinearities, constraints, and noises in the system, optimize the performance metric, and present an upper bound on the stable output of the system.
Noncooperative differential games provide a basis for the study of coordination, conflict, and control for a single dynamical system with multiple players. Within the linear-quadratic differential games (LQDGs), the o...
详细信息
The intelligence security control problem of multiagent systems in the presence of actuator faults is investigated against denial-of-service (DoS) attacks. Noticing that the DoS attacks usually make the actuator signa...
详细信息
暂无评论