In this paper,we investigate a decentralized diagnosis problem of a discrete-evnt system(DES) subject to unreliable sensors,where the sensor observations of local diagnosers may be non-deterministic as a result of pos...
详细信息
In this paper,we investigate a decentralized diagnosis problem of a discrete-evnt system(DES) subject to unreliable sensors,where the sensor observations of local diagnosers may be non-deterministic as a result of possible *** studies on decentralized robust diagnosis can only deal with different types of sensor failures separately,e.g.,all sensors suffer from the same type of sensor failures such as intermittent sensor failures or permanent sensor ***,since sensors of different local diagnosers may face different external environments and have different internal characteristics,sensors corresponding to different local diagnosers may also have their own fault *** this paper,we propose a flexible framework of decentralized diagnosis for DES subject to unreliable sensors such that sensors of different local diagnosers are permitted to have different types of sensor *** this end,we extend the existing decentralized diagnosis framework to the case where there exist common sensors broadcasting their observations to all local *** apply linear temporal logic(LTL) to constrain infinite behaviors of private sensors of local diagnosers and common ***,a new notion of φ-codiagnosability is proposed as the necessary and sufficient condition for the existence of a decentralized diagnoser that works correctly under sensors,satisfying LTL-based sensor ***,we provide an effective approach to verify the φ-codiagnosability.
In this paper, a distributed adaptive dynamic programming(ADP) framework based on value iteration is proposed for multi-player differential games. In the game setting,players have no access to the information of other...
详细信息
In this paper, a distributed adaptive dynamic programming(ADP) framework based on value iteration is proposed for multi-player differential games. In the game setting,players have no access to the information of others' system parameters or control laws. Each player adopts an on-policy value iteration algorithm as the basic learning framework. To deal with the incomplete information structure, players collect a period of system trajectory data to compensate for the lack of information. The policy updating step is implemented by a nonlinear optimization problem aiming to search for the proximal admissible policy. Theoretical analysis shows that by adopting proximal policy searching rules, the approximated policies can converge to a neighborhood of equilibrium policies. The efficacy of our method is illustrated by three examples, which also demonstrate that the proposed method can accelerate the learning process compared with the centralized learning framework.
High-level task planning under adversarial environments is one of the central problems in the development of autonomous systems such as unmanned ground vehicles (UGV). Existing works commonly assume that the decision-...
详细信息
High-level task planning under adversarial environments is one of the central problems in the development of autonomous systems such as unmanned ground vehicles (UGV). Existing works commonly assume that the decision-maker such as UAV shares the same information with the environment. However, in many scenarios, the UGV, as an integral part of the system, generally has more information than the external adversary. For such a scenario, the decision-maker with more information may achieve better performance by using deceptive strategies. In this paper, we investigate the problem of optimal deceptive strategy synthesis for autonomous systems under asymmetric information between the internal decision-maker and the external adversary. Specifically, we model the dynamic system as a weighted two-player graph game and the objective is to optimize the mean payoff value per task. To capture the asymmetric information between two parties, we assume that the UGV has complete knowledge of the system, whereas the adversary may have misconceptions regarding the task as well as the cost. To synthesize an optimal deceptive strategy, we propose a synthesis algorithm based on hyper-games. The correctness as well as the complexity of the algorithm are analyzed. We illustrate the proposed algorithm by running examples as well as a simulation case study. Finally, we conduct an empirical experiment using real-world scenarios to verify the practical applicability of our algorithm. IEEE
Aiming at the problems of low accuracy and poor robustness that existed in the current hot rolling strip width spread model,an improved strip spread prediction model based on a material forming mechanism and Bayesian ...
详细信息
Aiming at the problems of low accuracy and poor robustness that existed in the current hot rolling strip width spread model,an improved strip spread prediction model based on a material forming mechanism and Bayesian optimized adaptive differential evolution algorithm(BADE)was *** first,we improved the original spread mechanism model by adding the weight and bias term to enhance the model robustness based on rolling ***,the BADE algorithm was proposed to optimize the improved spread mechanism *** optimization algorithm is based on a novel adaptive differential evolution algorithm,which can effectively achieve the global optimal ***,the prediction performances of five machine learning algorithms were compared in *** results show that the prediction accuracy of the improved spread model is obviously better than that of the machine learning algorithms,which proves the effectiveness of the proposed method.
In artificial intelligence(AI)based-complex power system management and control technology,one of the urgent tasks is to evaluate AI intelligence and invent a way of autonomous intelligence ***,there is,currently,near...
详细信息
In artificial intelligence(AI)based-complex power system management and control technology,one of the urgent tasks is to evaluate AI intelligence and invent a way of autonomous intelligence ***,there is,currently,nearly no standard technical framework for objective and quantitative intelligence *** this article,based on a parallel system framework,a method is established to objectively and quantitatively assess the intelligence level of an AI agent for active power corrective control of modern power systems,by resorting to human intelligence evaluation *** this basis,this article puts forward an AI self-evolution method based on intelligence assessment through embedding a quantitative intelligence assessment method into automated reinforcement learning(AutoRL)systems.A parallel system based quantitative assessment and self-evolution(PLASE)system for power grid corrective control AI is thereby constructed,taking Bayesian Optimization as the measure of AI evolution to fulfill autonomous evolution of AI under guidance of their intelligence assessment *** results exemplified in the power grid corrective control AI agent show the PLASE system can reliably and quantitatively assess the intelligence level of the power grid corrective control agent,and it could promote evolution of the power grid corrective control agent under guidance of intelligence assessment results,effectively,as well as intuitively improving its intelligence level through selfevolution.
This paper presents a novel cooperative value iteration(VI)-based adaptive dynamic programming method for multi-player differential game models with a convergence *** players are divided into two groups in the learnin...
详细信息
This paper presents a novel cooperative value iteration(VI)-based adaptive dynamic programming method for multi-player differential game models with a convergence *** players are divided into two groups in the learning process and adapt their policies *** method removes the dependence of admissible initial policies,which is one of the main drawbacks of the PI-based ***,this algorithm enables the players to adapt their control policies without full knowledge of others’ system parameters or control *** efficacy of our method is illustrated by three examples.
This paper presents a new proportional-integral-derivative control approach for positive switched systems based on a positive proportional-integral observer. First, a positive proportional-integral observer is constru...
详细信息
This paper presents a new proportional-integral-derivative control approach for positive switched systems based on a positive proportional-integral observer. First, a positive proportional-integral observer is constructed. By using the observer state, the proportional-integral-derivative control and the corresponding integral part are designed, respectively. Using 1-norm inequality, two dynamic event-triggering conditions are established for the proportional-integral observer and proportional-integral-derivative controller, respectively. A dynamic event-triggered proportional-integral observer-based dynamic event-triggered proportional-integral-derivative controller is proposed by combining the sample state and the integral of the weighted sample output estimation error. Under the designed event-triggering conditions, an interval system with upper and lower bounds is introduced. The positivity and ?1-gain stability are achieved by realizing the according properties of the lower and upper bound systems in terms of multiple copositive Lyapunov function, respectively. All gain matrices are designed by a matrix decomposition approach and the corresponding conditions are solved by linear programming. Finally, two examples are provided to illustrate the validity of the results.
This paper presents a co-design control scheme that integrates Distributed Model Predictive control with dynamic quantizers for networked nonlinear continuous systems operating under limited communication bandwidth. E...
详细信息
With the number of prosumers rising quickly, energy sharing markets in distribution networks are emerging. The trading of active power influences the power flow in the distribution network, which will cause voltage fl...
详细信息
作者:
Qiming LiuXinru CuiZhe LiuHesheng WangDepartment of Automation
Shanghai Jiao Tong UniversityShanghai 200240China MoE Key Laboratory of Artificial Intelligence
AI InstituteShanghai Jiao Tong UniversityShanghai 200240China Department of Automation
Key Laboratory of System Control and Information Processing of Ministry of EducationKey Laboratory of Marine Intelligent Equipment and System of Ministry of EducationShanghai Engineering Research Center of Intelligent Control and ManagementShanghai Jiao Tong UniversityShanghai 200240China
Autonomous navigation for intelligent mobile robots has gained significant attention,with a focus on enabling robots to generate reliable policies based on maintenance of spatial *** this paper,we propose a learning-b...
详细信息
Autonomous navigation for intelligent mobile robots has gained significant attention,with a focus on enabling robots to generate reliable policies based on maintenance of spatial *** this paper,we propose a learning-based visual navigation pipeline that uses topological maps as memory *** introduce a unique online topology construction approach that fuses odometry pose estimation and perceptual similarity *** tackles the issues of topological node redundancy and incorrect edge connections,which stem from the distribution gap between the spatial and perceptual ***,we propose a differentiable graph extraction structure,the topology multi-factor transformer(TMFT).This structure utilizes graph neural networks to integrate global memory and incorporates a multi-factor attention mechanism to underscore elements closely related to relevant target cues for policy *** from photorealistic simulations on image-goal navigation tasks highlight the superior navigation performance of our proposed pipeline compared to existing memory *** validation through behavior visualization,interpretability tests,and real-world deployment further underscore the adapt-ability and efficacy of our method.
暂无评论