检索结果-内蒙古大学图书馆

60th Annual Allerton Conference on Communication, control, and Computing, Allerton 2024

作者： Krishnamurthy, Vikram Rojas, Cristian R. School of Electrical and Computer Engineering Cornell University United States School of Electrical Engineering and Computer Science KTH Royal Institute of Technology Division of Decision and Control Systems Stockholm Sweden

ISBN: (纸本)9798331541033

We consider word-of-mouth social learning involving m Kalman filter agents that operate sequentially. The first Kalman filter receives the raw observations, while each subsequent Kalman filter receives a noisy measurement of the conditional mean of the previous Kalman filter. The prior is updated by the m-th Kalman filter. When m=2, and the observations are noisy measurements of a Gaussian random variable, the covariance goes to zero as k-1/3 for 7kappa;observations, instead of O(k-1) in the standard Kalman filter. In this paper we prove that for m agents, the covariance decreases to zero as k-(2m-1), i.e, the learning slows down exponentially with the number of agents. We also show that by artificially weighing the prior at each time, the learning rate can be made optimal as k-1. The implication is that in word-of-mouth social learning, artificially re-weighing the prior can yield the optimal learning rate. © 2024 IEEE.

关键词： Kalman filters

来源：评论

学校读者我要写书评

暂无评论

Koopman-based Deep Learning for Nonlinear System Estimation 63

Koopman-based Deep Learning for Nonlinear System Estimation

引用

63rd IEEE Conference on Decision and control, CDC 2024

作者： Sun, Zexin Chen, Mingyu Baillieul, John Boston University Division of Systems Engineering United States Boston University Department of Electrical and Computer Engineering United States Boston University Division of Systems Engineering Departments of Mechanical Engineering Electrical and Computer Engineering BostonMA02215 United States

ISBN: (纸本)9798350316339

Nonlinear differential equations are encountered as models of fluid flow, spiking neurons, and many other systems of interest in the real world. Common features of these systems are that their behaviors are difficult to describe exactly and invariably unmodeled dynamics present challenges in making precise predictions. In this paper, we present a novel data-driven linear estimator based on Koopman operator theory to extract meaningful finite-dimensional representations of complex non-linear systems. The Koopman model is used together with deep reinforcement networks that learn the optimal stepwise actions to predict future states of nonlinear systems. Our estimator is also adaptive to a diffeomorphic transformation of the estimated nonlinear system, which enables it to compute optimal state estimates without re-learning. © 2024 IEEE.

关键词： Deep reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

Defense for Advanced Persistent Threat with Inadvertent and Malicious Insider Threats

引用

无人系统（英文） 2024年第2期12卷 341-358页

作者： Ziqin Chen Guanpu Chen Yiguang Hong Department of Control Science and Engineering and Shanghai Research Institute for Intelligent Autonomous Systems Tongji UniversityShanghaiP.R.China School of Electrical Engineering and Computer Science KTH Royal Institute of TechnologyStockholmSweden

In this paper,we propose a game theory framework to solve advanced persistent threat problems,especially considering two types of insider threats:malicious and *** this framework,we establish a unified three-player game model and derive Nash equilibria in response to different types of insider *** analyzing these Nash equilibria,we provide quantitative solutions to advanced persistent threat problems pertaining to insider ***,we have conducted a comparative assessment of the optimal defense strategy and corresponding defender's costs between two types of insider ***,our findings advocate a more proactive defense strategy against inadvertent insider threats in contrast to malicious ones,despite the latter imposing a higher burden on the *** theoretical results are substantiated by numerical results,which additionally include a detailed exploration of the conditions under which different insiders adopt risky *** conditions can serve as guiding indicators for the defender when calibrating their monitoring intensities and devising defensive strategies.

关键词： Security game advanced persistent threat insider threats Nash equilibrium

来源：评论

学校读者我要写书评

暂无评论

Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning 41

Temporal Logic Specification-Conditioned Decision Transforme...

引用

41st International Conference on Machine Learning, ICML 2024

作者： Guo, Zijian Zhou, Weichao Li, Wenchao Division of Systems Engineering Boston University United States Department of Electrical and Computer Engineering Boston University United States

Offline safe reinforcement learning (RL) aims to train a constraint satisfaction policy from a fixed dataset. Current state-of-the-art approaches are based on supervised learning with a conditioned policy. However, these approaches fall short in real-world applications that involve complex tasks with rich temporal and logical structures. In this paper, we propose temporal logic Specification-conditioned Decision Transformer (SDT), a novel framework that harnesses the expressive power of signal temporal logic (STL) to specify complex temporal rules that an agent should follow and the sequential modeling capability of Decision Transformer (DT). Empirical evaluations on the DSRL benchmarks demonstrate the better capacity of SDT in learning safe and high-reward policies compared with existing approaches. In addition, SDT shows good alignment with respect to different desired degrees of satisfaction of the STL specification that it is conditioned on. Copyright 2024 by the author(s)

关键词： Reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

Prescribed-Time Nash Equilibrium Seeking for Pursuit-Evasion Game

引用

IEEE/CAA Journal of Automatica Sinica 2024年第6期11卷 1518-1520页

作者： Lei Xue Jianfeng Ye Yongbao Wu Jian Liu D.C.Wunsch Key Laboratory of Measurement and Control of Complex Systems of Engineering Ministry of EducationNanjing 210096 School of Automation Southeast UniversityNanjing 210096China Department of Electrical and Computer Engineering Missouri University of Science and TechnologyRollaMO 65409 USA IEEE

Dear Editor,This letter is concerned with prescribed-time Nash equilibrium(PTNE)seeking problem in a pursuit-evasion game(PEG)involving agents with second-order *** order to achieve the prior-given and user-defined convergence time for the PEG,a PTNE seeking algorithm has been developed to facilitate collaboration among multiple pursuers for capturing the evader without the need for any global ***,it is theoretically proved that the prescribedtime convergence of the designed algorithm for achieving Nash equilibrium of ***,the effectiveness of the PTNE method was validated by numerical simulation results.A PEG consists of two groups of agents:evaders and *** pursuers aim to capture the evaders through cooperative efforts,while the evaders strive to evade *** is a classic noncooperative *** has attracted plenty of attention due to its wide application scenarios,such as smart grids[1],formation control[2],[3],and spacecraft rendezvous[4].It is noteworthy that most previous research on seeking the Nash equilibrium of the game,where no agent has an incentive to change its actions,has focused on asymptotic and exponential convergence[5]-[7].

关键词： seeking prescribed convergence

来源：评论

学校读者我要写书评

暂无评论

On final opinions of the Friedkin-Johnsen model over random graphs with partially stubborn community 63

On final opinions of the Friedkin-Johnsen model over random ...

引用

63rd IEEE Conference on Decision and control, CDC 2024

作者： Wang, Lingfei Xing, Yu Johansson, Karl H. Kth Royal Institute of Technology and with Digital Futures Division of Decision and Control Systems School of Electrical Engineering and Computer Science Stockholm Sweden

ISBN: (纸本)9798350316339

This paper studies the formation of final opinions for the Friedkin-Johnsen (FJ) model with a community of partially stubborn agents. The underlying network of the FJ model is symmetric and generated from a random graph model, in which each link is added independently from a Bernoulli distribution. It is shown that the final opinions of the FJ model will concentrate around those of an FJ model over the expected graph as the network size grows, on the condition that the stubborn agents are well connected to other agents. Probability bounds are proposed for the distance between these two final opinion vectors, respectively for the cases where there exist non-stubborn agents or not. Numerical experiments are provided to illustrate the theoretical findings. The simulation shows that, in presence of non-stubborn agents, the link probability between the stubborn and the non-stubborn communities affect the distance between the two final opinion vectors significantly. Additionally, if all agents are stubborn, the opinion distance decreases with the agent stubbornness. © 2024 IEEE.

关键词： Graph theory

来源：评论

学校读者我要写书评

暂无评论

Improving Power Response of Single-controllable VSG-Based Active Distribution Network

引用

CSEE Journal of Power and Energy systems 2025年第1期11卷 102-114页

作者： Shah Fahad Arman Goudarzi Pierluigi Siano Ji Xiang Department of Electrical and Computer Engineering Missouri University of Science and TechnologyUSA University of Victoria Canada Department of Management&Innovation Systems University of SalernoItaly Department of Electrical and Electronic Engineering Science University of JohannesburgJohannesburg2006South Africa National Laboratory of Industrial Control Technology College of Electrical EngineeringZhejiang UniversityHangzhou310058China

controlling an active distribution network(ADN)from a single PCC has been advantageous for improving the performance of coordinated Intermittent RESs(IRESs).Recent studies have proposed a constant PQ regulation approach at the PCC of ADNs using coordination of non-MPPT based ***,due to the intermittent nature of DGs coupled with PCC through uni-directional broadcast communication,the PCC becomes vulnerable to transient *** address this challenge,this study first presents a detailed mathematical model of an ADN from the perspective of PCC regulation to realize rigidness of PCC against ***,an H_(∞)controller is formulated and employed to achieve optimal performance against disturbances,consequently,ensuring the least oscillations during transients at ***,an eigenvalue analysis is presented to analyze convergence speed limitations of the newly derived system ***,simulation results show the proposed method offers superior performance as compared to the state-of-the-art methods.

关键词： Cooperative control of DGs hierarchical control system high penetration of intermittent RESs H_(∞)control optimal control in power systems single-controllable active distribution networks

来源：评论

学校读者我要写书评

暂无评论

Efficient Safety Verification of Discrete Event systems 19

Efficient Safety Verification of Discrete Event Systems

引用

19th IEEE International Conference on Automation Science and engineering, CASE 2023

作者： Lennartson, Bengt Chalmers University of Technology Division of Systems and Control Department of Electrical Engineering Gothenburg Sweden

ISBN: (纸本)9798350320695

Formal verification of safety properties is critical in many application areas. In this paper a survey of the most common and efficient methods is given. The different methods are compared for some typical scalable examples, both modeled by transition systems and Petri nets. The methods are based on symbolic computations and logic representations of both sets and transition relations. Reachability analysis is performed not only from the initial states to any forbidden states, but also in the opposite backward direction. The reason is that a system is also safe when no initial state is backward reachable from the set of forbidden states. For safe systems the number of backward reachable states from forbidden states is often much less than the forward reachable states from the initial states. The backward strategy is shown to be very efficient and often even better than the most efficient methods that are specialized on safety analysis. © 2023 IEEE.

关键词： Discrete event simulation

来源：评论

学校读者我要写书评

暂无评论

Statistical Learning Theory for control: A Finite-Sample Perspective

引用

IEEE control systems 2023年第6期43卷 67-97页

作者： Tsiamis, Anastasios Ziemann, Ingvar Matni, Nikolai Pappas, George J. ETH Zürich Department of Information Technology and Electrical Engineering Zürich8092 Switzerland KTH Royal Institute of Technology Division of Decision and Control Systems Stockholm Sweden University of Pennsylvania Department of Electrical and Systems Engineering PhiladelphiaPA19104 United States

Learning algorithms have become an integral component to modern engineering solutions. Examples range from self-driving cars and recommender systems to finance and even critical infrastructure, many of which are typically under the purview of control theory. While these algorithms have already shown tremendous promise in certain applications [1], there are considerable challenges, in particular, with respect to guaranteeing safety and gauging fundamental limits of operation. Thus, as we integrate tools from machine learning into our systems, we also require an integrated theoretical understanding of how they operate in the presence of dynamic and system-theoretic phenomena. Over the past few years, intense efforts toward this goal - an integrated theoretical understanding of learning, dynamics, and control - have been made. While much work remains to be done, a relatively clear and complete picture has begun to emerge for (fully observed) linear dynamical systems. These systems already allow for reasoning about concrete failure modes, thus helping to indicate a path forward. Moreover, while simple at a glance, these systems can be challenging to analyze. Recently, a host of methods from learning theory and high-dimensional statistics, not typically in the control-theoretic toolbox, have been introduced to our community. This tutorial survey serves as an introduction to these results for learning in the context of unknown linear dynamical systems (see 'Summary'). We review the current state of the art and emphasize which tools are needed to arrive at these results. Our focus is on characterizing the sample efficiency and fundamental limits of learning algorithms. Along the way, we also delineate a number of open problems. More concretely, this article is structured as follows. We begin by revisiting recent advances in the finite-sample analysis of system identification. Next, we discuss how these finite-sample bounds can be used downstream to give guaranteed performa

关键词： Learning algorithms

来源：评论

学校读者我要写书评

暂无评论

Differentially Private Online Federated Learning with Correlated Noise 63

Differentially Private Online Federated Learning with Correl...

引用

63rd IEEE Conference on Decision and control, CDC 2024

作者： Zhang, Jiaojiao Zhu, Linglingzhi Johansson, Mikael Kth Royal Institute of Technology Division of Decision and Control Systems School of Electrical Engineering and Computer Science StockholmSE-100 44 Sweden Chinese University of Hong Kong Department of Systems Engineering and Engineering Management Hong Kong

ISBN: (纸本)9798350316339

We introduce a novel differentially private algorithm for online federated learning that employs temporally correlated noise to enhance utility while ensuring privacy of continuously released models. To address challenges posed by DP noise and local updates with streaming non-iid data, we develop a perturbed iterate analysis to control the impact of the DP noise on the utility. Moreover, we demonstrate how the drift errors from local updates can be effectively managed under a quasi-strong convexity condition. Subject to an (, δ) DP budget, we establish a dynamic regret bound over the entire time horizon, quantifying the impact of key parameters and the intensity of changes in dynamic environments. Numerical experiments confirm the efficacy of the proposed algorithm. © 2024 IEEE.

关键词： Federated learning

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：