检索结果-内蒙古大学图书馆

Reinforcement learning-based unknown reference tracking control of HMASs with nonidentical communication delays

science China(Information sciences) 2023年第7期66卷 46-57页

作者： Yong XU Zheng-Guang WU Wei-Wei CHE Deyuan MENG School of Automation Beijing Institute of Technology Institute of Cyber-Systems and Control Zhejiang University College of Mathematics and Computer Science Zhejiang Normal University Department of Automation Qingdao University School of Automation Science and Electrical Engineering Beihang University(BUAA)

This paper focuses on the optimal output synchronization control problem of heterogeneous multiagent systems(HMASs) subject to nonidentical communication delays by a reinforcement learning *** with existing studies assuming that the precise model of the leader is globally or distributively accessible to all or some of the followers, the leader's precise dynamical model is entirely inaccessible to all the followers in this paper. A data-based learning algorithm is first proposed to reconstruct the leader's unknown system matrix online. A distributed predictor subject to communication delays is further devised to estimate the leader's state, where interaction delays are allowed to be nonidentical. Then, a learning-based local controller, together with a discounted performance function, is projected to reach the optimal output synchronization. Bellman equations and game algebraic Riccati equations are constructed to learn the optimal solution by developing a model-based reinforcement learning(RL) algorithm online without solving regulator equations, which is followed by a model-free off-policy RL algorithm to relax the requirement of all agents' dynamics faced by the model-based RL algorithm. The optimal tracking control of HMASs subject to unknown leader dynamics and communication delays is shown to be solvable under the proposed RL algorithms. Finally, the effectiveness of theoretical analysis is verified by numerical simulations.

关键词： heterogeneous multiagent systems HMAS reinforcement learning RL optimal output synchronization communication delays

来源：评论

学校读者我要写书评

暂无评论

Data-driven output regulation control for constrained linear systems

引用

science China(Information sciences) 2025年第3期68卷 338-353页

作者： Chaoyu XIA Yi DONG Chaoli WANG Shengyuan XU Shanghai Research Institute for Intelligent Autonomous Systems Tongji University College of Electronic and Information Engineering Shanghai Research Institute for Intelligent Autonomous SystemsTongji University Department of Control Science and Engineering School of Optical-Electrical and Computer EngineeringUniversity of Shanghai for Science and Technology Department of Automation Nanjing University of Science and Technology

This study introduces a data-driven approach for state and output feedback control addressing the constrained output regulation problem in unknown linear discrete-time systems. Our method ensures effective tracking performance while satisfying the state and input constraints, even when system matrices are not available. We first establish a sufficient condition necessary for the existence of a solution pair to the regulator equation and propose a data-based approach to obtain the feedforward and feedback control gains for state feedback control using linear programming. Furthermore, we design a refined Luenberger observer to accurately estimate the system state, while keeping the estimation error within a predefined set. By combining output regulation theory, we develop an output feedback control strategy. The stability of the closed-loop system is rigorously proved to be asymptotically stable by further leveraging the concept of λ-contractive sets.

关键词： output regulation constrained system data-driven.

来源：评论

学校读者我要写书评

暂无评论

On Approximate Opacity of Stochastic control Systems

引用

IEEE Transactions on Automatic control 2024年第6期70卷 3846-3861页

作者： Liu, Siyuan Yin, Xiang Dimarogonas, Dimos V. Zamani, Majid Kth Royal Institute of Technology Division of Decision and Control Systems Stockholm Sweden Shanghai Jiao Tong University and Key Lab of System Control & Information Processing Ministry of Education Department of Automation Shanghai China University of Colorado Boulder Computer Science Department CO80309 United States Ludwig Maximilian University of Munich Computer Science Department Germany

This paper investigates an important class of information-flow security property called opacity for stochastic control systems. Opacity captures whether a system's secret behavior (a subset of the system's behavior that is considered to be critical) can be kept from outside observers. Existing works on opacity for control systems only provide a binary characterization of the system's security level by determining whether the system is opaque or not. In this work, we introduce a quantifiable measure of opacity that considers the likelihood of satisfying opacity for stochastic control systems modeled as general Markov decision processes (gMDPs). We also propose verification methods tailored to the new notions of opacity for finite gMDPs by using value iteration techniques. Then, a new notion called approximate opacity-preserving stochastic simulation relation is proposed, which captures the distance between two systems' behaviors in terms of preserving opacity. Based on this new system relation, we show that one can verify opacity for stochastic control systems using their abstractions (modeled as finite gMDPs). We also discuss how to construct such abstractions for a class of gMDPs under certain stability conditions. © 1963-2012 IEEE.

关键词： Stochastic control systems

来源：评论

学校读者我要写书评

暂无评论

Quality-Relevant Modeling and Monitoring of Industrial Cyber-Physical Systems: The Semi-Supervised Dynamic Latent Variable Models

IEEE Transactions on Industrial Cyber-Physical Systems

引用

IEEE Transactions on Industrial Cyber-Physical Systems 2025年 3卷 39-47页

作者： Zhou, Le Wang, Yaoxin Wu, Yuanqing He, Shenghuang Song, Zhihuan Zhejiang University of Science and Technology Department of Automation and Electrical Engineering Hangzhou310023 China Guangdong University of Technology Department of Automation Guangzhou510006 China Dongguan University of Technology Department of Computer Science and Technology Dongguan523808 China Ningbo Industrial Internet Institute Ningbo315012 China Zhejiang University Department of Control Science and Engineering Hangzhou310027 China

In modern industrial cyber-physical systems, a mass of process variables has been obtained by the high-sampling online sensors. Meanwhile, the key quality indexes are usually obtained infrequently from the laboratory. Hence, these quality variables are with low sampling rate. To avail of the complete process and quality variables with various sampling rates in the dynamic processes, a set of semi-supervised dynamic latent variable models are proposed for dynamic modeling and quality-relevant monitoring. The proposed models have built a unified structure to consider both the auto-correlations and cross-correlations between the process and quality variables with unbalanced sampling sizes. Hence, the feature extraction of the time series data is dynamically adjusted under the guidance of the quality variables. Then, the quality-relevant monitoring schemes are proposed, which is validated by a numerical case and an actual wastewater treatment process. © 2024 IEEE. All rights reserved.

关键词： Indexes Probabilistic logic Kalman filters Noise Cyber-physical systems Analytical models Parameter estimation Time series analysis Process monitoring Numerical models

来源：评论

学校读者我要写书评

暂无评论

From News to Summaries: Building a Hungarian Corpus for Extractive and Abstractive Summarization 30

From News to Summaries: Building a Hungarian Corpus for Extr...

引用

Joint 30th International Conference on Computational Linguistics and 14th International Conference on Language Resources and Evaluation, LREC-COLING 2024

作者： Barta, Botond Lakatos, Dorina Nagy, Attila Nyist, Milán Konor Ács, Judit HUN-REN Institute for Computer Science and Control Hungary Department of Automation and Applied Informatics Budapest University of Technology and Economics Hungary

ISBN: (纸本)9782493814104

Training summarization models requires substantial amounts of training data. However for less resourceful languages like Hungarian, openly available models and datasets are notably scarce. To address this gap our paper introduces HunSum-2 an open-source Hungarian corpus suitable for training abstractive and extractive summarization models. The dataset is assembled from segments of the Common Crawl corpus undergoing thorough cleaning, preprocessing and deduplication. In addition to abstractive summarization we generate sentence-level labels for extractive summarization using sentence similarity. We train baseline models for both extractive and abstractive summarization using the collected dataset. To demonstrate the effectiveness of the trained models, we perform both quantitative and qualitative evaluation. Our dataset, models and code are publicly available, encouraging replication, further research, and real-world applications across various domains. © 2024 ELRA Language Resource Association: CC BY-NC 4.0.

关键词： abstractive summarization extractive summarization Hungarian

来源：评论

学校读者我要写书评

暂无评论

Interference Suppression and Jitter Elimination Ability-Based Adaption Tracking Guidance for Robotic Fishes

引用

IEEE/CAA Journal of Automatica Sinica 2025年第1期12卷 126-137页

作者： Dongfang Li Jie Huang Rob Law Xin Xu Limin Zhu Edmond Q.Wu IEEE the School of Electrical Engineering and Automation Fuzhou University the Key Laboratory of System Control and Information Processing Ministry of Education the School of Electrical Engineering and Automation Fuzhou University 5G+Industrial Internet Institute Fuzhou University the University of Macau the College of Mechatronics and Automation National University of Defense Technology the School of Mechanical Engineering Shanghai Jiao Tong University the Department of Computer Science and Engineering Shanghai Jiao Tong University

This work presents an adaptive tracking guidance method for robotic fishes. The scheme enables robots to suppress external interference and eliminate motion jitter. An adaptive integral surge line-of-sight guidance rule is designed to eliminate dynamics interference and sideslip issues. Limited-time yaw and surge speed observers are reported to fit disturbance variables in the model. The approximation values can compensate for the system's control input and improve the robots' tracking ***, this work develops a terminal sliding mode controller and third-order differential processor to determine the rotational torque and reduce the robots' run jitter. Then, Lyapunov's theory proves the uniform ultimate boundedness of the proposed method. Simulation and physical experiments confirm that the technology improves the tracking error convergence speed and stability of robotic fishes.

关键词： Adaptive integral surge line-of-sight approximation robotic fish speed observer

来源：评论

学校读者我要写书评

暂无评论

Unleashing the Potential of Knowledge Distillation for IoT Traffic Classification

IEEE Transactions on Machine Learning in Communications and ...

引用

IEEE Transactions on Machine Learning in Communications and Networking 2024年 2卷 221-239页

作者： Abbasi, Mahmoud Shahraki, Amin Prieto, Javier Arrieta, Angelica Gonzalez Corchado, Juan M. University of Salamanca BISITE Research Group Salamanca37007 Spain University of Oslo Department of Informatics Oslo0373 Norway University of Salamanca Department of Computer Science and Automation Control Salamanca37007 Spain

The Internet of Things (IoT) has revolutionized our lives by generating large amounts of data, however, the data needs to be collected, processed, and analyzed in real-time. Network Traffic Classification (NTC) in IoT is a crucial step for optimizing network performance, enhancing security, and improving user experience. Different methods are introduced for NTC, but recently Machine Learning solutions have received high attention in this field, however, Traditional Machine Learning (ML) methods struggle with the complexity and heterogeneity of IoT traffic, as well as the limited resources of IoT devices. Deep learning shows promise but is computationally intensive for resource-constrained IoT devices. Knowledge distillation is a solution to help ML by compressing complex models into smaller ones suitable for IoT devices. In this paper, we examine the use of knowledge distillation for IoT traffic classification. Through experiments, we show that the student model achieves a balance between accuracy and efficiency. It exhibits similar accuracy to the larger teacher model while maintaining a smaller size. This makes it a suitable alternative for resource-constrained scenarios like mobile or IoT traffic classification. We find that the knowledge distillation technique effectively transfers knowledge from the teacher model to the student model, even with reduced training data. The results also demonstrate the robustness of the approach, as the student model performs well even with the removal of certain classes. Additionally, we highlight the trade-off between model capacity and computational cost, suggesting that increasing model size beyond a certain point may not be beneficial. The findings emphasize the value of soft labels in training student models with limited data resources. © 2023 CCBY.

关键词： Distillation

来源：评论

学校读者我要写书评

暂无评论

Prescribed-Time Nash Equilibrium Seeking for Pursuit-Evasion Game

引用

IEEE/CAA Journal of Automatica Sinica 2024年第6期11卷 1518-1520页

作者： Lei Xue Jianfeng Ye Yongbao Wu Jian Liu D.C.Wunsch Key Laboratory of Measurement and Control of Complex Systems of Engineering Ministry of EducationNanjing 210096 School of Automation Southeast UniversityNanjing 210096China Department of Electrical and Computer Engineering Missouri University of Science and TechnologyRollaMO 65409 USA IEEE

Dear Editor,This letter is concerned with prescribed-time Nash equilibrium(PTNE)seeking problem in a pursuit-evasion game(PEG)involving agents with second-order *** order to achieve the prior-given and user-defined convergence time for the PEG,a PTNE seeking algorithm has been developed to facilitate collaboration among multiple pursuers for capturing the evader without the need for any global ***,it is theoretically proved that the prescribedtime convergence of the designed algorithm for achieving Nash equilibrium of ***,the effectiveness of the PTNE method was validated by numerical simulation results.A PEG consists of two groups of agents:evaders and *** pursuers aim to capture the evaders through cooperative efforts,while the evaders strive to evade *** is a classic noncooperative *** has attracted plenty of attention due to its wide application scenarios,such as smart grids[1],formation control[2],[3],and spacecraft rendezvous[4].It is noteworthy that most previous research on seeking the Nash equilibrium of the game,where no agent has an incentive to change its actions,has focused on asymptotic and exponential convergence[5]-[7].

关键词： seeking prescribed convergence

来源：评论

学校读者我要写书评

暂无评论

Safe Reinforcement Learning for Constrained Markov Decision Processes with Stochastic Stopping Time 63

Safe Reinforcement Learning for Constrained Markov Decision ...

引用

63rd IEEE Conference on Decision and control, CDC 2024

作者： Mazumdar, Abhijit Wisniewski, Rafal Bujorianu, Manuela L. Aalborg University Section of Automation & Control Aalborg East9220 Denmark University College London Department of Computer Science United Kingdom

ISBN: (纸本)9798350316339

In this paper, we present an online reinforcement learning algorithm for constrained Markov decision processes with a safety constraint. Despite the necessary attention of the scientific community, considering stochastic stopping time, the problem of learning optimal policy without violating safety constraints during the learning phase is yet to be addressed. To this end, we propose an algorithm based on linear programming that does not require a process model. We show that the learned policy is safe with high confidence. We also propose a method to compute a safe baseline policy, which is central in developing algorithms that do not violate the safety constraints. Finally, we provide simulation results to show the efficacy of the proposed algorithm. Further, we demonstrate that efficient exploration can be achieved by defining a subset of the state-space called proxy set. © 2024 IEEE.

关键词： Contrastive Learning

来源：评论

学校读者我要写书评

暂无评论

Single-channel ultrahigh frequency moisture meter with direct measurement of moisture content of bulk materials

Single-channel ultrahigh frequency moisture meter with direc...

引用

2024 International Conference Automatics and Informatics, ICAI 2024

作者： Kalandarov, Palvan I. Ubaydullayeva, Shakhnoza R. Ismailov, Mirhalil A. Nikolov, Nikola N. Alexandrova, Mariela Tashkent Inst. of Irrigation and Agricultural Mechanization Engineers National Research University Department "Automation and Control of Technology Process in Production" Tashkent Uzbekistan Faculty of Computer Science and Automation Technical University of Varna Department of Automation Varna Bulgaria

ISBN: (纸本)9798350353907

The article discusses the theoretical foundations of the design of a single-channel ultrahigh frequency moisture meter with direct measurement of the moisture content of bulk materials. In accordance with the requirements of the methodology for selecting efficiency criteria, it is necessary to develop structural diagrams of measuring devices. For these purposes, the standard deviation of the random error is determined, characterizing the accuracy. It includes the main components: sensitivity error, zero error and additive component. Mathematical models of structures are constructed and the standard deviation of random errors, which are caused by certain parameters and additive fluctuations, is calculated. A single-parameter ultrahigh-frequency method for determining the moisture content is proposed. This method provides high accuracy of a single-channel ultrahigh frequency moisture meter with direct measurement of the moisture content of bulk materials. The measuring device can be used in the agricultural industry, where humidity is one of the important parameters, starting with harvesting and ending with the release of finished products. © 2024 IEEE.

关键词： Moisture meters

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：