检索结果-内蒙古大学图书馆

On-Policy and Off-Policy Value Iteration Algorithms for Stochastic Zero-Sum Dynamic Games

Journal of Systems Science & Complexity 2025年第1期38卷 421-435页

作者： GUO Liangyuan WANG Bing-Chang ZHANG Ji-Feng School of Control Science and Engineering Shandong UniversityJinan 250063China School of Automation and Electrical Engineering Zhongyuan University of TechnologyZhengzhou 450007China Key Laboratory of Systems and Control Academy of Mathematics and Systems ScienceChinese Academy of SciencesBeijing 100190China

This paper considers the value iteration algorithms of stochastic zero-sum linear quadratic games with unkown ***-policy and off-policy learning algorithms are developed to solve the stochastic zero-sum games,where the system dynamics is not *** analyzing the value function iterations,the convergence of the model-based algorithm is *** equivalence of several types of value iteration algorithms is *** effectiveness of model-free algorithms is demonstrated by a numerical example.

关键词： Approximate dynamic programming on-policy off-policy stochastic zero-sum games valueiteration

来源：评论

学校读者我要写书评

暂无评论

Security control for air-sea heterogeneous multiagent systems with cooperative-antagonistic interactions:An intermittent privacy preservation mechanism

引用

Science China(Technological Sciences) 2025年第4期68卷 179-192页

作者： Shoufeng YANG Hongjing LIANG Yingnan PAN Tieshan LI College of Control Science and Engineering Bohai UniversityJinzhou121013China School of Automation Engineering University of Electronic Science and Technology of ChinaChengdu611731China Laboratory of Electromagnetic Space Cognition and Intelligent Control Beijing100089China

antagonistic interactions,data security and transient-steady state performance of the system are two key *** ensure data security,an intermittent privacy preservation(IPP)mechanism is proposed for the first time.A novel setting time initial mask function and a novel intermittent mask function are *** can implement intermittent preservation for the system according to actual requirements,which solves the irreversibility problem after conventional mask disappears and balances control accuracy and system *** ensure transient-steady state performance,a novel error transformation function(ETF)is proposed and integrated into the predefined-time prescribed performance control *** to conventional hyperbolic tangent type ETFs,the proposed ETF can improve the convergence accuracy of errors under the same ***,a unified model of the air-sea HMASs is established,which improves the model accuracy compared with the simplified ***,the proposed IPP security control strategy is applied to the air-sea delivery mission to verify its feasibility and effectiveness.

关键词： air-sea heterogeneous multiagent systems(HMASs) security control intermittent privacy preservation(IPP) error transformation function(ETF)

来源：评论

学校读者我要写书评

暂无评论

Gait Recognition Under Different Clothing Conditions Via Deterministic Learning

引用

IEEE/CAA Journal of Automatica Sinica 2024年第6期11卷 1530-1532页

作者： Muqing Deng Cong Wang School of Automation Guangdong University of TechnologyGuangzhou 510006China School of Control Science and Engineering Shandong UniversityJinan 250100China

Dear Editor,This letter deals with the robustness problem of gait recognition method against maximum number of clothing *** selecting four kinds of time-varying silhouette features,gait dynamics underlying different individuals’gait features is effectively modeled by radial basis function(RBF)neural networks through deterministic *** kind of dynamics information has little sensitivity to the variance between gait patterns under different clothing *** order to eliminate the effect of clothing differences,the training patterns under different clothing conditions further constitute a uniform training dataset,containing all kinds of gait dynamics under different clothing conditions.A rapid recognition scheme is presented on published gait *** experiments demonstrate the efficacy of the proposed method.

关键词： networks neural individual

来源：评论

学校读者我要写书评

暂无评论

Coalition formation problem: a capability-centric analysis and general model

引用

Science China(Information Sciences) 2024年第11期67卷 180-193页

作者： Jie CHEN Miao GUO Bin XIN Qing WANG Shengyu LU Yipeng WANG Yulong DING School of Automation Beijing Institute of Technology National Key Lab of Autonomous Intelligent Unmanned Systems Department of Control Science and Engineering Tongji University

Coalition formation(CF) refers to reasonably organizing robots and/or humans to form coalitions that can satisfy mission requirements, attracting more and more attention in many fields such as multirobot collaboration and human-robot collaboration. However, the analysis on CF problems remains *** provide a valuable study reference for researchers interested in CF, this paper proposed a capabilitycentric analysis of the CF problem. The key problem elements of CF are firstly extracted by referencing the concepts of the 5W1H method. That is, objects(who) form coalitions(what) to accomplish missions(why) by aggregating capabilities(how) in a specific environment(where-when). Then, a multi-view analysis of these elements and their correlation in terms of capabilities is proposed through various logic diagrams, structure charts, etc. Finally, to facilitate a deeper understanding of capability-centric CF, a general mathematical model is constructed, demonstrating how the different concepts discussed in this analysis contribute to the overall model.

关键词： coalition formation capability aggregation capability metric mission requirement environmental effect

来源：评论

学校读者我要写书评

暂无评论

π-phase-shifted two-plus-one method for non-diffuse surface

引用

Chinese Optics Letters 2023年第10期21卷 36-43页

作者：王建华杨延西徐鹏 School of Information and Control Engineering Qingdao University of TechnologyQingdao 266520China School of Automation and Information Engineering Xi’an University of TechnologyXi’an 710048China

We propose a method for reconstructing non-diffuse surfaces based on theπ-phase-shifted two-plus-one phase-shifting ***,we introduce a 2fH+a+2fM+2f_(L)method for unwrapped phase ***,we introduce a new set ofπ-phase-shifted 2fH+a/2+2fM+2f_(L)fringe patterns with halved background *** saturated pixels will be replaced with the unsaturated pixels in theπ-phase-shifted fringe ***,we analyze eight fringe replacement cases and give the corresponding phase calculation,and further give the general *** confirm that the sum of the phase error of the proposed method is 81.4%lower than that of the traditional method,and 61.5%lower than that of the adaptive fringe projection method.

关键词： non-diffusion π-phase shift two-plus-one phase shift

来源：评论

学校读者我要写书评

暂无评论

Parallel System Based Quantitative Assessment and Self-evolution for Artificial Intelligence of Active Power Corrective control

引用

CSEE Journal of Power and Energy Systems 2024年第1期10卷 13-28页

作者： Tianyun Zhang Jun Zhang Feiyue Wang Peidong Xu Tianlu Gao Haoran Zhang Ruiqi Si School of Electrical Engineering and Automation Wuhan UniversityWuhan 430072HubeiChina State Key Laboratory for Management and Control of Complex System Institute of AutomationChinese Academy of SciencesBeijing 100190China

In artificial intelligence(AI)based-complex power system management and control technology,one of the urgent tasks is to evaluate AI intelligence and invent a way of autonomous intelligence ***,there is,currently,nearly no standard technical framework for objective and quantitative intelligence *** this article,based on a parallel system framework,a method is established to objectively and quantitatively assess the intelligence level of an AI agent for active power corrective control of modern power systems,by resorting to human intelligence evaluation *** this basis,this article puts forward an AI self-evolution method based on intelligence assessment through embedding a quantitative intelligence assessment method into automated reinforcement learning(AutoRL)systems.A parallel system based quantitative assessment and self-evolution(PLASE)system for power grid corrective control AI is thereby constructed,taking Bayesian Optimization as the measure of AI evolution to fulfill autonomous evolution of AI under guidance of their intelligence assessment *** results exemplified in the power grid corrective control AI agent show the PLASE system can reliably and quantitatively assess the intelligence level of the power grid corrective control agent,and it could promote evolution of the power grid corrective control agent under guidance of intelligence assessment results,effectively,as well as intuitively improving its intelligence level through selfevolution.

关键词： AI quantitative intelligence assessment and self-evolution automated reinforcement learning Bayesian optimization corrective control parallel system

来源：评论

学校读者我要写书评

暂无评论

Event-triggered predefined-time control for full-state constrained nonlinear systems: A novel command filtering error compensation method

引用

Science China(Technological Sciences) 2024年第9期67卷 2867-2880页

作者： PAN YingNan CHEN YiLin LIANG HongJing College of Control Science and Engineering Bohai UniversityJinzhou 121013China School of Automation Engineering University of Electronic Science and Technology of ChinaChengdu 611731China Laboratory of Electromagnetic Space Cognition and Intelligent Control Beijing 100089China

In this paper, a command filter-based adaptive fuzzy predefined-time event-triggered tracking control problem is investigated for uncertain nonlinear systems with time-varying full-state constraints. By designing a sliding mode differentiator, the inherent computational complexity problem within the predefined-time backstepping framework is solved. Different from the existing command filter-based finite-time and fixed-time control strategies that the convergence time of the filtering error is adjusted through the system initial value or numerous parameters, a novel command filtering error compensation method is presented,which tunes one control parameter to make the filtering error converge in the predefined time, thereby reducing the complexity of design and analysis of processing the filtering error. Then, an improved event-triggered mechanism(ETM) that builds upon the switching threshold strategy, in which an inverse cotangent function is designed to replace the residual term of the ETM,is proposed to gradually release the controller's dependence on the residual term with increasing time. Furthermore, a tan-type nonlinear mapping technique is applied to tackle the time-varying full-state constraints problem. By the predefined-time stability theory, all signals in the uncertain nonlinear systems exhibit predefined-time stability. Finally, the feasibility of the proposed algorithm is substantiated through two simulation results.

关键词： predefined-time control command filtering error compensation method event-triggered mechanism time-varying full-state constraints uncertain nonlinear systems

来源：评论

学校读者我要写书评

暂无评论

Sensorless control of PMSM Drives Using Octant Newton-Raphson Method and Reduced-Order EKF Considering Speed Reversal and Noise

引用

IEEE Transactions on Power Electronics 2025年第8期40卷 10793-10803页

作者： Xiang, Fuyuan Yu, Ke Chen, Congyan Li, Shihua Southeast University School of Automation and the Key Laboratory of Measurement and Control of Complex Systems of Engineering Ministry of Education Nanjing210096 China

For back electromotive force-based sensorless control in permanent magnet synchronous motor (PMSM) drives, the conventional phase-locked loop (PLL) exhibits a ±π position estimation error during speed reversal. This paper presents an octant Newton-Raphson method (ONRM) based PLL, which employs a newly defined objective function and a position octant method. It not only ensures the position estimation converges correctly during speed reversal but also improves dynamic performance compared to the conventional PLL. The applicability condition of the ONRM in the presence of noise is also analyzed in detail. Subsequently, due to the noise sensitivity of the ONRM, a reduced-order extended Kalman filter is designed to filter out the non-Gaussian noise in the position estimation while simultaneously estimating the speed. The effectiveness of the proposed sensorless control scheme is verified on a 1.5 kW PMSM drive platform. © 1986-2012 IEEE.

关键词： Extended Kalman filters

来源：评论

学校读者我要写书评

暂无评论

A Degeneracy-Aware LiDAR-Inertial-Visual SLAM for Degenerate Environments

引用

IEEE Sensors Journal 2025年第11期25卷 19376-19385页

作者： Zhu, Hongwei Zhang, Guobao Huang, Yongming Southeast University School of Automation Key Laboratory of Measurement and Control of Complex Systems of Engineering Ministry of Education Nanjing210096 China

Different sensors may experience degeneracy in specific scenarios, which can lead to failures in multi-sensor fusion optimization. To address the challenges of localization and mapping under such degeneracy conditions, this paper proposes a LiDAR-Inertial-Visual SLAM system with integrated degeneracy awareness. In the frontend, the system utilizes the ESIKF for tracking and incorporates a sensor degeneracy detection method to ensure that degraded sensor data do not affect the state error update. In the backend, we replace the conventional point cloud map with an adaptive voxel map to accelerate the retrieval of planar point cloud, and merge LiDAR planar features within a sliding window via region growing algorithm. Furthermore, the point-to-plane distance formula is optimized via the Rayleigh quotient theorem, which reduces the dimensionality of the optimization variables and simplifies the optimization process. Finally, we evaluate the proposed system on several challenging datasets and compare it with various state-of-the-art sensor fusion systems. The experimental results demonstrate that the proposed system achieves high localization accuracy in the majority of challenging scenarios. © 2001-2012 IEEE.

关键词： Inertial navigation systems

来源：评论

学校读者我要写书评

暂无评论

Regional Multi-Agent Cooperative Reinforcement Learning for City-Level Traffic Grid Signal control

引用

IEEE/CAA Journal of Automatica Sinica 2024年第9期11卷 1987-1998页

作者： Yisha Li Ya Zhang Xinde Li Changyin Sun School of Automation Southeast UniversityNanjing 210096 Key Laboratory of Measurement and Control of Complex Systems of Engineering Ministry of EducationSoutheast UniversityNanjing 210096China

This article studies the effective traffic signal control problem of multiple intersections in a city-level traffic system.A novel regional multi-agent cooperative reinforcement learning algorithm called RegionSTLight is proposed to improve the traffic *** a regional multi-agent Q-learning framework is proposed,which can equivalently decompose the global Q value of the traffic system into the local values of several regions Based on the framework and the idea of human-machine cooperation,a dynamic zoning method is designed to divide the traffic network into several strong-coupled regions according to realtime traffic flow *** order to achieve better cooperation inside each region,a lightweight spatio-temporal fusion feature extraction network is *** experiments in synthetic real-world and city-level scenarios show that the proposed RegionS TLight converges more quickly,is more stable,and obtains better asymptotic performance compared to state-of-theart models.

关键词： Human-machine cooperation mixed domain attention mechanism multi-agent reinforcement learning spatio-temporal feature traffic signal control

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：