Relative overgeneralization (RO) occurs in cooperative multi-agent learning tasks when agents converge towards a suboptimal joint policy due to overfitting to suboptimal behaviors of other *** methods have been propos...
详细信息
Relative overgeneralization (RO) occurs in cooperative multi-agent learning tasks when agents converge towards a suboptimal joint policy due to overfitting to suboptimal behaviors of other *** methods have been proposed for addressing RO in multi-agent policy gradient (MAPG) methods although these methods produce state-of-the-art *** address this gap, we propose a general, yet simple, framework to enable optimistic updates in MAPG methods that alleviate the RO *** approach involves clipping the advantage to eliminate negative values, thereby facilitating optimistic updates in *** optimism prevents individual agents from quickly converging to a local ***, we provide a formal analysis to show that the proposed method retains optimality at a fixed *** extensive evaluations on a diverse set of tasks including the Multi-agent MuJoCo and Overcooked benchmarks, our method outperforms strong baselines on 13 out of 19 tested tasks and matches the performance on the rest. Copyright 2024 by the author(s)
The finite/fixed-time stabilization and tracking control is currently a hot field in various systems since the faster convergence can be obtained. By contrast to the asymptotic stability,the finite-time stability poss...
详细信息
The finite/fixed-time stabilization and tracking control is currently a hot field in various systems since the faster convergence can be obtained. By contrast to the asymptotic stability,the finite-time stability possesses the better control performance and disturbance rejection property. Different from the finite-time stability, the fixed-time stability has a faster convergence speed and the upper bound of the settling time can be estimated. Moreover, the convergent time does not rely on the initial *** work aims at presenting an overview of the finite/fixed-time stabilization and tracking control and its applications in engineering systems. Firstly, several fundamental definitions on the finite/fixed-time stability are recalled. Then, the research results on the finite/fixed-time stabilization and tracking control are reviewed in detail and categorized via diverse input signal structures and engineering applications. Finally, some challenging problems needed to be solved are presented.
The rail transit system plays a crucial role in modern *** the increasing demand for clean and green energy in the transport sector,its energy system is expected to achieve low-carbon and highly efficient energy utili...
详细信息
The rail transit system plays a crucial role in modern *** the increasing demand for clean and green energy in the transport sector,its energy system is expected to achieve low-carbon and highly efficient energy utilization in rail ***,the gradual development of the rail transport energy system has led to an increase in its complexity,and the rising difficulty of system assessment has faced the limitations of traditional assessment ***,it is essential to develop effective assessment *** paper begins by providing a systematic review of the development status of Reliability,Availability,Maintainability and Safety(RAMS)assessment and analyzing the shortcomings of traditional RAMS assessment technology in the context of rail transit energy ***,based on the four fundamental properties of RAMS,it summarizes the current state of key assessment technologies in the field of rail ***,the paper delves into the challenges and potential solutions concerning the implementation of RAMS assessment technology for rail transit energy ***,the paper offers an outlook on the future development of RAMS assessment for rail transport energy *** comprehensively analyzing these aspects,the paper aims to contribute valuable insights into optimizing the rail transit energy system,promoting its sustainable and efficient operation in the context of clean and green energy utilization.
Rational planning of battery energy storage system is the key technology to solve the problem of high proportion of new energy consumption and the requirements of high performance power supply. Starting from the multi...
详细信息
1 Quantum information technology Quantum information technology utilizes physical systems at the microscopic level, such as photon, atom, ion, and superconducting, to accomplish information-processing tasks that are i...
详细信息
1 Quantum information technology Quantum information technology utilizes physical systems at the microscopic level, such as photon, atom, ion, and superconducting, to accomplish information-processing tasks that are impossible for the classical macroscopic world. During the past decade, significant process has been achieved in the pursuit of quantum technology into practical applications,generating great research interest from various domains, with the potential to radically change our information infrastructure [1–3].
Dear Editor,This letter studies the problem of sliding mode control(SMC)design for recurrent neural networks(RNNs)with impulsive disturbances and time-varying transmission *** this end,an appropriate integral sliding ...
详细信息
Dear Editor,This letter studies the problem of sliding mode control(SMC)design for recurrent neural networks(RNNs)with impulsive disturbances and time-varying transmission *** this end,an appropriate integral sliding surface function and SMC law are adopted for use under impulsive disturbances and time-varying delays.
Large language models (LLMs) have demonstrated remarkable capacities on various tasks, and integrating the capacities of LLMs into the Internet of Things (IoT) applications has drawn much research attention recently. ...
详细信息
In recent years, Neural Architecture Search (NAS) has marked significant advancements, yet its efficacy is marred by the dependence on substantial computational resources. To mitigate this, the development of NAS benc...
详细信息
This paper presents the (Formula presented.) State-Feedback control for Continuous Semi-Markov Jump Linear Systems where the transition rates are given by the ratio of polynomials of the sojourn time. We show that, fo...
详细信息
In this paper, a neural network time delay prediction method based on phase space reconstruction is presented. This method reconstructs one-dimensional chaotic time series in phase space according to the internal law ...
详细信息
暂无评论