Reinforcement learning algorithms are central to the cognition and decision-making of embodied intelligent agents. A bilevel optimization(BO) modeling approach, along with a host of efficient BO algorithms, has been p...
详细信息
Reinforcement learning algorithms are central to the cognition and decision-making of embodied intelligent agents. A bilevel optimization(BO) modeling approach, along with a host of efficient BO algorithms, has been proven to be an effective means of addressing actor-critic(AC) policy optimization problems. In this work, based on a bilevelstructured AC problem model, an implicit zeroth-order stochastic algorithm is developed. A locally randomized spherical smoothing technique, which can be applied to nonsmooth nonconvex implicit AC formulations and avoid the closed-form lower-level mapping, is introduced. In the proposed zeroth-order scheme, the gradient of the implicit function can be approximated through inexact lower-level value estimations that are practically available. Under suitable assumptions,the algorithmic framework designed for the bilevel AC method is characterized by convergence guarantees under a fixed stepsize and smoothing parameter. Moreover, the proposed algorithm is equipped with the overall iteration complexity of O(n2L20 L20?-1). The convergence performance of the proposed algorithm is verified through numerical simulations.
The Hydro-Viscous Drive(HVD)speed regulating system finds extensive application in air transport transmission systems to regulate the stepless speed or conduct overload ***,its intrinsic hysteretic behaviors,such as t...
详细信息
The Hydro-Viscous Drive(HVD)speed regulating system finds extensive application in air transport transmission systems to regulate the stepless speed or conduct overload ***,its intrinsic hysteretic behaviors,such as the asymmetric hysteretic and dead zone,could introduce inaccuracy and delay in control applications,posing challenges to system *** paper investigates a Nonlinear Hysteresis Compensation Control(NHCC)that consists of two parts to control the HVD output speed by operating the valve under different engine operating *** the first part,the Inverse Hysteresis Compensator(IHC)based on major loop data is introduced for the asymmetric hysteresis characterization and compensation of the HVD speed control system of the power generation and distribution,which aims to reduce the hysteresis and dead zone effect and expand the effective input *** the second part,the Active Disturbance Rejection Controller(ADRC)is employed to mitigate the hysteresis effects of the compensated system and remove the steady-state error,which allows real-time compensation of the estimated perturbations as state feedback to achieve the required *** experimental laboratory station has been fabricated to evaluate the proposed *** test results show that the NHCC method can regulate the fan speed to the desired value(45 r/min at steady state)and broaden the effective input range to the full range under different engine ***,the proposed control method can reduce the non-linearity of the input and output curves(from 18%to 4%)and compensate for the asymmetric hysteresis(from 38%to 5%).
Dear Editor,This letter studies the bipartite consensus tracking problem for heterogeneous multi-agent systems with actuator faults and a leader's unknown time-varying control input. To handle such a problem, the ...
详细信息
Dear Editor,This letter studies the bipartite consensus tracking problem for heterogeneous multi-agent systems with actuator faults and a leader's unknown time-varying control input. To handle such a problem, the continuous fault-tolerant control protocol via observer design is developed. In addition, it is strictly proved that the multi-agent system driven by the designed controllers can still achieve bipartite consensus tracking after faults occur.
In the field of skeleton-based gesture recognition, occlusion remains a significant challenge, significantly degrading performance when key joints are occluded or disturbed. To tackle this issue, we propose DiffTrans,...
详细信息
With the depletion of high-quality iron ore resources,high-phosphorus oolitic hematite(HPOH)has attracted great attention due to its large reserve and relatively high iron ***,HPOH is very difficult to be used in iron...
详细信息
With the depletion of high-quality iron ore resources,high-phosphorus oolitic hematite(HPOH)has attracted great attention due to its large reserve and relatively high iron ***,HPOH is very difficult to be used in ironmaking process due to its special structure.A two-step method of gas-based direct reduction and magnetic separation was thus proposed to recover iron and reduce *** results showed that the powdery reduced iron produced contained 92.31%iron and 0.1%phosphorus,and the iron recovery was 92.65%under optimum reduction condition,which is suitable for following *** apatite will be reduced under long reduction time and a large reducing gas flow rate,resulting in more phosphorus found in the metallic *** the hydrogen–carbon ratio will inhibit the formation and growth of iron particles and prevent the breakage of oolitic *** adjustment of reduction temperature is recommended as it affects the oolitic structure and reduction.
This paper presents a fully distributed, low-complexity UAV formation controller design with fixed-time full-state error performance, which is able to address the difficulties in obtaining global information and suppr...
详细信息
In the rapidly advancing field of industrial automation, the reliability and maintenance of multirobot manufacturing systems are crucial. This paper proposes a collaborative optimization method for the reliability of ...
详细信息
This paper investigates resilient consensus control for teleoperation systems under denial-of-service (DoS) attacks. We design resilient controllers with auxiliary systems based on sampled positions of both master and...
详细信息
This article is devoted to one-class fault detection in linear discrete-time varying (LDTV) systems with uncertainties. Specifically, following the Hilbert Projection theorem, the residual generation problem is solved...
详细信息
Based on the personalized customization model in bearing steel enterprises, it is studied in this paper how historical data can be fully utilized to predict orders in the face of multi-variety and small-batch order mo...
详细信息
暂无评论