In this study, a novel nonlinear parallel control method is proposed for cascaded nonlinear systems using the backstepping technique. Unlike the existing state feedback control methods, the control input is taken into...
详细信息
This study proposes a new event-triggered optimal control (ETOC) method for discrete-time (DT) constrained nonlinear systems. First, a new triggering condition is proposed. We show the asymptotic stability of the clos...
详细信息
Reinforcement learning(RL) algorithms have been demonstrated to solve a variety of continuous control tasks. However,the training efficiency and performance of such methods limit further applications. In this paper, w...
详细信息
Reinforcement learning(RL) algorithms have been demonstrated to solve a variety of continuous control tasks. However,the training efficiency and performance of such methods limit further applications. In this paper, we propose an off-policy heterogeneous actor-critic(HAC) algorithm, which contains soft Q-function and ordinary Q-function. The soft Q-function encourages the exploration of a Gaussian policy, and the ordinary Q-function optimizes the mean of the Gaussian policy to improve the training efficiency. Experience replay memory is another vital component of off-policy RL methods. We propose a new sampling technique that emphasizes recently experienced transitions to boost the policy training. Besides, we integrate HAC with hindsight experience replay(HER) to deal with sparse reward tasks, which are common in the robotic manipulation domain. Finally, we evaluate our methods on a series of continuous control benchmark tasks and robotic manipulation tasks. The experimental results show that our method outperforms prior state-of-the-art methods in terms of training efficiency and performance, which validates the effectiveness of our method.
In response to concerns over the centralization tendency in the decentralized autonomous organizations (DAOs), TRUE autonomous organizations and operations (TAOs or TRUE DAOs) have been proposed recently. TAOs aim at ...
In response to concerns over the centralization tendency in the decentralized autonomous organizations (DAOs), TRUE autonomous organizations and operations (TAOs or TRUE DAOs) have been proposed recently. TAOs aim at spreading equitable value distribution and democratized decision-making, distinguishing them from their DAOs counterparts. This study focuses on the treasury within TAOs, which acts as a central fund pool and a crucial element in the decentralized economy (DeEco) system. First, against a backdrop of potential black swan events and other long-tail unforeseen challenges, a reference model for the intelligent treasury management of TAOs is proposed. Then, an evaluation system, namely VALID, is presented with metrics including verifiability, anti-volatility, legitimacy, inclusiveness, and decentralization. Furthermore, a novel parallel treasury management mechanism is proposed to demonstrate a virtual-real interactive closed-loop management and control paradigm of the treasury, thereby fostering the formulation and development of DeEco. This research provides a comprehensive perspective on intelligent treasury management of TAOs and their role in sustainable advancement of DeEco.
The movement of pedestrians involves temporal continuity,spatial interactivity,and random *** a result,pedestrian trajectory prediction is rather *** existing trajectory prediction methods tend to focus on just one as...
详细信息
The movement of pedestrians involves temporal continuity,spatial interactivity,and random *** a result,pedestrian trajectory prediction is rather *** existing trajectory prediction methods tend to focus on just one aspect of these challenges,ignoring the temporal information of the trajectory and making too many *** this paper,we propose a recurrent attention and interaction(RAI)model to predict pedestrian *** RAI model consists of a temporal attention module,spatial pooling module,and randomness modeling *** temporal attention module is proposed to assign different weights to the input sequence of a target,and reduce the speed deviation of different *** spatial pooling module is proposed to model not only the social information of neighbors in historical frames,but also the intention of neighbors in the current *** randomness modeling module is proposed to model the uncertainty and diversity of trajectories by introducing random *** conduct extensive experiments on several public *** results demonstrate that our method outperforms many that are state-ofthe-art.
In this paper,the L-gain based filtering problem for nonlinear positive semi-Markov jump systems is investigated by proposing a novel asynchronous design *** precisely,the mode-dependent filters are designed in terms ...
详细信息
In this paper,the L-gain based filtering problem for nonlinear positive semi-Markov jump systems is investigated by proposing a novel asynchronous design *** precisely,the mode-dependent filters are designed in terms of practical observed modes instead of true system modes,such that less conservatism can be *** addition,the effect of time-varying delays is taken into account for more robustness and *** selecting suitable stochastic Lyapunov-Krasovskii functions and applying the linear programming method,sufficient conditions are established to fulfill the desired L-gain ***,the illustrative simulation is performed to verify the effectiveness of our developed control scheme.
This paper provides a comprehensive survey of robotic autonomous grasping techniques. We summarize three key tasks: grasp detection, affordance detection, and model migration. Grasp detection determines the graspable ...
详细信息
In this paper, optimal control of nonlinear systems for non-zero-sum games is solved with a data-based and recursive least square. The new adjustment law is similar to experience replay algorithm which refer history d...
详细信息
This paper studies the problem of optimal parallel tracking control for continuous-time general nonlinear *** existing optimal state feedback control,the control input of the optimal parallel control is introduced int...
详细信息
This paper studies the problem of optimal parallel tracking control for continuous-time general nonlinear *** existing optimal state feedback control,the control input of the optimal parallel control is introduced into the feedback ***,due to the introduction of control input into the feedback system,the optimal state feedback control methods can not be applied *** address this problem,an augmented system and an augmented performance index function are proposed ***,the general nonlinear system is transformed into an affine nonlinear *** difference between the optimal parallel control and the optimal state feedback control is analyzed *** is proven that the optimal parallel control with the augmented performance index function can be seen as the suboptimal state feedback control with the traditional performance index ***,an adaptive dynamic programming(ADP)technique is utilized to implement the optimal parallel tracking control using a critic neural network(NN)to approximate the value function *** stability analysis of the closed-loop system is performed using the Lyapunov theory,and the tracking error and NN weights errors are uniformly ultimately bounded(UUB).Also,the optimal parallel controller guarantees the continuity of the control input under the circumstance that there are finite jump discontinuities in the reference ***,the effectiveness of the developed optimal parallel control method is verified in two cases.
The problem of traffic signal control is essential but remains unsolved. Some researchers use online reinforcement learning, including the off-policy one, to derive an optimal control policy through interaction betwee...
详细信息
暂无评论