Event-triggered control has attracted considerable attention for its effectiveness in resource-restricted applications. To make event-triggered control as an end-to-end solution, a key issue is how to effectively lear...
详细信息
The impact of new dual-credit policy on electric vehicle (EV) diffusion is examined through complex network evolutionary game theory, we find that merely tightening new energy vehicle (NEV) credit rule and corporate a...
详细信息
Tremendous amount of data are being generated and saved in many complex engineering and social systems every *** is significant and feasible to utilize the big data to make better decisions by machine learning techniq...
详细信息
Tremendous amount of data are being generated and saved in many complex engineering and social systems every *** is significant and feasible to utilize the big data to make better decisions by machine learning techniques. In this paper, we focus on batch reinforcement learning(RL) algorithms for discounted Markov decision processes(MDPs) with large discrete or continuous state spaces, aiming to learn the best possible policy given a fixed amount of training data. The batch RL algorithms with handcrafted feature representations work well for low-dimensional MDPs. However, for many real-world RL tasks which often involve high-dimensional state spaces, it is difficult and even infeasible to use feature engineering methods to design features for value function approximation. To cope with high-dimensional RL problems, the desire to obtain data-driven features has led to a lot of works in incorporating feature selection and feature learning into traditional batch RL algorithms. In this paper, we provide a comprehensive survey on automatic feature selection and unsupervised feature learning for high-dimensional batch RL. Moreover, we present recent theoretical developments on applying statistical learning to establish finite-sample error bounds for batch RL algorithms based on weighted Lpnorms. Finally, we derive some future directions in the research of RL algorithms, theories and applications.
This paper mainly focuses on designing a sliding mode boundary controller for a single flexible-link manipulator based on adaptive radial basis function (RBF) neural network. The flexible manipulator in this paper i...
详细信息
This paper mainly focuses on designing a sliding mode boundary controller for a single flexible-link manipulator based on adaptive radial basis function (RBF) neural network. The flexible manipulator in this paper is considered to be an Euler-Bernoulli beam. We first obtain a partial differential equation (PDE) model of single-link flexible manipulator by using Hamiltons approach. To improve the control robustness, the system uncertainties including modeling uncertainties and external disturbances are compensated by an adaptive neural approximator. Then, a sliding mode control method is designed to drive the joint to a desired position and rapidly suppress vibration on the beam. The stability of the closed-loop system is validated by using Lyapunov's method based on infinite dimensional model, avoiding problems such as control spillovers caused by traditional finite dimensional truncated models. This novel controller only requires measuring the boundary information, which facilitates implementation in engineering practice. Favorable performance of the closed-loop system is demonstrated by numerical simulations.
Soft pneumatic actuators are highly flexible and can adapt to complex environments, attracting significant attention for their ability to operate in settings where rigid actuators cannot. Most existing soft pneumatic ...
详细信息
Group behavior forecasting is an emergent re- search and application field in social computing. Most of the existing group behavior forecasting methods have heavily re- lied on structured data which is usually hard to...
详细信息
Group behavior forecasting is an emergent re- search and application field in social computing. Most of the existing group behavior forecasting methods have heavily re- lied on structured data which is usually hard to obtain. To ease the heavy reliance on structured data, in this paper, we pro- pose a computational approach based on the recognition of multiple plans/intentions underlying group behavior. We fur- ther conduct human experiment to empirically evaluate the effectiveness of our proposed approach.
DO we need a fundamental change in our professional culture and knowledge foundation for control and automation?If so,what are necessary and critical steps we must take to ensure such a change would take place effecti...
详细信息
DO we need a fundamental change in our professional culture and knowledge foundation for control and automation?If so,what are necessary and critical steps we must take to ensure such a change would take place effectively and efficiently,or more general,smoothly and sustainably?
Existing wearable data gloves for gesture recognition often face challenges in achieving both high precision and real-time performance. To address these limitations, we propose a data glove design incorporating fiber ...
详细信息
In this paper, distributed containment control problems of general linear multi-agent systems are investigated. The objective is to make the followers in a multi-agent network converge to the convex hull spanned by so...
详细信息
In this paper, distributed containment control problems of general linear multi-agent systems are investigated. The objective is to make the followers in a multi-agent network converge to the convex hull spanned by some leaders whose control inputs are nonzero and not available to any *** mode surfaces are defined for the cases of reduced order and non-reduced order, respectively. For each case, fast sliding mode controllers are designed. It is shown that all the error trajectories exponentially reach the sliding mode surfaces in a finite time if for each follower, there exists at least one of the leaders who has a directed path to the follower, and the leaderscontrol inputs are bounded. The control Lyapunov function for exponential finite time stability, motivated by the fast terminal sliding mode control, is used to prove reachability of the sliding mode surfaces. Simulation examples are given to illustrate the theoretical results.
暂无评论