Meta-Imitation Learning is a promising technique for the robot to learn a new task from observing one or a few human demonstrations. However, it usually requires a significant number of demonstrations both from humans...
详细信息
Abnormal or drastic changes in the natural environment may lead to unexpected events,such as tsunamis and earthquakes,which are becoming a major threat to national ***,no effective assessment approach can deduce a sit...
详细信息
Abnormal or drastic changes in the natural environment may lead to unexpected events,such as tsunamis and earthquakes,which are becoming a major threat to national ***,no effective assessment approach can deduce a situation and determine the optimal response strategy when a natural disaster *** this study,we propose a social evolution modeling approach and construct a deduction model for self-playing,self-learning,and self-upgrading on the basis of the idea of parallel data and reinforcement *** proposed approach can evaluate the impact of an event,deduce the situation,and provide optimal strategies for *** the breakage of a submarine cable caused by earthquake as an example,we find that the proposed modeling approach can obtain a higher reward compared with other existing methods.
The development of the scientific publishing system has remarkably enhanced global accessibility to research findings and substantially increased the visibility and dissemination of academic publications. However, sig...
详细信息
ISBN:
(数字)9798350349252
ISBN:
(纸本)9798350349269
The development of the scientific publishing system has remarkably enhanced global accessibility to research findings and substantially increased the visibility and dissemination of academic publications. However, significant challenges still exist in effectively safeguarding the intellectual property rights of contributors, such as the unauthorized usage of materials, the complexity of enforcing intellectual property rights across various legal jurisdictions, and high instances of plagiarism and content misuse. Additionally, financial barriers related to open access may restrict the participation of economically disadvantaged researchers, potentially biasing scientific records towards more affluent research initiatives. To address these issues, a novel decentralized framework is formulated to ensure truly open access. This framework leverages blockchain for immutable record-keeping and clear attribution of authorship, to prevent unauthorized usage and plagiarism. Besides, it also utilizes a copyright-sharing model based on decentralized autonomous organizations and operations (DAOs), where smart contracts automatically enforce copyright and access policies to ensure fair compensation for authors and researchers. Furthermore, the copyright sharing model based on non-fungible tokens (NFT) and gradual ownership optimization (GOO) mechanism is proposed to ensure fair and accurate recognition and compensation for scholarly contributions.
Parallel Manufacturing is a new manufacturing paradigm in industry, deeply integrating informalization, automation, and artificial intelligence. In this paper we propose a new mechanical design paradigm in Parallel Ma...
详细信息
This paper proposes a speed control method for a biomimetic robotic fish based on linear active disturbance rejection control. Inspired by a bluefin tuna in nature, a robotic fish with a two-joint propulsive mechanism...
详细信息
In this study, a novel nonlinear parallel control method is proposed for cascaded nonlinear systems using the backstepping technique. Unlike the existing state feedback control methods, the control input is taken into...
详细信息
This study proposes a new event-triggered optimal control (ETOC) method for discrete-time (DT) constrained nonlinear systems. First, a new triggering condition is proposed. We show the asymptotic stability of the clos...
详细信息
Learning-based autonomous vehicle trajectory planning methods have shown excellent performance in a variety of complex traffic scenarios. However, the existing imitation learning (IL) and reinforcement learning (RL) a...
Learning-based autonomous vehicle trajectory planning methods have shown excellent performance in a variety of complex traffic scenarios. However, the existing imitation learning (IL) and reinforcement learning (RL) algorithms still have their limitations, such as poor safety and generalizability for IL, and low data efficiency for RL. To leverage their respective advantages and mitigate the limitations, this paper proposes a novel hybrid RL algorithm for autonomous vehicle planning, where IL is embedded in it to guide its exploration with expert knowledge. Different from existing approaches, we use multi-step trajectory prediction instead of behavior cloning as the IL method integrated with online RL. Through such design, we make a further step in the research about how expert demonstration can be helpful to RL. Moreover, we conduct parallel training and testing of the algorithm based on real-world driving data. Experimental results demonstrate that our proposed approach outperforms standalone IL and RL methods, and performs better than RL methods enhanced by behavior cloning.
An important question in data-driven control is how to obtain an informative dataset. In this work, we consider the problem of effective data acquisition of an unknown linear system with bounded disturbance for both o...
详细信息
Reinforcement learning(RL) algorithms have been demonstrated to solve a variety of continuous control tasks. However,the training efficiency and performance of such methods limit further applications. In this paper, w...
详细信息
Reinforcement learning(RL) algorithms have been demonstrated to solve a variety of continuous control tasks. However,the training efficiency and performance of such methods limit further applications. In this paper, we propose an off-policy heterogeneous actor-critic(HAC) algorithm, which contains soft Q-function and ordinary Q-function. The soft Q-function encourages the exploration of a Gaussian policy, and the ordinary Q-function optimizes the mean of the Gaussian policy to improve the training efficiency. Experience replay memory is another vital component of off-policy RL methods. We propose a new sampling technique that emphasizes recently experienced transitions to boost the policy training. Besides, we integrate HAC with hindsight experience replay(HER) to deal with sparse reward tasks, which are common in the robotic manipulation domain. Finally, we evaluate our methods on a series of continuous control benchmark tasks and robotic manipulation tasks. The experimental results show that our method outperforms prior state-of-the-art methods in terms of training efficiency and performance, which validates the effectiveness of our method.
暂无评论