This letter presents a control framework that combines model-based optimal control and reinforcement learning (RL) to achieve versatile and robust legged locomotion. Our approach enhances the RL training process by in...
详细信息
Due to the increase of automation in the field of construction, more and more automation approaches for excavators have been developed. A special case of excavators are drilling excavators. Therefore, a precise pose e...
详细信息
Control policies trained using deep reinforcement learning often generate stiff, high-frequency motions in response to unexpected disturbances. To promote more natural and compliant balance recovery strategies, we pro...
详细信息
ISBN:
(数字)9798350384574
ISBN:
(纸本)9798350384581
Control policies trained using deep reinforcement learning often generate stiff, high-frequency motions in response to unexpected disturbances. To promote more natural and compliant balance recovery strategies, we propose a simple modification to the typical reinforcement learning training process. Our key insight is that stiff responses to perturbations are due to an agent’s incentive to maximize task rewards at all times, even as perturbations are being applied. As an alternative, we introduce an explicit recovery stage where tracking rewards are given irrespective of the motions generated by the control policy. This allows agents a chance to gradually recover from disturbances before attempting to carry out their main tasks. Through an in-depth analysis, we highlight both the compliant nature of the resulting control policies, as well as the benefits that compliance brings to legged locomotion. In our simulation and hardware experiments, the compliant policy achieves more robust, energy-efficient, and safe interactions with the environment.
Thailand has a wide variety of tourist attractions, making it difficult for tourist to choose where to go on vacation. The tourist destination recommendation system is a challenge for creating a system to help recomme...
详细信息
Conventional approaches to Federated Deep Reinforcement Learning (FDRL) often mandate the participation of all the associated devices and perform indiscriminate aggregation of the models. This can, at times, culminate...
详细信息
To automate surgical (sub-)tasks in robotic surgery, the knowledge of the exact pose of the instrument is mandatory. The application of Optical Coherence Tomography (OCT) to the problem of pose measurement appears pro...
详细信息
This paper investigates the influence of a static robot head on deviations of human hand movements from task direction (motor interference) during simultaneous human and robot arm movements using a collaborative robot...
详细信息
We propose an approach to do learning in Gaussian factor graphs. We treat all relevant quantities (inputs, outputs, parameters, activations) as random variables in a graphical model, and view training and prediction a...
详细信息
We propose an approach to do learning in Gaussian factor graphs. We treat all relevant quantities (inputs, outputs, parameters, activations) as random variables in a graphical model, and view training and prediction as inference problems with different observed nodes. Our experiments show that these problems can be efficiently solved with belief propagation (BP), whose updates are inherently local, presenting exciting opportunities for distributed and asynchronous training. Our approach can be scaled to deep networks and provides a natural means to do continual learning: use the BP-estimated posterior of the current task as a prior for the next. On a video denoising task we demonstrate the benefit of learnable parameters over a classical factor graph approach and we show encouraging performance of deep factor graphs for continual image classification. Copyright 2024 by the author(s)
When building a predictive model, it is often difficult to ensure that application-specific requirements are encoded by the model that will eventually be deployed. Consider researchers working on hate speech detection...
详细信息
This paper presents a novel method to quantify Trust in HRI. It proposes an HRI framework for estimating the Robot Trust towards the Human in the context of a narrow and specified task. The framework produces a real-t...
详细信息
暂无评论