This paper identifies and addresses the problems with naively combining (reinforcement) learning-based controllers and state estimators for robotic in-hand manipulation. Specifically, we tackle the challenging task of...
This paper identifies and addresses the problems with naively combining (reinforcement) learning-based controllers and state estimators for robotic in-hand manipulation. Specifically, we tackle the challenging task of purely tactile, goal-conditioned, dextrous in-hand reorientation with the hand pointing downwards. Due to the limited sensing available, many control strategies that are feasible in simulation when having full knowledge of the object's state do not allow for accurate state estimation. Hence, separately training the controller and the estimator and combining the two at test time leads to poor performance. We solve this problem by coupling the control policy to the state estimator already during training in simulation. This approach leads to more robust state estimation and overall higher performance on the task while maintaining an interpretability advantage over end-to-end policy learning. With our GPU-accelerated implementation, learning from scratch takes a median training time of only 6.5 hours on a single, low-cost GPU. In simulation experiments with the DLR-Hand II and for four significantly different object shapes, we provide an in-depth analysis of the performance of our approach. We demonstrate the successful sim2real transfer by rotating the four objects to all 24 orientations in the $\pi/2$ discretization of SO(3), which has never been achieved for such a diverse set of shapes. Finally, our method is able to reorient a cube consecutively to in median nine goals, which was beyond the reach of previous methods in this challenging setting. (Web: https://***/dlr-tactile-manipulation)
In this article, a new predictive controller based on fast model predictive control has been proposed for permanent magnet synchronous motor(PMSM) drive system to solve the quadratic programming problem online in a ve...
详细信息
This paper focuses on developing a virtually controlled robot by integrating Virtual Reality (VR) and Robot Operating System (ROS). Robots can be used in certain environments where humans cannot be physically present ...
详细信息
One of the most common injuries sustained by humans is injury to the ankle joint. After an injury, there is a long process of treatment and rehabilitation, which can be complicated by prolonged immobility of the joint...
One of the most common injuries sustained by humans is injury to the ankle joint. After an injury, there is a long process of treatment and rehabilitation, which can be complicated by prolonged immobility of the joint. To reduce the rate of complications, medical devices for mechanotherapy are usually used, which can also speed up the patient's recovery. Each patient requires a personalized adjustment of the device due to the physiological characteristics of the foot. The aim of the study is to increase the effectiveness of ankle joint rehabilitation by personalized adjustment of the device. It is necessary to create a methodology for adjusting the device based on controlling the interaction force between the foot and the platform to ensure uniform contact at the time of fixation of the patient's leg. The results of the research can be used to create a mechanotherapy device that provides a given movement of the foot under unpredictable changes in physiological parameters with the possibility of personalized adjustment to increase the effectiveness of rehabilitation measures.
The Robot Operating System (ROS) has established itself as a useful set of software libraries and tools that can help to build robot applications. It provides services such as hardware abstraction, low-level device ma...
详细信息
The paper consideres complementary control for parrying external and internal perturbations of the system. The directions of parrying perturbations in automatic control systems (ACS) are marked. The basics of compleme...
详细信息
Vision-based control of mobile robots often involves complex calculations to derive a control law. The reinforcement learning algorithm (Q-learning) offers a machine learning method to extrapolate a control law from a...
Vision-based control of mobile robots often involves complex calculations to derive a control law. The reinforcement learning algorithm (Q-learning) offers a machine learning method to extrapolate a control law from an environment given discretized actions, without the need of complex calculations. In this paper, a vision-based controller is created using Q-Learning to enable tracking in a leader-follower configuration of two nonholonomic autonomous mobile robots. The follower robot gathers its desired trajectory values by using a deep learning SSD model to identify a distinguishing visual feature on the leader robot and uses a lidar to determine the distance between two robots. These parameters are utilized to select an optimal action of the follower robot through reinforcement learning. The emulated results in a ROS Gazebo environment show this method to be effective in enabling a wheeled mobile robot to follow another, while simultaneously avoiding obstacles.
Future crewed missions beyond low earth orbit will greatly rely on the support of robotic assistance platforms to perform inspection and manipulation of critical assets. This includes crew habitats, landing sites or a...
详细信息
Future crewed missions beyond low earth orbit will greatly rely on the support of robotic assistance platforms to perform inspection and manipulation of critical assets. This includes crew habitats, landing sites or assets for life support and operation. Maintenance and manipulation of a crewed site in extra-terrestrial environments is a complex task and the system will have to face different challenges during operation. While most may be solved autonomously, in certain occasions human intervention will be required. The telerobotic demonstration mission, Surface Avatar, led by the German Aerospace Center (DLR), with partner European Space Agency (ESA), investigates different approaches offering astronauts on board the International Space Station (ISS) control of ground robots in representative scenarios, e.g. a Martian landing and exploration site. In this work we present a feasibility study on how to integrate auditory information into the mentioned application. We will discuss methods for obtaining audio information and localizing audio sources in the environment, as well as fusing auditory and visual information to perform state estimation based on the gathered data. We demonstrate our work in different experiments to show the effectiveness of utilizing audio information, the results of spectral analysis of our mission assets, and how this information could help future astronauts to argue about the current mission situation.
Traditionally, manufacturing and assembly of space assets is performed on ground before sending them into orbit. However, this monolithic approach involves high launch costs due to increasing asset sizes, e.g., large ...
详细信息
Traditionally, manufacturing and assembly of space assets is performed on ground before sending them into orbit. However, this monolithic approach involves high launch costs due to increasing asset sizes, e.g., large telescopes for space observation. Alternatively, in-orbit assembly of space structures after launching the raw materials to orbit opens wider possi-bilities at a reduced cost. Mobile robotics, such as walking manipulators or multi-arm robots, are a critical component for this approach due to their mobility in orbit. However, unlike ter-restrial assembly tasks, the continuous motion of the robot and materials, coupled with the change of inertial properties of the structure, results in a rotational deviation of the platform due to conservation of angular momentum in orbit. This might violate the tolerance limits of the platform antenna's cone angle for communication with the ground stations. Although exploiting the attitude control system of the platform is a straightforward solution, it might lead to issues related to the associated actu-ators like reaction wheels saturation, high-frequency vibration, or high fuel consumption. To deal with this problem, in this paper we formulate the attitude disturbance problem as a minimization of the effects created by the gait of the walking manipulator. Investigating the dynamic coupling between the robot system and the space structure gives a deeper understanding of the spacecraft's be-havior depending on the robot gaits. The paper proposes a controller that optimizes the forces that the robotic arm applies to the structure, hence minimizing the base rotation. As an application, we use a space structure composed of identical elements, namely the mirrors of a segmented telescope, endowed with standard interfaces to allow the robot locomotion. We show the effects of optimizing these interaction forces in various sce-narios and positions on the structure through multiple dynamic simulations.
Dextrous in-hand manipulation with a multi-fingered robotic hand is a challenging task, esp. when performed with the hand oriented upside down, demanding permanent force-closure, and when no external sensors are used....
Dextrous in-hand manipulation with a multi-fingered robotic hand is a challenging task, esp. when performed with the hand oriented upside down, demanding permanent force-closure, and when no external sensors are used. For the task of reorienting an object to a given goal orientation (vs. infinitely spinning it around an axis), the lack of external sensors is an additional fundamental challenge as the state of the object has to be estimated all the time, e.g., to detect when the goal is reached. In this paper, we show that the task of reorienting a cube to any of the 24 possible goal orientations in a π/2-raster using the torque-controlled DLR-Hand II is possible. The task is learned in simulation using a modular deep reinforcement learning architecture: the actual policy has only a small observation time window of 0.5 s but gets the cube state as an explicit input which is estimated via a deep differentiable particle filter trained on data generated by running the policy. In simulation, we reach a success rate of 92% while applying significant domain randomization. Via zero-shot Sim2Real-transfer on the real robotic system, all 24 goal orientations can be reached with a high success rate. (Web: ***/dlr-tactile-manipulation)
暂无评论