Vertebrates are able to quickly adapt to new environments in a very robust, seemingly effortless way. To explain both this adaptivity and robustness, a promising perspective in the neurosciences is the modular approach...
An often-used stability criterion in legged locomotion is the zero moment point (ZMP). The ZMP is a virtual point computed from the center of gravity (COG) position and acceleration, and it must be kept within the support polygon...
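As a concrete instance of the COG-based calculation this abstract mentions, one commonly used simplified form (assuming a negligible rate of change of angular momentum about the COG; this particular simplification is not necessarily the one used in the cited work) gives the ZMP coordinates in the ground plane as

\[
x_{\mathrm{ZMP}} = x_{\mathrm{COG}} - \frac{z_{\mathrm{COG}}\,\ddot{x}_{\mathrm{COG}}}{\ddot{z}_{\mathrm{COG}} + g},
\qquad
y_{\mathrm{ZMP}} = y_{\mathrm{COG}} - \frac{z_{\mathrm{COG}}\,\ddot{y}_{\mathrm{COG}}}{\ddot{z}_{\mathrm{COG}} + g},
\]

where g is the gravitational acceleration. Keeping (x_ZMP, y_ZMP) inside the support polygon is the stability condition the abstract refers to.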
Applying reinforcement learning to humanoid robots is challenging because humanoids have a large number of degrees of freedom and state and action spaces are continuous. Thus, most reinforcement learning algorithms would become computationally infeasible and require a prohibitive amount of trials to explore such high-dimensional spaces. In this paper, we present a probabilistic reinforcement learning approach, which is derived from the framework of stochastic optimal control and path integrals. The algorithm, called Policy Improvement with Path Integrals (PI²), has a surprisingly simple form, has no open tuning parameters besides the exploration noise, is model-free, and performs numerically robustly in high-dimensional learning problems. We demonstrate how PI² is able to learn full-body motor skills on a 34-DOF humanoid robot. To demonstrate the generality of our approach, we also apply PI² in the context of variable impedance control, where both planned trajectories and gain schedules for each joint are optimized simultaneously.
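The update rule behind PI² is compact enough to sketch. Below is a minimal, episodic simplification in Python: the toy quadratic rollout cost, the target vector, and all constants are illustrative assumptions standing in for a real robot or simulator. The full algorithm in the paper computes time-indexed cost-to-go values and per-time-step weights; this sketch only conveys the core idea of probability-weighted averaging of exploration noise.

# Minimal sketch of a PI^2-style update: noisy rollouts are scored,
# costs are mapped to softmax weights, and the parameter update is the
# weighted average of the exploration noise. Illustrative assumptions
# throughout; not the paper's full time-indexed algorithm.
import numpy as np

rng = np.random.default_rng(0)

def rollout_cost(theta):
    # Hypothetical stand-in for running the policy and accumulating cost;
    # a real application would simulate or execute trajectories.
    target = np.array([1.0, -2.0, 0.5])
    return np.sum((theta - target) ** 2)

theta = np.zeros(3)   # policy parameters
sigma = 0.5           # exploration noise magnitude (assumed)
lam = 1.0             # softmax temperature over rollout costs (assumed)
K = 20                # noisy rollouts per update (assumed)

for iteration in range(100):
    eps = sigma * rng.standard_normal((K, theta.size))  # exploration noise
    costs = np.array([rollout_cost(theta + e) for e in eps])
    # Exponentiated, normalized costs: low-cost rollouts get high weight.
    w = np.exp(-(costs - costs.min()) / lam)
    w /= w.sum()
    # Parameter update: probability-weighted average of the noise.
    theta = theta + w @ eps

print(theta)  # approaches the (assumed) target [1.0, -2.0, 0.5]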
ISBN (Print): 9781424474264
Although there has been a significant amount of work in stochastic optimal control theory on the development of new algorithms, the problem of how to control a stochastic nonlinear system remains an open research topic. Recent iterative linear quadratic optimal control methods (iLQG) [1], [2] handle control- and state-multiplicative noise but are derived from a first-order approximation of the dynamics. On the other hand, methods such as Differential Dynamic Programming (DDP) expand the dynamics up to second order but so far handle only nonlinear systems with additive noise. In this work we present a generalization of the classic Differential Dynamic Programming algorithm. We assume the existence of state- and control-multiplicative process noise and derive the second-order expansion of the cost-to-go, finding the correction terms that arise from the stochastic assumption. Although quartic and cubic terms appear in the initial expression, we show that they vanish, leaving the same quadratic structure as standard DDP.
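For context, the "quadratic structure" the abstract refers to can be sketched from the classic deterministic case the paper generalizes. With dynamics x' = f(x, u), stage cost \ell, and next-step value function V', the standard DDP backward pass expands the state-action value to second order around a nominal trajectory:

\[
\begin{aligned}
Q_x &= \ell_x + f_x^{\top} V'_x, &
Q_u &= \ell_u + f_u^{\top} V'_x, \\
Q_{xx} &= \ell_{xx} + f_x^{\top} V'_{xx} f_x + V'_x \cdot f_{xx}, &
Q_{uu} &= \ell_{uu} + f_u^{\top} V'_{xx} f_u + V'_x \cdot f_{uu}, \\
Q_{ux} &= \ell_{ux} + f_u^{\top} V'_{xx} f_x + V'_x \cdot f_{ux}, &
\delta u^{*} &= -\,Q_{uu}^{-1}\left(Q_u + Q_{ux}\,\delta x\right).
\end{aligned}
\]

This is the standard deterministic recursion, not the paper's result: the contribution described above is that, with state- and control-multiplicative noise, the cubic and quartic terms in the expanded cost-to-go vanish, so the backward pass retains this same quadratic form with noise-dependent corrections to the coefficients (see the paper for the exact correction terms).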
In most activities of daily living, related tasks are encountered over and over again. This regularity allows humans and robots to reuse existing solutions for known recurring tasks. We expect that reusing a set of st...