版权所有:内蒙古大学图书馆 技术提供:维普资讯• 智图
内蒙古自治区呼和浩特市赛罕区大学西街235号 邮编: 010021
作者机构:Louisiana State University 3277 Patrick F. Taylor Hall Baton Rouge 70803 USA Louisiana State University 3240-A Patrick F. Taylor Hall Baton Rouge 70803 USA
出 版 物:《Procedia Manufacturing》
年 卷 期:2019年第39卷
页 面:406-413页
主 题:Reinforcement Learning Inspect Correct SARSA algorithm
摘 要:The ability to rapidly program robots for complex tasks is an important precursor to wider adoption of robotics in industry. Robot programming is often time consuming and brittle to unanticipated variations in processing. Automated robot task learning is a solution to this problem. Reinforcement Learning (RL) is a commonly used approach for a robot to autonomously learn simple tasks. In RL, rewards are used to guide the robot towards learning an optimal plan or control policy. RL, however, has proven to be of limited value for problems with large-state spaces and considerable environmental variability. In this paper, we investigate formulation of the RL approach for inspect/correct types of tasks, specifically a misplaced block in a simple grid-world environment (requiring searching the gird world to identify a missing block and returning the missing block back to the target). We use a hybrid method, combining the SARSA algorithm and a model of the environment. The model of the environment is used as a reference model to reduce the state space, avoiding unnecessary exploration of the environment. A main focus of this research is the impact of task variability on RL performance.