首页    期刊浏览 2024年07月09日 星期二
登录注册

文章基本信息

  • 标题:Efficient hindsight reinforcement learning using demonstrations for robotic tasks with sparse rewards
  • 本地全文:下载
  • 作者:Guoyu Zuo ; Qishen Zhao ; Jiahao Lu
  • 期刊名称:International Journal of Advanced Robotic Systems
  • 印刷版ISSN:1729-8806
  • 电子版ISSN:1729-8814
  • 出版年度:2020
  • 卷号:17
  • 期号:1
  • 页码:1-13
  • DOI:10.1177/1729881419898342
  • 出版社:SAGE Publications
  • 摘要:The goal of reinforcement learning is to enable an agent to learn by using rewards. However, some robotic tasks naturally specify with sparse rewards, and manually shaping reward functions is a difficult project. In this article, we propose a general and model-free approach for reinforcement learning to learn robotic tasks with sparse rewards. First, a variant of Hindsight Experience Replay, Curious and Aggressive Hindsight Experience Replay, is proposed to improve the sample efficiency of reinforcement learning methods and avoid the need for complicated reward engineering. Second, based on Twin Delayed Deep Deterministic policy gradient algorithm, demonstrations are leveraged to overcome the exploration problem and speed up the policy training process. Finally, the action loss is added into the loss function in order to minimize the vibration of output action while maximizing the value of the action. The experiments on simulated robotic tasks are performed with different hyperparameters to verify the effectiveness of our method. Results show that our method can effectively solve the sparse reward problem and obtain a high learning speed.
  • 关键词:Robot learning; reinforcement learning; sparse reward; CAHER; demonstrations
  • 其他关键词:Robot learning ; reinforcement learning ; sparse reward ; CAHER ; demonstrations
国家哲学社会科学文献中心版权所有