首页    期刊浏览 2024年11月24日 星期日
登录注册

文章基本信息

  • 标题:Time Manipulation Technique for Speeding up Reinforcement Learning in Simulations
  • 本地全文:下载
  • 作者:P. Kormushev ; K. Nomoto ; F. Dong
  • 期刊名称:Cybernetics and Information Technologies
  • 印刷版ISSN:1311-9702
  • 电子版ISSN:1314-4081
  • 出版年度:2008
  • 卷号:8
  • 期号:1
  • 出版社:Bulgarian Academy of Science
  • 摘要:A technique for speeding up reinforcement learning algorithms by using time manipulation is proposed. It is applicable to failure-avoidance control problems running in a computer simulation. Turning the time of the simulation backwards on failure events is shown to speed up the learning by 260% and improve the state space exploration by 12% on the cart-pole balancing task, compared to the conventional Q-learning and Actor-Critic algorithms.
  • 关键词:Reinforcement learning; computer simulation; state space exploration;active learning.
国家哲学社会科学文献中心版权所有