Abstract: A technique for speeding up reinforcement learning algorithms by using time manipulation is proposed. It is applicable to failure-avoidance control problems running in a computer simulation. Turning the simulation time backwards on failure events is shown to speed up learning by 260% and improve state-space exploration by 12% on the cart-pole balancing task, compared to the conventional Q-learning and Actor-Critic algorithms.
Keywords: Reinforcement learning; computer simulation; state space exploration; active learning.
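To illustrate the idea of rewinding simulation time on failure, here is a minimal sketch: a tabular Q-learning agent on a simplified cart-pole simulation that, when the pole falls, restores the simulation to a state a few steps before the failure and continues from there, instead of restarting the whole episode. Everything here is an assumption for illustration, not the paper's implementation: the dynamics constants, the coarse state discretization, the reward of +1 per step and -1 on failure, and the rewind depth of 3 steps are all hypothetical choices, and the code does not reproduce the reported 260%/12% figures.

```python
import math
import random

def step(state, force):
    # Minimal cart-pole dynamics (Euler integration, textbook constants).
    x, x_dot, th, th_dot = state
    g, mc, mp, l, dt = 9.8, 1.0, 0.1, 0.5, 0.02
    total = mc + mp
    temp = (force + mp * l * th_dot ** 2 * math.sin(th)) / total
    th_acc = (g * math.sin(th) - math.cos(th) * temp) / (
        l * (4.0 / 3.0 - mp * math.cos(th) ** 2 / total))
    x_acc = temp - mp * l * th_acc * math.cos(th) / total
    return (x + dt * x_dot, x_dot + dt * x_acc,
            th + dt * th_dot, th_dot + dt * th_acc)

def failed(state):
    # Failure: cart leaves the track or pole tilts past 12 degrees.
    x, _, th, _ = state
    return abs(x) > 2.4 or abs(th) > 12 * math.pi / 180

def discretize(state):
    # Coarse bins; the granularity here is an illustrative choice.
    x, x_dot, th, th_dot = state
    return (int(x > 0), int(x_dot > 0),
            min(5, max(-6, int(th / 0.02))), int(th_dot > 0))

def run(rewind=3, episodes=200, seed=0, max_steps=500):
    """Q-learning; rewind > 0 enables the time-manipulation variant."""
    rng = random.Random(seed)
    Q = {}
    alpha, gamma, eps = 0.5, 0.99, 0.1
    steps_per_episode = []
    for _ in range(episodes):
        state = (0.0, 0.0, rng.uniform(-0.05, 0.05), 0.0)
        history = []  # recent states, kept so we can rewind on failure
        t = 0
        while t < max_steps:
            t += 1
            s = discretize(state)
            if rng.random() < eps:
                a = rng.randrange(2)
            else:
                a = max(range(2), key=lambda b: Q.get((s, b), 0.0))
            history.append(state)
            state = step(state, 10.0 if a == 1 else -10.0)
            if failed(state):
                # Penalize the failing transition.
                Q[(s, a)] = Q.get((s, a), 0.0) + alpha * (-1.0 - Q.get((s, a), 0.0))
                if rewind and len(history) >= rewind:
                    # Time manipulation: jump the simulation back a few
                    # steps and keep learning from the pre-failure state,
                    # instead of resetting the whole episode.
                    state = history[-rewind]
                    del history[-rewind:]
                    continue
                break
            s2 = discretize(state)
            best = max(Q.get((s2, b), 0.0) for b in range(2))
            Q[(s, a)] = Q.get((s, a), 0.0) + alpha * (
                1.0 + gamma * best - Q.get((s, a), 0.0))
        steps_per_episode.append(t)
    return steps_per_episode
```

Running `run(rewind=3)` against `run(rewind=0)` compares the rewind variant with a conventional reset-on-failure baseline; the rewind keeps the agent near the informative region of the state space around failures, which is the intuition behind the reported exploration gain.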