首页    期刊浏览 2024年07月05日 星期五
登录注册

文章基本信息

  • 标题:A kernel based true online Sarsa(λ) for continuous space control problems
  • 本地全文:下载
  • 作者:Zhu, Fei ; Zhu, Haijun ; Fu, Yuchen
  • 期刊名称:Computer Science and Information Systems
  • 印刷版ISSN:1820-0214
  • 电子版ISSN:2406-1018
  • 出版年度:2017
  • 卷号:14
  • 期号:3
  • 页码:789-804
  • 出版社:ComSIS Consortium
  • 摘要:Reinforcement learning is an efficient learning method for the control problem by interacting with the environment to get an optimal policy. However, it al so faces challenges such as low convergence accuracy and slow convergence. More over, conventional reinforcement learning algorithms could hardly solve continuous control problems. The kernel-based method can accelerate convergence speed and improve convergence accuracy; and the policy gradient method is a good way to deal with continuous space problems. We proposed a Sarsa(λ) version of true online time difference algorithm, named True Online Sarsa(λ)(TOSarsa(λ)), on the basis of the clustering-based sample specification method and selective kernel-based value function. The TOSarsa(λ) algorithm has a consistent result with both the forward view and the backward view which ensures to get an optimal policy in less time. Afterwards we also combined TOSarsa(λ) with heuristic dynamic programming. The experiments showed our proposed algorithm worked well in dealing with continuous control problem.
  • 关键词:reinforcement learning; kernel method; true online; policy gradient; Sarsa(λ)
国家哲学社会科学文献中心版权所有