首页    期刊浏览 2024年11月26日 星期二
登录注册

文章基本信息

  • 标题:An Extension of the Rational Policy Making algorithm to Continuous State Spaces
  • 本地全文:下载
  • 作者:Kazuteru Miyazaki ; Hajime Kimura ; Shigenobu Kobayashi
  • 期刊名称:人工知能学会論文誌
  • 印刷版ISSN:1346-0714
  • 电子版ISSN:1346-8030
  • 出版年度:2007
  • 卷号:22
  • 期号:3
  • 页码:332-341
  • DOI:10.1527/tjsai.22.332
  • 出版社:The Japanese Society for Artificial Intelligence
  • 摘要:Reinforcement Learning is a kind of machine learning. We know Profit Sharing, the Rational Policy Making algorithm (RPM), the Penalty Avoiding Rational Policy Making algorithm and PS-r* to guarantee the rationality in a typical class of the Partially Observable Markov Decision Processes. However they cannot treat continuous state spaces. In this paper, we present a solution to adapt them in continuous state spaces. We give RPM a mechanism to treat continuous state spaces in the environment that has the same type of a reward. We show the effectiveness of the proposed method in numerical examples.
  • 关键词:reinforcement learning ; profit sharing ; continuous state spaces
国家哲学社会科学文献中心版权所有