Article Information

Title: An Extension of the Rational Policy Making algorithm to Continuous State Spaces
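Authors: Kazuteru Miyazaki; Hajime Kimura; Shigenobu Kobayashi et al.
Journal: 人工知能学会論文誌 (Transactions of the Japanese Society for Artificial Intelligence)
Print ISSN: 1346-0714
Online ISSN: 1346-8030
Publication year: 2007
Volume: 22
Issue: 3
Pages: 332-341
DOI: 10.1527/tjsai.22.332
Publisher: The Japanese Society for Artificial Intelligence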
Abstract: Reinforcement learning is a kind of machine learning. Profit Sharing, the Rational Policy Making algorithm (RPM), the Penalty Avoiding Rational Policy Making algorithm, and PS-r* are known to guarantee rationality in a typical class of Partially Observable Markov Decision Processes. However, they cannot handle continuous state spaces. In this paper, we present an approach for adapting them to continuous state spaces. We give RPM a mechanism for handling continuous state spaces in environments whose rewards are all of the same type. We show the effectiveness of the proposed method through numerical examples.
Keywords: reinforcement learning; profit sharing; continuous state spaces