首页    期刊浏览 2024年11月30日 星期六
登录注册

文章基本信息

  • 标题:Online Bahavior Aquisition of an Agent based on Coaching as Learning Assistance
  • 本地全文:下载
  • 作者:Masakazu HIROKAWA ; Kenji SUZUKI
  • 期刊名称:人工知能学会論文誌
  • 印刷版ISSN:1346-0714
  • 电子版ISSN:1346-8030
  • 出版年度:2010
  • 卷号:25
  • 期号:6
  • 页码:694-702
  • DOI:10.1527/tjsai.25.694
  • 出版社:The Japanese Society for Artificial Intelligence
  • 摘要:This paper describes a novel methodology, namely ``Coaching'', which allows humans to give a subjective evaluation to an agent in an iterative manner. This is an interactive learning method to improve the reinforcement learning by modifying a reward function dynamically according to given evaluations by a trainer and the learning situation of the agent. We demonstrate that the agent can learn different reward functions by given instructions such as ``good or bad'' by human's observation, and can also obtain a set of behavior based on the learnt reward functions through several experiments.
  • 关键词:HAI ; reinforcement learning ; coaching
国家哲学社会科学文献中心版权所有