Article information

  • Title: On Overfitting and Asymptotic Bias in Batch Reinforcement Learning with Partial Observability
  • Authors: Vincent Francois-Lavet; Guillaume Rabusseau; Joelle Pineau
  • Journal: Journal of Artificial Intelligence Research
  • Print ISSN: 1076-9757
  • Publication year: 2019
  • Volume: 65
  • Pages: 1-30
  • DOI: 10.1613/jair.1.11478
  • Publisher: American Association of Artificial
  • Abstract: This paper provides an analysis of the tradeoff between asymptotic bias (suboptimality with unlimited data) and overfitting (additional suboptimality due to limited data) in the context of reinforcement learning with partial observability. Our theoretical analysis formally shows that a smaller state representation, while potentially increasing the asymptotic bias, decreases the risk of overfitting. This analysis relies on expressing the quality of a state representation by bounding $L_1$ error terms of the associated belief states. Theoretical results are empirically illustrated when the state representation is a truncated history of observations, both on synthetic POMDPs and on a large-scale POMDP in the context of smartgrids, with real-world data. Finally, similarly to known results in the fully observable setting, we also briefly discuss and empirically illustrate how using function approximators and adapting the discount factor may enhance the tradeoff between asymptotic bias and overfitting in the partially observable context.
  • Keywords: reinforcement learning; machine learning; knowledge representation
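
The abstract mentions using a truncated history of observations as the state representation. The following is a minimal sketch of that idea, not the authors' implementation: the function names `truncated_history` and `count_states`, the two-symbol observation alphabet, and the random trajectory are all assumptions made for illustration. It maps each time step to the tuple of the last `h` observations and counts how often each such state is visited, which gives a rough feel for why a longer history (a larger state representation) leaves fewer samples per state in a fixed batch.

```python
import random
from collections import Counter


def truncated_history(observations, h):
    """Return the last h observations as a hashable tuple (the assumed state)."""
    if h < 1:
        raise ValueError("h must be at least 1")
    return tuple(observations[-h:])


def count_states(trajectory, h):
    """Count visits to each truncated-history state along one trajectory."""
    counts = Counter()
    for t in range(1, len(trajectory) + 1):
        # State at step t is built from the observations seen so far.
        counts[truncated_history(trajectory[:t], h)] += 1
    return counts


if __name__ == "__main__":
    # Hypothetical data: 200 observations drawn uniformly from {"A", "B"}.
    rng = random.Random(0)
    trajectory = [rng.choice("AB") for _ in range(200)]
    for h in (1, 2, 3):
        counts = count_states(trajectory, h)
        # Number of distinct states and the visit count of the rarest one.
        print(h, len(counts), min(counts.values()))
```

With a fixed batch size, the number of distinct states can only grow as `h` increases, so the rarest state tends to be backed by fewer samples. This is only an informal picture of the overfitting side of the tradeoff; the asymptotic-bias side is not modelled here.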