首页    期刊浏览 2024年09月15日 星期日
登录注册

文章基本信息

  • 标题:Reinforcement learning with guarantees: a review
  • 本地全文:下载
  • 作者:Pavel Osinenko ; Dmitrii Dobriborsci ; Wolfgang Aumer
  • 期刊名称:IFAC PapersOnLine
  • 印刷版ISSN:2405-8963
  • 出版年度:2022
  • 卷号:55
  • 期号:15
  • 页码:123-128
  • DOI:10.1016/j.ifacol.2022.07.619
  • 语种:English
  • 出版社:Elsevier
  • 摘要:AbstractReinforcement learning is concerned with a generic concept of an agent acting in an environment. From the control theory standpoint, reinforcement learning may be considered as an adaptive optimal control scheme. Despite accumulating evidence of effectiveness of reinforcement learning in various applications, which range from video games to robotics, this control scheme in its bare-bones version provides no guarantees on the performance of the agent-environment closed loop. Measures have to be taken to provide the said guarantees. This survey gives a brief picture of the current progress in this direction. Three major groups of approaches are overviewed: supervisor-based, Lyapunov reinforcement learning and fusion with model-predictive control. The central message of this survey is that a synergy with classical model-based control seems the most promising direction of research in reinforcement learning, as long as it is to become an industry standard.
  • 关键词:KeywordsReinforcement learningstabilitysafe control
国家哲学社会科学文献中心版权所有