首页    期刊浏览 2024年11月24日 星期日
登录注册

文章基本信息

  • 标题:Multi-step Greedy Reinforcement Learning Based on Model Predictive Control
  • 本地全文:下载
  • 作者:Yucheng Yang ; Sergio Lucia
  • 期刊名称:IFAC PapersOnLine
  • 印刷版ISSN:2405-8963
  • 出版年度:2021
  • 卷号:54
  • 期号:3
  • 页码:699-705
  • DOI:10.1016/j.ifacol.2021.08.323
  • 语种:English
  • 出版社:Elsevier
  • 摘要:AbstractReinforcement learning aims to compute optimal control policies with the help of data from closed-loop trajectories. Traditional model-free approaches need huge number of data points to achieve an acceptable performance, rendering them not applicable in most real situations, even if the data can be obtained from a detailed simulator. Model-based reinforcement learning approaches try to leverage model knowledge to drastically reduce the amount of data needed or to enforce important constraints to the closed-loop operation, which is another important drawback of model-free approaches. This paper proposes a novel model-based reinforcement learning approach. The main novelty is the fact that we exploit all the information of a model predictive control (MPC) computing step, and not only the first input that is actually applied to the plant, to efficiently learn a good approximation of the state value function. This approximation can be included into a model predictive control formulation as a terminal cost with a short prediction horizon, achieving a similar performance to an MPC with a very long prediction horizon. Simulation results of a discretized batch bioreactor illustrate the potential of the proposed methodology.
  • 关键词:KeywordsReinforcement learningModel predictive controlNonlinear systemBatch reactor
国家哲学社会科学文献中心版权所有