首页    期刊浏览 2024年11月25日 星期一
登录注册

文章基本信息

  • 标题:Constrained Reinforcement Learning for Dynamic Optimization under Uncertainty
  • 本地全文:下载
  • 作者:P. Petsagkourakis ; I.O. Sandoval ; E. Bradford
  • 期刊名称:IFAC PapersOnLine
  • 印刷版ISSN:2405-8963
  • 出版年度:2020
  • 卷号:53
  • 期号:2
  • 页码:11264-11270
  • DOI:10.1016/j.ifacol.2020.12.361
  • 语种:English
  • 出版社:Elsevier
  • 摘要:AbstractDynamic real-time optimization (DRTO) is a challenging task due to the fact that optimal operating conditions must be computed in real time. The main bottleneck in the industrial application of DRTO is the presence of uncertainty. Many stochastic systems present the following obstacles: 1) plant-model mismatch, 2) process disturbances, 3) risks in violation of process constraints. To accommodate these difficulties, we present a constrained reinforcement learning (RL) based approach. RL naturally handles the process uncertainty by computing an optimal feedback policy. However, no state constraints can be introduced intuitively. To address this problem, we present a chance-constrained RL methodology. We use chance constraints to guarantee the probabilistic satisfaction of process constraints, which is accomplished by introducing backoffs, such that the optimal policy and backoffs are computed simultaneously. Backoffs are adjusted using the empirical cumulative distribution function to guarantee the satisfaction of a joint chance constraint. The advantage and performance of this strategy are illustrated through a stochastic dynamic bioprocess optimization problem, to produce sustainable high-value bioproducts.
  • 关键词:KeywordsReinforcement learningUncertain dynamic systemsStochastic controlChemical process controlAdaptive controlPolicy gradient
国家哲学社会科学文献中心版权所有