首页    期刊浏览 2024年11月25日 星期一
登录注册

文章基本信息

  • 标题:Data-Efficient Reinforcement Learning from Controller Guidance with Integrated Self-Supervision for Process Control
  • 本地全文:下载
  • 作者:Nicolas Bougie ; Takashi Onishi ; Yoshimasa Tsuruoka
  • 期刊名称:IFAC PapersOnLine
  • 印刷版ISSN:2405-8963
  • 出版年度:2022
  • 卷号:55
  • 期号:7
  • 页码:863-868
  • DOI:10.1016/j.ifacol.2022.07.553
  • 语种:English
  • 出版社:Elsevier
  • 摘要:AbstractModel-free reinforcement learning methods have achieved significant success in a variety of decision-making problems. In fact, they traditionally rely on large amounts of data generated by sample-efficient simulators. However, many process control industries involve complex and costly computations, which limits the applicability of model-free reinforcement learning. In addition, extrinsic rewards are naturally sparse in the real world, further increasing the amount of necessary interactions with the environment. This paper presents a sample-efficient model-free algorithm for process control, which massively accelerates the learning process even when rewards are extremely sparse. To achieve this, we leverage existing controllers to guide the agent's learning — controller guidance is used to drive exploration towards key regions of the state space. To further mitigate the above-mentioned challenges, we propose a strategy for self-supervision learning that lets us improve the agent's policy via its own successful experience. Notably, the method we develop is able to leverage guidance that does not include the actions and remains effective when the existing controllers are suboptimal. We present an empirical evaluation on a vinyl acetate monomer (VAM) chemical plant under disturbances. The proposed method exhibits better performance than baselines approaches and higher sample efficiency. Besides, empirical results show that our method outperforms the existing controllers for controlling the plant and canceling disturbances, mitigating the drop in the production load.
  • 关键词:KeywordsReinforcement learning controlProcess controlChemical plant controlCo-Learningself-learningArtificial intelligence
国家哲学社会科学文献中心版权所有