文章基本信息

标题：Policy Iteration Algorithm for the Control of Oxygenation
本地全文：下载
作者：Anake Pomprapa ; Milod Mir Wais ; Marian Walter 等
期刊名称：IFAC PapersOnLine
印刷版ISSN：2405-8963
出版年度：2015
卷号：48
期号：20
页码：517-522
DOI：10.1016/j.ifacol.2015.10.193
语种：English
出版社：Elsevier
摘要：AbstractThe policy iteration algorithm (PIA) is a quasi non-identifier approach of nonlinear optimal control based on a reinforcement learning and iterative algorithm in order to solve the Hamilton-Jacobi-Bellman (HJB) equation. The synthesized state-feedback controller corresponding to the converged solution should be applicable for the control of cardiopulmonary system. In this article, the simulation results for the control of oxygenation were carried out using a simplified first-order model with time delay based on porcine dynamics. The distinctive results of oxygenation control can then be achieved based on the proposed control strategy. In addition, the practical example of water level for interacting three-tank system, which has the nonlinear dynamics similar to that of the oxygenation, was implemented in order to prove the concept of this control scheme.
关键词：Keywordspolicy iteration algorithmoptimal controlreinforcement learningcontrol of oxygenationbiomedical control systemsclosed-loop ventilation