首页    期刊浏览 2024年11月27日 星期三
登录注册

文章基本信息

  • 标题:Reinforced Optimal Estimator ⁎
  • 本地全文:下载
  • 作者:Wenhan Cao ; Jianyu Chen ; Jingliang Duan
  • 期刊名称:IFAC PapersOnLine
  • 印刷版ISSN:2405-8963
  • 出版年度:2021
  • 卷号:54
  • 期号:20
  • 页码:366-373
  • DOI:10.1016/j.ifacol.2021.11.201
  • 语种:English
  • 出版社:Elsevier
  • 摘要:AbstractEstimating the state of a stochastic system is a long-lasting issue in the areas of engineering and science. Existing methods either use approximations or yield a high computation burden. In this paper, we propose reinforced optimal estimator (ROE), which is an offline estimator for general nonlinear and non-Gaussian stochastic models. This method solves optimal estimation problems offline, and the learned estimator can be applied online efficiently. Firstly, we demonstrate that minimum variance estimation requires us to solve the estimation problem online, which causes low computation efficiency To overcome this drawback, we propose an infinite horizon optimal estimation problem, called reinforcement estimation problem, to obtain the offline estimator. The time-invariant filter of linear systems is shown as an example to analyze the equivalence between reinforcement estimation problem and minimum variance estimation problem. We show that such equivalence can only be found for linear systems, and the proposed problem formulation actually enables us to find the time-invariant estimator for general nonlinear systems. Then, we propose the ROE algorithm, inspired by reinforcement learning, and develop an actor-critic architecture to find a nearly optimal estimator of the reinforcement estimation problem. The estimator is approximated by recurrent neural networks, which has high online computation efficiency. The convergence is proved using contraction mapping and extended policy improvement theorem. Experiment results on complex nonlinear system estimation problems show that our method achieves higher estimation accuracy and computation efficiency than the unscented Kalman filter and particle filter.
  • 关键词:KeywordsState estimationreinforcement learningminimum variance estimationtime-invariant filterrecurrent neural network
国家哲学社会科学文献中心版权所有