首页    期刊浏览 2024年11月27日 星期三
登录注册

文章基本信息

  • 标题:Finite-Sample Analysis For Decentralized Cooperative Multi-Agent Reinforcement Learning From Batch Data
  • 本地全文:下载
  • 作者:Kaiqing Zhang ; Zhuoran Yang ; Han Liu
  • 期刊名称:IFAC PapersOnLine
  • 印刷版ISSN:2405-8963
  • 出版年度:2020
  • 卷号:53
  • 期号:2
  • 页码:1049-1056
  • DOI:10.1016/j.ifacol.2020.12.1290
  • 语种:English
  • 出版社:Elsevier
  • 摘要:AbstractIn contrast to its great empirical success, theoretical understanding of multi-agent reinforcement learning (MARL) remains largely underdeveloped. As an initial attempt, we provide a finite-sample analysis for decentralized cooperative MARL with networked agents. In particular, we consider a team of cooperative agents connected by a time-varying communication network, with no central controller coordinating them. The goal for each agent is to maximize the long-term return associated with the team-average reward, by communicating only with its neighbors over the network. A batch MARL algorithm is developed for this setting, which can be implemented in a decentralized fashion. We then quantify the estimation errors of the action-value functions obtained from our algorithm, establishing their dependence on the function class, the number of samples in each iteration, and the number of iterations. This work appears to be the first finite-sample analysis for decentralized cooperative MARL from batch data.
  • 关键词:KeywordsReinforcement LearningFinite-Sample AnalysisNetworked SystemsMulti-Agent SystemsDecentralized Optimization
国家哲学社会科学文献中心版权所有