首页    期刊浏览 2024年12月03日 星期二
登录注册

文章基本信息

  • 标题:Transient Response Analysis of Metropolis Learning in Games
  • 本地全文:下载
  • 作者:Hassan Jaleel ; Jeff S. Shamma
  • 期刊名称:IFAC PapersOnLine
  • 印刷版ISSN:2405-8963
  • 出版年度:2017
  • 卷号:50
  • 期号:1
  • 页码:9661-9667
  • DOI:10.1016/j.ifacol.2017.08.1927
  • 语种:English
  • 出版社:Elsevier
  • 摘要:AbstractThe objective of this work is to provide a qualitative description of the transient properties of stochastic learning dynamics like adaptive play, log-linear learning, and Metropolis learning. The solution concept used in these learning dynamics for potential games is that of stochastic stability, which is based on the stationary distribution of the reversible Markov chain representing the learning process. However, time to converge to a stochastically stable state is exponential in the inverse of noise, which limits the use of stochastic stability as an effective solution concept for these dynamics. We propose a complete solution concept that qualitatively describes the state of the system at all times. The proposed concept is prevalent in control systems literature where a solution to a linear or a non-linear system has two parts, transient response and steady state response. Stochastic stability provides the steady state response of stochastic learning rules. In this work, we study its transient properties. Starting from an initial condition, we identify the subsets of the state space called cycles that have small hitting times and long exit times. Over the long time scales, we provide a description of how the distributions over joint action profiles transition from one cycle to another till it reaches the globally optimal state.
  • 关键词:KeywordsLearning theoryStochastic controlgame theorySensor networks
国家哲学社会科学文献中心版权所有