首页    期刊浏览 2024年11月23日 星期六
登录注册

文章基本信息

  • 标题:Quantum Multiple Q-Learning
  • 本地全文:下载
  • 作者:Michael Ganger ; Wei Hu
  • 期刊名称:International Journal of Intelligence Science
  • 印刷版ISSN:2163-0283
  • 电子版ISSN:2163-0356
  • 出版年度:2019
  • 卷号:09
  • 期号:01
  • 页码:1-22
  • DOI:10.4236/ijis.2019.91001
  • 出版社:Scientific Research Publishing
  • 摘要:In this paper, a collection of value-based quantum reinforcement learning algorithms are introduced which use Grover’s algorithm to update the policy, which is stored as a superposition of qubits associated with each possible action, and their parameters are explored. These algorithms may be grouped in two classes, one class which uses value functions ( V (s)) and new class which uses action value functions ( Q (s,a)). The new ( Q (s,a)) -based quantum algorithms are found to converge faster than V (s) -based algorithms, and in general the quantum algorithms are found to converge in fewer iterations than their classical counterparts, netting larger returns during training. This is due to fact that the ( Q (s,a)) algorithms are more precise than those based on V (s) , meaning that updates are incorporated into the value function more efficiently. This effect is also enhanced by the observation that the Q (s,a) -based algorithms may be trained with higher learning rates. These algorithms are then extended by adding multiple value functions, which are observed to allow larger learning rates and have improved convergence properties in environments with stochastic rewards, the latter of which is further improved by the probabilistic nature of the quantum algorithms. Finally, the quantum algorithms were found to use less CPU time than their classical counterparts overall, meaning that their benefits may be realized even without a full quantum computer.
  • 关键词:Quantum Computing;Reinforcement Learning;Q-Learning
国家哲学社会科学文献中心版权所有