首页    期刊浏览 2024年07月18日 星期四
登录注册

文章基本信息

  • 标题:Statistical Mechanics of On-line Node-perturbation Learning
  • 本地全文:下载
  • 作者:Kazuyuki Hara ; Kentaro Katahira ; Kazuo Okanoya
  • 期刊名称:Information and Media Technologies
  • 电子版ISSN:1881-0896
  • 出版年度:2011
  • 卷号:6
  • 期号:2
  • 页码:352-361
  • DOI:10.11185/imt.6.352
  • 出版社:Information and Media Technologies Editorial Board
  • 摘要:Node-perturbation learning (NP-learning) is a kind of statistical gradient descent algorithm that estimates the gradient of an objective function through application of a small perturbation to the outputs of the network. It can be applied to problems where the objective function is not explicitly formulated, including reinforcement learning. In this paper, we show that node-perturbation learning can be formulated as on-line learning in a linear perceptron with noise, and we can derive the differential equations of order parameters and the generalization error in the same way as for the analysis of learning in a linear perceptron through statistical mechanical methods. From analytical results, we show that cross-talk noise, which originates in the error of the other outputs, increases the generalization error as the output number increases.
国家哲学社会科学文献中心版权所有