首页    期刊浏览 2024年09月16日 星期一
登录注册

文章基本信息

  • 标题:Competing Bandits: Learning Under Competition
  • 作者:Yishay Mansour ; Aleksandrs Slivkins ; Zhiwei Steven Wu
  • 期刊名称:LIPIcs : Leibniz International Proceedings in Informatics
  • 电子版ISSN:1868-8969
  • 出版年度:2018
  • 卷号:94
  • 页码:48:1-48:27
  • DOI:10.4230/LIPIcs.ITCS.2018.48
  • 出版社:Schloss Dagstuhl -- Leibniz-Zentrum fuer Informatik
  • 摘要:Most modern systems strive to learn from interactions with users, and many engage in exploration: making potentially suboptimal choices for the sake of acquiring new information. We initiate a study of the interplay between exploration and competition--how such systems balance the exploration for learning and the competition for users. Here the users play three distinct roles: they are customers that generate revenue, they are sources of data for learning, and they are self-interested agents which choose among the competing systems. In our model, we consider competition between two multi-armed bandit algorithms faced with the same bandit instance. Users arrive one by one and choose among the two algorithms, so that each algorithm makes progress if and only if it is chosen. We ask whether and to what extent competition incentivizes the adoption of better bandit algorithms. We investigate this issue for several models of user response, as we vary the degree of rationality and competitiveness in the model. Our findings are closely related to the "competition vs. innovation" relationship, a well-studied theme in economics.
  • 关键词:machine learning; game theory; competition; exploration; rationality
Loading...
联系我们|关于我们|网站声明
国家哲学社会科学文献中心版权所有