摘要:The payoff matrix of a finite stage game is realized randomly, and then the stage game is repeated infinitely. The distribution over states of the world (a state corresponds to a payoff matrix) is commonly known, but players do not observe nature’s choice. Over time, they can learn the state in two ways. After each round, each player observes his own realized payoff (which may be stochastic, conditional on the state), and he observes a noisy public signal of the state (whose informativeness may vary with the actions chosen). Actions are perfectly observable. The result is that for any function that maps each state to a payoff vector that is feasible and individually rational in that state, there is a sequential equilibrium in which patient players learn the realized state with arbitrary precision and achieve a payoff close to the one specified for that state. That result extends to the case where there is no public signal, but instead players receive very closely correlated private signals of the vector of realized payoffs.