文章基本信息

标题：Reputation effects in bargaining games.
作者：Park, Eun-Soo
期刊名称：Southern Economic Journal
印刷版ISSN：0038-4038
出版年度：1999
期号：July
语种：English
出版社：Southern Economic Association
摘要：Since the publication of Rubinstein's (1982) alternating-offer bilateral bargaining game with complete information, much of the recent literature on bargaining has been devoted to models with incomplete information. (See Osborne and Rubinstein 1990 for a survey.) One of the most popular approaches in this line of research is to base incomplete information on something tangible about the parties involved, such as uncertainty about the buyer's valuation of the good or the seller's production cost of the good.
关键词：Game theory;Negotiation;Negotiations

Reputation effects in bargaining games.

Park, Eun-Soo

1. Introduction

Since the publication of Rubinstein's (1982) alternating-offer bilateral bargaining game with complete information, much of the recent literature on bargaining has been devoted to models with incomplete information. (See Osborne and Rubinstein 1990 for a survey.) One of the most popular approaches in this line of research is to base incomplete information on something tangible about the parties involved, such as uncertainty about the buyer's valuation of the good or the seller's production cost of the good.

In light of the recent experimental bargaining results (e.g., Ochs and Roth 1989), however, it is not surprising to see a growing view that much of the recent literature on alternating-offer bargaining with incomplete information does not directly address what we see in the experimental bargaining literature. For example, Kreps (1990a) states that

These formulations do not directly address what we see in the experimental literature and what we have conjectured will be found in subsequent experiments. There is no "tangible uncertainty" at work in the Ochs and Roth experiments. Rather, what seems to be at work are the beliefs of one player concerning equity and greed and the assessment of the other player about the beliefs of the first. It is not clear a priori that the forms of incomplete information that have been investigated so far will speak adequately to effects driven by these other sorts of considerations. (p. 569)

Kreps (1990b, pp. 123-8) adheres to this view and consequently suggested an alternatingoffer bargaining game that is perturbed with a "crazy" type, in that there is a small probability that one player might irrationally insist on a particular allocation. He conjectured that players of the crazy type may indeed get that particular allocation, regardless who has which discount rate.

Myerson (1991, pp. 399-403) has expressed the same view; he characterized the equilibrium of an alternating-offer bargaining game in which one player might be a crazy type who irrationally insists on a particular allocation. Myerson's model can be summarized as follows. First, consider the following slight variation of Rubinstein's (1982) alternating-offer bargaining game in the context of splitting a dollar, a variation based on Myerson (1991, pp. 394-9). Player 1 makes an offer in every odd-numbered round and player 2 makes an offer in every even-numbered round until some offer is accepted or the game ends in disagreement. In each round, after the offer is made, the other player can either accept or reject it. At any round when player i makes the offer and player j rejects it, there is probability [p.sub.i] that player i will walk out and the bargaining will end in disagreement. In this scenario, both players get nothing. This game has a unique (subgame perfect) Rubinstein equilibrium. In equilibrium, the game will end at round 1 with the following Pareto efficient allocation:

([p.sub.1]/[p.sub.1] + [p.sub.2] - [p.sub.1][p.sub.2], [p.sub.2] - [p.sub.1][p.sub.2]/[p.sub.1] + [p.sub.2] - [p.sub.1][p.sub.2])

Now suppose that player 2 assesses probability q that, before the game begins, player 1 is replaced by a computer that is programmed to play the following r-insistent strategy: Offer r = ([r.sub.1], [r.sub.2]) every time he is able to make an offer in which [r.sub.1] + [r.sub.2] [less than or equal to] 1 and both [r.sub.1] and [r.sub.2] are between 0 and 1, and reject any offer that leaves him with less than [r.sub.0]. In this variation of the alternating-offer bargaining game, Myerson showed that in equilibrium, the bargaining game is sure to end within the first J(q, r) rounds if player 1 follows the r-insistent strategy. Besides the fact that Myerson proved the existence of such an upper bound in an ingenious way, a more striking feature of Myerson equilibrium is that J(q, r) does not depend on [p.sub.1] or [p.sub.2].(1)

This result is in sharp contrast to that of Rubinstein, since Rubinstein's equilibrium allocation crucially depends on [p.sub.1]/[p.sub.2]. Note that in Rubinstein's model described above, [p.sub.i] measures player i's "strength" in the sense that it is the probability that any offer he makes, if rejected, may be final. The Rubinstein model suggests that players would reach an agreement in which their relative shares crucially depend on their relative strength, [p.sub.i]/[p.sub.j], even if the absolute level of strength of each player, [p.sub.i], is very low. (For example, if [p.sub.1] = 0.004 and [p.sub.2] = 0.001, then in Rubinstein equilibrium, player 1 will get approximately 4/5 and player 2 will get approximately 1/5 of the pie to be shared.) This result suggests that player 1 could never convince anyone that there are some offers that would be impossible for him to accept. However, in real bargaining situations (for instance, as when bargaining over automobile or house prices), we often try to convince each other that we would never accept any offer outside a particular range, regardless of one's relative strength. To understand the importance of such tactics, Myerson's model introduced the incomplete information in the form of r-insistent strategy (that is, the crazy type using the jargon of game theory).

Thus, Myerson argues that if one player can create some doubt in another player's mind that he may have some irrational commitment to some allocation, then he can expect to get an agreement that is close to or better than this allocation, no matter what the ratio [p.sub.1]/[p.sub.2] may be. Thus, in both Kreps' and Myerson's models, the key in determining the allocations of bargaining games is the anticipation of what one can expect to get and what one expects must be given, rather than the institutional details of the game (such as the players' discount rates).

The main theme of this article is that the bargaining protocols, such as infinite time horizon or alternating offers, are not the key to the results of Kreps' or Myerson's type of bargaining game with incomplete information. In particular, we consider a model of a simultaneous-offer bilateral bargaining with a finite time horizon in which one player might have some kind of irrational commitment that compels him to insist on some bargaining allocation.(2) In this variation, we show that, as in the work of Kreps and of Myerson, beliefs of one player concerning equity and greed and the assessment of the other player about the beliefs of the first are the key in determining the allocations of bargaining games. Kreps (1990a) shares the same view when he conjectures, "'Institutional details' concerning who has which discount rate and what is the precise protocol of bargaining will be swamped by expectations as to what one can expect to get and what one expects must be given in order to come to agreement" (p. 568).

The bargaining protocols considered in my model are similar in spirit to the experimental bargaining game conducted in Roth and Schoumaker (1983). In Roth and Schoumaker's experiments, the subjects played a 25-round bilateral (simultaneous-move) bargaining game using a computer interface. In order to study the role of expectations and reputation in bargaining, Roth and Schoumaker conditioned their subjects by having them play the bargaining game against computers for the first 15 rounds; the subjects were not aware of this.(3) The computer was programmed to play the 50-50 split for some subjects and to play the 20-80 split for the others. Then conditioned subjects were mixed to play the bargaining game with each other. The results were not surprising in that expectations and reputation played an important role in determining the allocations: When two subjects who had been conditioned in compatible ways were matched, they played the equilibrium they had learned. When two subjects who had been conditioned in incompatible ways were matched, either money was left on the table or disagreement outcomes were reached.(4)

2. The Model and Theorem

Consider a static two-person bargaining game, (F, v), where F [subset or equal to] [R.sup.2] is the set of feasible payoff allocations and v = ([v.sub.1], [v.sub.2]) [element of] [R.sup.2] is the disagreement point.(5) Assume that F is a compact and convex set and that the Pareto frontier of F is strictly downward sloping. Assume further that there is some allocation y = ([y.sub.1], [y.sub.2]) [element of] F that is strictly better for both players than disagreement, that is, [y.sub.1] [greater than] [v.sub.1] and [y.sub.2] [greater than] [v.sub.2]. Let [m.sub.i] denote the maximum payoff that player i can get, that is,

[Mathematical Expression Omitted].

Now consider a T round (where T is sufficiently large, but finite) dynamic version of (F, v) where in each round, two players simultaneously and independently name a demand.(6) Let [x.sub.1t] denote player 1's demand in round t for t = 1, 2, . . ., T. Also let [x.sub.2t] denote player 2's demand in round t. If these two demands are compatible in the sense that ([x.sub.1t], [x.sub.2t]) [element of] F, then ([x.sub.1t], [x.sub.2t]) is implemented and the game ends. If not, with probability [p.sub.1], player 1 walks away and terminates the bargaining process. Similarly, let [p.sub.2] denote the probability that player 2 walks away and terminates the bargaining process if the two demands are not compatible. If the demands are not compatible or the game ends in disagreement in any round, each player gets nothing. If the two demands are incompatible but the game does not end in disagreement in the current round, the game moves to the next round.

Now suppose that before the game begins, player 2 assesses probability [Epsilon] that player 1 always names the demand of [r.sub.1] whenever he is able to name a demand. Call this player l's strategy [r.sub.1]-insistent strategy. Let [r.sub.2] denote the maximum payoff that player 2 can get when player 1 gets [r.sub.1]. That is,

[r.sub.2] = max {[y.sub.2][where]([r.sub.1], [y.sub.2]) [element of] F}.

Note that [m.sub.2] [greater than or equal to] [r.sub.2]. To simplify the analysis, suppose [v.sub.1] = [v.sub.2] = 0.

THEOREM. In any sequential equilibrium of the game described above, if player 1 follows the [r.sub.1]-insistent strategy, then there exists a constant M([Epsilon],][m.sub.2], [r.sub.2]) depending on [Epsilon], [m.sub.2], and [r.sub.2] but independent of T, [p.sub.1], or [p.sub.2], such that the game is sure to end within the first M([Epsilon], [m.sub.2], [r.sub.2]) rounds. Furthermore, M([Epsilon], [m.sub.2], [r.sub.2]) is the smallest integer n such that

[Epsilon] [greater than or equal to] [([m.sub.2]/[m.sub.2] + [r.sub.2]).sup.n-1.

So the expected payoff to player 1 in equilibrium cannot be less than [r.sub.1][(1 - max{[p.sub.1], [p.sub.2]}).sup.M]. If [p.sub.1] and [p.sub.2] are small, then this lower bound is close to [r.sub.1].

REMARKS. Some remarks are in order. First, note that in the establishment of such an upperbound M, we need to look at equilibria in which player 1 is not observed deviating from the [r.sub.1]-insistent strategy throughout the game, since any deviation from the [r.sub.1]-insistent strategy immediately reveals that player 1 is not a machine. An important point is that if player 1, a rational type, tries to mimic the crazy type, the relative strength of the two players is of little importance. Second, note that Myerson's proof for the alternating-offer model cannot be extended to the current simultaneous-offer model. The reason is that [p.sub.i] is defined slightly differently because of the difference between the simultaneous- and alternating-offer formats. In Myerson's model, at any round when player i makes an offer ([x.sub.1], [x.sub.2]) and player j rejects it, there is a probability [p.sub.i] that player i walks out and the bargaining will end in disagreement. On the other hand, in the current model, in any round, each player names his own demand [x.sub.i], and if ([x.sub.1], [x.sub.2]) [not an element of] F, where F is the set of feasible allocations, then there is a probability [p.sub.i] that player i walks away.

PROOF. Consider the first k rounds and count the time backward from the kth round. So t = k, k - 1, . . ., 1. Denote by [[Mu].sub.t], the probability assessed by player 2 at the beginning of round t that player 1 is an [r.sub.1]-insistent machine, given that player 1 has always played [r.sub.1] previously.

We claim that if

[[Mu].sub.t] [greater than or equal to] f(t) [equivalent to] [([m.sub.2]/[m.sub.2] + [r.sub.2]).sup.t-1],

then player 2 cannot be willing to withstand the [r.sub.1]-insistent strategy for the next t rounds (including this round) by demanding more than [r.sub.2] during these rounds, if the game does not end in disagreement.

To prove this claim, I will use mathematical induction. For t = 1, it is clear that the claim is true. Now fix the integer t and suppose the claim is true for t - 1. For notational simplicity, let [Mu] = [[Mu].sub.t].

Suppose player 1, as a rational player, plans to follow the [r.sub.1]-insistent strategy throughout these t rounds with some positive probability. In particular, let [[Pi].sub.t] be the probability that player 1 plays the [r.sub.1]-insistent strategy as a rational player in this round. Now suppose that player 2 is following a strategy under which she withstands the [r.sub.1]-insistent strategy throughout these t rounds by demanding more than [r.sub.2]. Call this strategy for player 2 [c.sub.2].

If player 2 follows [c.sub.2], then her expected payoff cannot be greater than (1 - [Mu])[[Pi].sub.t](1 - [p.sub.1])(1 - [p.sub.2])[m.sub.2] + (1 - [Mu])(1 - [[Pi].sub.t])[m.sub.2], since the largest probability that bargaining will not end in disagreement at the end of this round is [[Pi].sub.t](1 - [p.sub.1])(1 - [p.sub.2]) and (1 - [Mu])(1 - [[Pi].sub.t])[m.sub.2] is the maximum expected payoff to player 2 when player 1 is not a machine and demands other than [r.sub.1]. On the other hand, if player 2 plays a strategy other than [c.sub.2] against player 1's proposed strategy, then player 2's expected payoff is at least [[Mu].sub.t][r.sub.2] + (1 - [Mu]) [[Pi].sub.t][r.sub.2].

Thus, in order for [c.sub.2] to be optimal, we must have

[Mathematical Expression Omitted],

or equivalently,

[Mathematical Expression Omitted]. (1)

On the other hand, by the induction hypothesis and Bayes' rule, the optimality of [c.sub.2] requires

[Mathematical Expression Omitted],

or equivalently,

[Mathematical Expression Omitted]. (2)

Combining Equation 1 and Equation 2, we have

[MU](1 - f(t - 1))/f(t - 1) [greater than] (1 - [Mu])[m.sub.2] - [Mu][r.sub.2] / [r.sub.2] + [m.sub.2] - [m.sub.2] (1 - [p.sub.1])(1 - [p.sub.2]),

or equivalently,

[Mu] [less than] f(t - 1). [m.sub.2] / [m.sub.2] + [r.sub.2] - [m.sub.2] (1 - [p.sub.2])(1 - f(t - 1)).

Now define g(t) as

[Mathematical Expression Omitted].

So if [Mu] [greater than or equal to] f(t - 1)g(t), then [c.sub.2] cannot be optimal.

Now it can be shown that

g(t) [greater than] [m.sub.2]/[m.sub.2] + [r.sub.2]

Therefore, if

[Mu] [greater than or equal to] f(t - 1). [m.sub.2]/[m.sub.2] + [r.sub.2] = f(t),

then player 2 cannot be willing to withstand the [r.sub.1]-insistent strategy throughout these t rounds. This completes the proof of the claim.

Now consider when t = k. Since [Epsilon] is the prior, if player 1 follows the [r.sub.1]-insistent strategy and if

[Epsilon] [greater than or equal to] [([m.sub.2]/[m.sub.2] + [r.sub.2]).sup.k-1],

then player 2 cannot be willing to withstand the [r.sub.1]-insistent strategy for the first k rounds if the game does not end in disagreement. Let M([Epsilon], [m.sup.2], [r.sub.2]) be the smallest integer k such that the above inequality holds. QED.

Note that as in Myerson's equilibrium discussed earlier, M([Epsilon], [m.sub.2], [r.sub.2]) does not depend on [p.sub.1] or [p.sub.2], and thus player 1 can expect to get an agreement that is close to his desired allocation, no matter what the ratio of the power of commitment of the players.

As a concrete example, consider a T-round (T sufficiently large, but finite) split-a-dollar game with the same dynamic structure as above, where in each round, two players simultaneously and independently name a demand in [0, 1]. For this game,

F = {([x.sub.1], [x.sub.2]) [Epsilon] [[0, 1].sup.2] [where] [x.sub.1] + [x.sub.2] 1}.

[v.sub.1] = [v.sub.2] = 0, and [m.sub.1] = [m.sub.2] = 1. Let [r.sub.1] [Epsilon] (0, 1), [r.sub.2] = 1 - [r.sub.1], and r = ([r.sub.1], [r.sub.2]). For this game, we have the following corollary:

COROLLARY. In any sequential equilibrium of the dynamic split-a-dollar game described above, if player 1 follows the [r.sub.1]-insistent strategy, then there exists a constant M([Epsilon], r) depending on e and r but independent of T, [p.sub.1], or [p.sub.2], such that the game is sure to end within the first M([Epsilon], r) rounds. Furthermore, M([Epsilon], r) is the smallest integer n such that

[Epsilon] [greater than or equal to] [(1/1+ [r.sup.2]).sup.n-1].

For example, when [Epsilon] = 0.1 and player 1 follows the 0.9-insistent strategy, the game will end within the first 25 rounds. Note that M([Epsilon], r) does not depend on [p.sub.1] and [p.sub.2]. Also note the natural monotonicity of the upper bound: M([Epsilon], r) increases in [r.sub.1] and decreases in [Epsilon].

3. Conclusion

Before closing, I discuss a potential experimental test for the results presented in this paper. As noted earlier, Roth and Schoumaker's (1983) experiment results are similar in spirit to the current results in that expectations are an important factor in reaching agreement. However, in Roth and Schoumaker's experiment, they conditioned their subjects by having them play the bargaining game against computers that were programmed to play a fixed split for the first 15 rounds, though subjects were not aware of this. Thus, in their experimental design, the reputation-building behavior of the player who enjoys the uncertainty about his true type cannot be explained.

In this respect, experiments conducted by Camerer and Weigelt (1988) and Neral and Ochs (1992) provide useful insights. In Camerer and Weigelt's experiment, they looked at the reputation-building behavior for a game with one-sided uncertainty. By changing the payoff parameters and the probability of the crazy type, they found that reputation-building behavior played a key role, as the theory predicted. On the other hand, Neral and Ochs pointed out that Camerer and Weigelt changed the two parameters (payoffs and the probability of the crazy type) simultaneously, so the effect of the change in payoff could not be estimated independently of the effect of a change in the probability. Thus, they examined the reputation-building behavior by changing the payoff parameters while holding the probability of the crazy type fixed. In their experiment, they found that while there is a systematic reputation-building behavior in response to changes in payoff parameters, the behavior is not in the direction predicted by the theory.

For a potential experimental test of the results in this paper (or in Myerson 1991), Roth and Schoumaker's experiment could be modified using the insights provided by Camerer-Weigelt's and Neral-Ochs' experiments. For example, we can develop an experiment in which subjects play a two-person (finite-horizon) dynamic bargaining game over a pie of fixed size (say, $10) and one subject (say, player 1) is designated as the one who may be of, say, $7-insistent type with some known probability, [Epsilon]. Also, in order to represent the probability [p.sub.i] that player i walks out after the disagreement, we can specify the rate at which the pie is shrinking for each player by providing the payoff table for each round. In this design, we can control [Epsilon] and [p.sub.i]. The equilibrium prediction from the model in this paper and Myerson's is that for a larger value of [Epsilon], ceteris paribus, the normal type of player 1 will play the insistent strategy with a higher probability in earlier rounds and player 2 will concede earlier. Also, the round by which player 2 concedes would not be sensitive to the changes in the payoff structure, that is, to the changes in the rate at which the pie is shrinking, ceteris paribus. It would be interesting to test these hypotheses in a well-designed experiment.

1 Drawing upon the results from the repeated game theory, one might suspect that this dynamic (not repeated) bargaining game will end within the first several rounds; however, it needs a lot of work to prove the existence of an upper bound that is independent of players' discount rates.

2 See Dekel (1990) for some features of a simultaneous-offer bargaining game that the model of alternating offer fails to capture.

3 See Roth and Schoumaker (1983) for the detailed rules of their bargaining game.

4 Roth and Schoumaker did not report when agreements were reached. The current paper offers the maximum periods that will elapse before agreement is reached. This result cannot be tested with Roth-Schoumaker's experimental results.

5 See, for example, Myerson (1991, chap. 8) for a rigorous introduction to this subject.

6 The same result holds for the infinite horizon.

References

Camerer, C., and K. Weigelt. 1988. Experimental tests of a sequential equilibrium reputation model. Econometrica 56: 1-36.

Dekel, E. 1990. Simultaneous offers and the inefficiency of bargaining: A two-period example. Journal of Economic' Theory 50:300-8.

Kreps, David. 1990a. A Course in Microeconomic Theory. Princeton, NJ: Princeton University Press.

Kreps, David. 1990b. Game Theory and Economic Modelling, Oxford: Oxford University Press.

Myerson, Roger. 1991. Game Theory: Analysis of Conflict. Cambridge, MA: Harvard University Press.

Neral. J., and J. Ochs. 1992. The sequential equilibrium theory of reputation building: A further test. Econometrica 60: 1151-69.

Ochs, J., and A. Roth. 1989. An experimental study of sequential bargaining. American Economic Review 79:355-84.

Osborne, Martin, and Ariel Rubinstein. 1990. Bargaining and Markets. San Diego, CA: Academic Press, San Diego.

Roth, A., and F. Schoumaker. 1983. Expectations and reputations in bargaining: An experimental study. American Economic Review 73:362-72.

Rubinstein, Ariel. 1982. Perfect equilibrium in a bargaining model. Econometrica 50:97-109.