文章基本信息

标题：Ex ante versus ex post optimal promotion rules: the case of internal promotion.
作者：Waldman, Michael
期刊名称：Economic Inquiry
印刷版ISSN：0095-2583
出版年度：2003
期号：January
语种：English
出版社：Western Economic Association International
摘要：The literature on promotion practices within the firm identifies two distinct roles for a firm's promotion policies. Promotions serve both as a way for the firm to efficiently assign workers to tasks and as a way for the firm to reward prior performance. (1) In this article I explore the time-inconsistency problem that arises as a result of the dual nature of the promotion decision. In particular, I show that the common practice of favoring internal candidates for promotion can be understood as a response by firms to the problem of time inconsistency. (2)
关键词：Employee promotions

Ex ante versus ex post optimal promotion rules: the case of internal promotion.

Waldman, Michael

I. INTRODUCTION

The literature on promotion practices within the firm identifies two distinct roles for a firm's promotion policies. Promotions serve both as a way for the firm to efficiently assign workers to tasks and as a way for the firm to reward prior performance. (1) In this article I explore the time-inconsistency problem that arises as a result of the dual nature of the promotion decision. In particular, I show that the common practice of favoring internal candidates for promotion can be understood as a response by firms to the problem of time inconsistency. (2)

Consider a firm in which promotions serve as a reward for prior expenditures of effort and/or prior investments in human capital. The optimal promotion policy for this firm depends on the time frame from which we view the problem. The ex ante optimal promotion rule is the optimal rule when the problem is viewed from a date prior to young workers deciding to work at this firm. This rule takes into account both the assignment aspects of the promotion decision and that promotions serve as a reward for prior performance. There is a second relevant rule, however, because at the date of the promotion decision the effects of promotion serving as a reward for prior performance are in the past. For example, in a world where the possibility of future promotion serves to increase the effort levels of young workers, by the time of the promotion decision those effort choices have already been made. Hence, the ex post optimal rule only takes into account the assignment aspects of the promotion decision.

The time-inconsistency problem that arises is now clear. Total profits are maximized when the firm follows the ex ante optimal promotion rule, but in the absence of commitment this is not how the firm behaves. At the time of the promotion decision the firm has an incentive to ignore the effects of promotion serving as a reward for prior performance, and instead follow the ex post optimal rule. In other words, the profits of the firm are decreased because at the time of the promotion decision the firm maximizes current profits, and the anticipation of this behavior by workers has deleterious effects on behavior in prior periods.

In this article I explore a specific example of this time-inconsistency problem. Consider a firm's decision concerning whether to fill a managerial position by promotion from within or by hiring an outsider. I show that when confronted with this decision the firm faces the exact situation described. That is, the ex post optimal rule shows no preference for promotion from within over hiring from the outside, whereas the ex ante optimal rule exhibits a preference for promoting from within. The logic here is that the possibility of receiving the higher managerial wage in the future reduces the wage required to attract young workers into the firm, but only the ex ante optimal rule takes this into account. The result is that in the absence of commitment the firm hires from the outside too often. The conclusion is that the establishment of an internal labor market in which the firm favors promotion from within can be understood as a way that the firm avoids this time-inconsistency problem.

This result extends a finding in Malcomson (1984). That article also shows that an internal labor market in which hiring from the outside is restricted can arise in a world where promotion serves as a reward for prior performance. In that work, however, there is no time-inconsistency problem. The reason is that Malcomson assumes workers are homogeneous, and thus at the time of the promotion decision it is (weakly) optimal for the firm to have all promotions be from within. In contrast, I assume that workers are heterogeneous and find that if the firm maximizes ex post profits it lowers total profits. The result is an internal labor market that not only limits hiring from the outside but also constrains the firm to behave in a manner different from that which maximizes profits at the time the promotion decision is made.

Another closely related paper is Chan (1996). He also presents an analysis in which the firm prefers to promote from within, but his argument is different both from that put forth here and from Malcomson's argument. Chan argues that to avoid the high wage associated with promotion from within, a firm will sometimes hire an outsider when the efficient contract calls for promotion of an insider. In turn, he further argues that the firm can avoid the problem by lowering the wage associated with promotion from within and only hiring an outsider when the outsider is much superior to the best internal candidate. Chan's argument is distinctly different than Malcomson's in that Chan assumes heterogeneous workers, and this is an integral part of his argument. The argument is also distinctly different than that put forth here in that there is no production in Chan's model after the promotion decision takes place. Thus, whereas the focus here is on the interplay between optimal incentives and optimal assignment, in Chan 's analysis this is not an issue because there is no assignment aspect of the promotion decision. (3)

The analysis in this article builds on an insight found in Milgrom and Roberts (1988). That study was the first to recognize the time-inconsistency problem that arises as a result of the dual nature of the promotion decision. Milgrom and Roberts (1988) consider a world characterized by influence activities, where influence activities refer to activities that a worker can engage in that are not directly productive but that can influence a firm's beliefs concerning the worker's ability. Their focus is a time-inconsistency problem that arises due to the unproductive nature of these activities. In particular, they show that from an ex post standpoint the firm has an incentive to promote the workers it believes are most productive in the higher-level task, but this creates an incentive for workers in prior periods to expend effort in unproductive influence activities that affect the firm's perceptions concerning who would be best at the higher-level task. As a result, in their analysis the firm sometimes commits t o promote the workers most productive in the lower-level task, rather than the ones it thinks will be most productive in the higher-level task. In other words, as in the current analysis, the firm commits to an assignment of workers to tasks that is ex post inefficient. (4)

As a final introductory point, it is worth noting that there are alternative explanations for why higher levels of a firm's job ladder might be staffed entirely or almost entirely with insiders. For example, this could be due to the presence of specific human capital as first explored by Becker (1962). Or it could be because the firm has better information about the abilities of its own workers and is thus hesitant to fill a high-level position with an outsider for whom it has less precise information. (5) One aspect of the argument presented that is distinctly different than those previous explanations is that, as discussed, the firm does not only refuse to hire outsiders for higher-level positions. The firm also needs to constrain its own behavior to achieve this goal. I come back to this issue in section IV, where I discuss methods of commitment.

The outline for the article is as follows. Section II presents an overlapping-generations model in which the possibility of future promotion serves as an incentive for young workers to provide effort. Section III analyzes the model and shows that time inconsistency can cause the formation of an internal labor market in which the firm always promotes the best internal candidate. Section IV first discusses using a promotion standard for internal promotion (rather than a total prohibition on hiring from the outside), and then discusses methods a firm may employ to bind its own future actions. Section V presents concluding remarks.

II. THE MODEL

Consider an overlapping-generations setting in which each worker lives two periods. In each labor cohort there is a pool of Z risk-neutral workers who discount the future by a factor [beta]. Each worker has associated with him or her a value for a variable that will be called ability and will be denoted [[theta].sub.i] for worker i. It is assumed that [[theta].sub.i] is a random draw from a distribution described by a probability density function h([theta]), where h([theta]) > 0 for all [[theta].sub.L] [less than or equal to] [theta] [less than or equal to] [[theta].sub.H], and h([theta]) = 0 for all [theta] outside this interval. At the beginning of a worker's career, everyone in the economy (including the worker) knows h([theta]) but does not know the worker's actual realization of ability. Each worker must also choose an effort level each period. [e.sub.it] is the effort level choice of worker i in period t, where effort can be either low or high, that is, [e.sub.it] [member of] {[e.sup.l], [e.sup.h]}, and it is assumed [e.sup.h] - [e.sup.l] < [[theta].sub.H] - [[theta].sub.L]. The assumption [e.sup.h] - [e.sup.l] < [[theta].sub.H] - [[theta].sub.L] ensures that there are realizations for an entry-level worker's output such that the firm is uncertain whether the worker chose high or low effort. g([e.sub.it]) is the disutility an individual receives from effort, where g([e.sup.h]) > g([e.sup.l]) = 0 and g([e.sup.h]) < [e.sup.h] - [e.sup.l]. The assumption g([e.sup.h]) < [e.sup.h] - [e.sup.l] ensures that high effort is efficient.

There is a single risk-neutral firm in this economy that also has a discount factor [beta] and is in operation through period T (in section IV I also discuss what happens when the firm is infinitely lived). Each period the firm is in operation it has 2N job slots to fill, 2N < Z. N slots are entry-level positions filled by young workers. If young worker i is in an entry-level position in period t, then the worker's output equals [[theta].sub.i] + [e.sub.it] There are N - 1 skilled positions filled by old workers (a young worker placed in a skilled position produces zero). If old worker i is in a skilled position in period t, then output equals [lambda][[theta].sub.i] + [e.sub.it], where [lambda] > 1. There is a single managerial position, which is also filled by an old worker (as with the skilled positions, a young worker placed in the managerial position produces zero). If old worker i is in the managerial position in period t, then output equals [delta][[theta].sub.i] + [e.sub.it], where [delta] > [lambda]. Finally, the firm does not observe a worker's effort level, and a worker's output is privately observed by the firm as opposed to being publicly observed and verifiable. Note that because a worker's output is privately observed by the firm, contracts where payment is directly contingent on output are not feasible. As a result, old workers employed by the firm choose effort [e.sup.l] rather than [e.sup.h].

A worker who does not work at the firm is self-employed, and a self-employed worker receives utility U. It is assumed that [[theta].sub.L] + [e.sup.l] U. This assumption guarantees that productivity at the firm is sufficiently high that the firm can attract workers and earn positive profits. Note in this specification the firm acquires no information about those who were self-employed in the previous period. As a result, if the firm hires an outsider for the managerial position in period t, then it simply chooses randomly from the pool of old workers who were self-employed in period t - 1. I impose this assumption for tractability reasons. Clearly, a more realistic model would allow the firm to acquire some information about the ability levels of external candidates for the managerial position. My sense is that as long as the firm is limited in terms of the number of outside candidates it can evaluate, allowing the firm to acquire some information about the ability levels of external candidates would not affe ct the qualitative nature of the results.

In the analysis that follows it is assumed that in each period t, 1 [less than or equal to] t [less than or equal to] T - 1, the firm hires young workers by offering a contract that specifies [W.sup.E.sub.t], [W.sup.S.sub.t+1], and [W.sup.M.sub.t+1], where [W.sup.E.sub.t] is the entry-level wage in period t, [W.sup.S.sub.t+1] is the skilled wage in period t + 1, and [W.sup.M.sub.t+1] is the managerial wage in t + 1. Note because it is assumed that workers cannot bind themselves to the firm, a worker always has the option of choosing self-employment when he or she is old. Hence, in equilibrium the firm always specifies wages [W.sup.S.sub.t+1] [greater than or equal to] U and [W.sup.M.sub.t+1] [greater than or equal to]U. (6)

Finally, in addition to specifying wages, the contract offered to young workers in period t also indicates the extent to which the firm can hire outsiders for the managerial position in the following period. I consider two different possibilities for the extent to which the firm can commit its future promotion decisions. In section III it is assumed that commitment is all or nothing. That is, in each period t the firm either makes no commitment or commits to fill the managerial position in period t + 1 with a worker who was in an entry-level position in period t. (7) In section IV I consider the alternative assumption that the firm can commit to a promotion standard. (8)

III. ANALYSIS

The literature on labor market tournaments typically assumes that a firm can bind itself to a set of prizes that it will pay in future periods. The point of the following analysis is that when workers vary in terms of ability, it may also be important for the firm to commit to the manner in which workers will be assigned to jobs in the following period. In particular, extending the analysis of Malcomson (1984), I show that the firm sometimes has an incentive to commit to an assignment rule that limits its own ability to hire an outsider for positions that serve as a reward for prior performance.

The analysis that follows considers two cases. In the first case the managerial wage is the same whether the manager is promoted from within or is hired from the outside. Specifically, in any period t + 1 each type of manager is paid the managerial wage specified in the contract offered to young workers in period t, [W.sup.M.sub.t+1]. This case is useful to analyze because the problem associated with staffing the managerial position with an outsider is very clear in this case. However, in many or possibly even most cases, assuming that the firm is restricted to paying the same wage to each type of manager is probably not a valid assumption. I thus also analyze a second case in which the firm is allowed to pay a manager hired from the outside a different wage than that paid to managers who are promoted from within. (9)

I begin by considering the ex post optimal promotion rule for period t + 1 given that both types of managers are paid the same wage (remember, the ex post optimal promotion rule ignores the effects of the promotion rule on behavior in prior periods). Suppose N = 2, and let [[theta].sup.*] be the expected value for [theta] and [y.sub.jt] denote the output of the worker in the jth entry-level position in period t. At the end of period t the firm observes [y.sub.1t] and [y.sub.2t], and from these two values the firm draws inferences concerning the ability levels of its entry-level workers in period t. Let [[theta].sup.E.sub.jt] denote the expected ability of the jth entry-level worker in period t, that is, [[theta].sup.E.sub.jt] = E([[theta].sub.j]\[y.sub.jt]). We also know that for the young workers who were self-employed in period t, the firm learns nothing concerning the workers' abilities, and thus [[theta].sup.*] is the expected ability in period t + 1 for each of these workers.

Given this information, the ex post optimal promotion rule is simple. There are three cases. First, suppose [[theta].sup.E.sub.1t] > [[theta].sup.*] and [[theta].sup.E.sup.2t] > [[theta].sup.*]. Then in period t + 1 both entry-level workers are retained by the firm, where the managerial position is staffed by the worker for whom expected ability is higher. Second, suppose [[theta].sup.E.sub.1t] > [[theta].sup.*] and [[theta].sup.E.sub.2t] < [[theta].sup.*] (or [[theta].sup.E.sub.1t] < [[theta].sup.*] and [[theta].sup.E.sub.2t] > [[theta].sup.*]). Then in period t + 1 the entry-level worker with the higher expected ability is retained and placed in the managerial position, and the skilled position is filled by a worker who was self-employed in period t. Third, suppose [[theta].sup.E.sub.1t] < [[theta].sup.*] and [[theta].sup.E.sub.2t] < [[theta].sup.*]. Then in period t + 1 both positions are filled with workers who were self-employed in period t. To summarize, from the ex post standpoint profits are maximized w hen the firm hires the workers with the highest expected abilities.

The analysis easily extends to the general case of N entry-level positions. In period t + 1 the firm will retain all the period t entry-level workers whose values for [[theta].sup.E.sub.jt] are above [[theta].sup.*], whereas the firm's remaining old workers will be drawn from the ranks of the period t self-employed. Furthermore, of the old workers it employs in period t + 1, the one assigned to the managerial position is the individual with the highest expected ability. This means that, if none of the entry-level workers in period t have a value for [[theta].sup.E.sub.jt] above [[theta].sup.*], then the managerial position in period t + 1 is filled by an individual who was self-employed in period t. (10)

In the preceding discussion I showed that from an ex post perspective the firm has an incentive to sometimes staff the managerial position with an outsider. I now consider equilibrium behavior, in particular whether the firm ever commits to only staff the managerial position with an insider. The reason the firm might commit to such a rule is that in equilibrium the firm will take an ex ante rather than an ex post perspective. That is, in equilibrium the firm will be concerned with how the promotion rule in one period affects behavior in prior periods.

PROPOSITION 1. Holding everything (including [[theta].sub.L] and [[theta].sub.H]) fixed except h(*), if [[theta].sup.*] - [[theta].sub.L] is sufficiently small, then (i)-(v) characterize equilibrium behavior in periods t and t + 1 for all 1 [less than or equal to] t [less than or equal to] T - 1. (11)

(i) [W.sup.M.sub.t+1] > [W.sup.S.sub.t+1] = U.

(ii) [W.sup.E.sub.t] - g([e.sup.h]) + [beta][(1/N)[W.sup.m.sub.t+1] + ([N - 1]/N)U] = (1+[beta])U.

(iii) Entry-level workers in period t provide high effort, whereas workers in the managerial and skilled positions in periods t and t + 1 provide low effort.

(iv) The firm commits to staff the managerial position in period t + 1 with an individual who was an entry-level worker in period t, and then staffs the position with the period t entry-level worker who produces the highest output.

(v) The skilled positions in period t + 1 are staffed by the remaining period t entry-level workers whose outputs in period t were greater than or equal to [[theta].sup.*] + [e.sup.h], and with workers who were self-employed in period t.

In each period t + 1, the ex post optimal promotion rule staffs the managerial position and the skilled positions with the old workers who have the highest values for expected ability. Proposition 1 tells us that this is not how the firm behaves in equilibrium. If [[theta].sup.*] - [[theta].sub.L] is sufficiently small, then in each period t the firm commits to staff the managerial position in period t + 1 with one of the firm's period t entry-level workers. This means that, even when the best period t entry-level worker has a value for [[theta].sup.E.sub.jt] below [[theta].sup.*], it is still the case that in period t + 1 the managerial position is filled by an insider.

The logic for this result is as follows. In equilibrium, the firm uses the assignment rule in period t + 1 as a way of motivating entry-level workers in period t to provide effort [e.sup.h] rather than [e.sup.l]. In particular, the firm pays a managerial wage in t + 1 higher than what is necessary to attract a worker into the managerial position, and the possibility of receiving the supranormal wage in t + 1 provides the entry-level workers in t with the incentive to choose the high effort level. Given this, consider the wage required to attract young workers in period t into the entry-level position, [W.sup.E.sub.t]. Holding [W.sup.M.sub.t+1] fixed, this wage is negatively related to the probability that an entry-level worker in t will be promoted into the managerial position in t + 1. Hence, to increase this probability and thus reduce the period t entry-level wage, the firm prohibits outsiders from filling the managerial position in t + 1.

A different way of describing the time-inconsistency problem that underlies this result concerns the cost to the firm of paying the prize associated with the managerial position, that is, paying a manager [W.sup.M.sub.t+1] rather than U. From the standpoint of the present discounted value of the firm's profit stream evaluated at any date t, there is no cost to the firm of paying a high wage in period t + 1 to a manager who is promoted from within. The reason is that the entry-level wage in period t adjusts such that over the course of an entry-level worker's lifetime the worker's expected utility exactly equals what he or she would receive in self-employment. In contrast, from the standpoint of the present discounted value of the firm's profit stream evaluated at any date t, there is a cost of paying a high wage in period t + 1 to a manager hired from the outside. The reason is that the entry-level wage in period t does not adjust down in anticipation of a high wage earned by a t + 1 manager hired from the ou tside. The time-inconsistency problem is simply that in deciding in period t + 1 who to promote the firm evaluates promoting from within over hiring from the outside as having equal costs, whereas from the standpoint of period t promoting from within has a distinctly lower cost.

Notice the proposition states that (i)-(v) hold if [[theta].sup.*] - [[theta].sub.L] is sufficiently small. The rationale for this condition concerns the size of the ex post cost of always staffing the managerial position with an insider. When [[theta].sup.*] - [[theta].sub.L] is small, the potential lost productivity in each period t + 1 from not following the ex post optimal promotion rule is also small. The result is that it is optimal for the firm to prohibit outsiders from filling the managerial position. When [[theta].sup.*] - [[theta].sub.L] is large, the potential lost productivity from not following the ex post optimal promotion rule is potentially large, and the result is that it can be optimal for the firm to allow an outsider to fill the managerial position.

Proposition 1 demonstrates that by restricting its own ability to hire from the outside, the firm solves a time-inconsistency problem concerning who fills the managerial position. An assumption in the analysis, however, is that a manager promoted from within is paid the same wage as a manager hired from the outside. Because, as indicated earlier, for many settings this is probably not a valid assumption, a natural question to ask is how the analysis changes when the firm has the option of paying two different managerial wages--one for managers promoted from within and one for managers hired from the outside. I now consider this case.

The following proposition considers what happens when in each period t, 1 [less than or equal to]t [less than or equal to] T - 1, the contract offered to young workers specifies four wages: [W.sup.E.sub.t], [W.sup.S.sub.t+1], [W.sup.M.sub.t+1], and [W.sup.M'.sub.t+1]. [W.sup.M.sub.t+1] now denotes the managerial wage in period t + 1 for a manager who was an entry-level worker in period t, and [W.sup.M'.sub.t+1] denotes the managerial wage in period t + 1 for a manager who was self-employed in period t.

PROPOSITION 2. Suppose that in each period t, 1 [less than or equal to] t [less than or equal to] T - 1, the contract offered to young workers specifies [W.sup.E.sub.t], [W.sup.S.sub.t+1], [W.sup.M.sub.t+1] and [W.sup.M'.sub.t+1] Holding everything (including [[theta].sub.L] and [[theta].sub.H]) fixed except h(*), if (N - 1)g([e.sup.h]) > [beta][delta]([[theta].sub.H] - [[theta].sub.L]) and [[theta].sup.*] - [[theta].sub.L] is sufficiently small, then (i)-(v) of Proposition 1 characterize equilibrium behavior in periods t and t+1 for all 1[less than or equal to] t [less than or equal to] T - 1. (12)

Proposition 2 tells us that even if the firm can specify two managerial wages, for some parameterizations the firm still restricts the managerial position in each period t + 1 to individuals who were entry-level workers in period t. There are now three steps to the argument. First, if the firm specifies a managerial wage for a manager hired from the outside that is below U, then an outsider would never accept it and the outcome would be just as if the firm committed to never hire an outsider for the managerial position (see note 12). Second, in the earlier analysis the firm sometimes committed to never hire an outsider for the managerial position because of a time-inconsistency problem. There was a time-inconsistency problem present because in the absence of a restriction concerning hiring from the outside for the managerial position, the prize associated with promotion was sometimes rewarded to an outsider. This problem, however, is also present when the firm can specify two managerial wages if the firm spec ifies a managerial wage for a manager hired from the outside that is above U and there is a positive probability of hiring from the outside (see again note 12).

Third, the other possibility is that the firm specifies a managerial wage for outsiders that just equals U. This possibility avoids the problem of having a prize for promotion that is sometimes received by an outsider. Nevertheless, this solution does not work if N is large and [[theta].sup.*] - [[theta].sub.L] is small. The problem is that, if [W.sup.M'.sub.t+1] = U and these two conditions hold, then a value for [W.sup.M.sub.t+1] potentially high enough to get entry-level workers in period t to choose [e.sup.h] would cause the firm to always fill the managerial position with an outsider. In other words, the firm would not be able to induce high effort among its entry-level workers because the workers would anticipate a zero probability of receiving the high managerial wage in the future.

As a final point, one might ask whether what we know about real-world promotion practices is consistent or inconsistent with the theoretical arguments presented so far. I have been able to find relatively little hard evidence concerning promotion from within versus hiring an outsider except in the case of CEOs, and even in that case there is not much evidence. I would say what I have found, however, is basically consistent with the argument. For example, one of the basic points is that the firm is more likely to restrict promotions to insiders when there is little potential extra productivity associated with hiring an outsider. Two reasons this could occur are that the firm is large, in which case the best internal candidate is likely to be of high ability, or that the industry is heterogeneous, so that outsiders are unlikely to have the human capital required to be a successful CEO. Both Parrino (1997) and Agrawal et al. (2001) study CEO succession empirically and find results consistent with each prediction.

In summary, in this section I have shown that a firm might want to commit to staff with insiders the position or positions that serve as a reward for prior performance. The logic is that because of time inconsistency, in the absence of such a commitment the firm ignores how its promotion decision in one period affects the entry-level wage in the prior period with the result that the firm hires from the outside too often. I also showed that this result can even arise when the firm has the option of paying two different managerial wages--one for managers promoted from within and one for managers hired from the outside. The logic here is that one potential way to avoid the time-inconsistency problem is to have no prize associated with hiring a manager from the outside. But this can result in an inability on the part of the firm to induce high effort among its entry-level workers because a managerial wage high enough to potentially induce high effort would result in only outsiders being promoted.

IV. DISCUSSION

In the previous section I demonstrated that a firm sometimes maximizes its own profits by establishing an internal labor market in which the managerial position can only be staffed by a worker who was promoted from within. This section covers two issues. First, I discuss how the analysis changes when the firm can commit to a promotion standard for internal promotion rather than only being able to commit to a total prohibition on hiring from the outside. Second, I discuss methods a firm may employ to bind its own future actions.

Promotion Standards

In section III it was assumed that the firm has limited ability to commit to a promotion rule. In particular, in each period t the firm either made no commitment or committed to fill the managerial position in period t + 1 with a worker who was an entry-level worker in period t. In this subsection I discuss how the analysis changes when the firm's commitment ability is more nuanced. That is, I assume the firm is able to commit to a promotion standard rather than only being able to commit to always promote an internal candidate. By a promotion standard I mean the firm commits to an output level such that an internal candidate is promoted in period t + 1 as long as one of the period t entry-level workers produces an output level at least equal to the standard.

I again begin by considering the case in which the managerial wage is the same whether the manager is promoted from within or is hired from the outside. As in section III, in this case the firm will commit to always staff the managerial position with an internal candidate given [[theta].sup.*] - [[theta].sub.L] sufficiently small. That is, the firm will commit to a promotion standard sufficiently low that the best entry-level worker's output is always at least equal to the standard. The logic for this result is the same as the logic for the equivalent result in section III. When [[theta].sup.*] - [[theta].sub.L] is small, the potential lost productivity associated with never hiring an outsider for the managerial position is also small. In turn, because of the savings from the reduced entry-level wage, it is optimal given such a parameterization to always promote an internal candidate.

Despite the similarity discussed, there is an important difference between the analysis of this case in section III and the analysis here. This difference concerns what happens when it is not optimal to always fill the managerial position with an internal candidate. In section III the firm either committed to such a rule or committed to nothing. As a result, if the firm decided not to commit to always promoting an internal candidate because such a commitment is too costly, the firm's promotion decision would simply follow the ex post optimal rule. That is, the firm would only promote an internal candidate when there existed an internal candidate whose expected ability was greater than or equal to [[theta].sup.*].

In contrast, here the firm can commit to a promotion standard. As a result, if the firm decides not to commit to always promote an internal candidate because such a commitment is too costly, the firm's promotion decision would still not follow the ex post optimal rule. Rather, the firm would set a promotion standard such that an outsider would only be hired for the managerial position when the outsider's expected ability is significantly greater than the best internal candidate. The reason is that the reduction in the entry-level wage associated with internal promotion makes it optimal to bias the promotion decision toward insiders even when the promotion of outsiders is allowed.

The other case is that there are two managerial wages-one for managers promoted from within and one for managers hired from the outside. In this case the analysis is quite different from the corresponding analysis in section III. In particular, when the firm can commit to a promotion standard and there are two managerial wages, then the firm offers a contract to young workers that satisfies the following three conditions. First, the managerial wage for a worker promoted from within is high enough to induce high effort among the entry-level workers. Second, the managerial wage for a worker hired from the outside equals U. Third, the promotion standard is set so high that an outsider is hired for the managerial position only when the expected ability of the best entry-level worker is less than [[theta].sup.*].

One interesting aspect of the contract is that as in the previous case the firm uses the promotion standard to reduce the frequency with which an outsider is hired for the managerial position. Given that the managerial wage is higher for workers promoted from within than for outsiders, in the absence of a promotion standard the firm would hire an outsider for the managerial position as long as the outsider's expected ability is not significantly less than the best internal candidate. But this is not efficient from an ex ante standpoint. The reason is that this behavior ignores the fact that there is no ex ante cost to paying the high managerial wage to an insider due to the corresponding reduction in the entry-level wage. Hence, from an ex ante perspective, the firm maximizes its profits by paying a high managerial wage only for workers promoted from within and achieving efficient assignment through the promotion standard.

Methods of Committment

Section III and the previous subsection discussed the idea that a firm may be able to increase its profitability by committing to a promotion rule that limits its own ability to hire from the outside. An interesting issue is how a firm can bind its own future actions. One possibility was suggested by Milgrom and Roberts (1988). They argued that the establishment of a rule-oriented personnel department may be a way that firms bind themselves to what I have called the ex ante optimal promotion rule. In their argument the firm takes the decision of who is to be promoted "out of the hands of decision makers who have an interest in making the best possible assignment and turns it over to someone whose interests are in following the stated policy" (Milgrom and Roberts, 1988, S176).

If the argument captures the role of the personnel department, then we would expect it to behave in a bureaucratic fashion and, in particular, be unresponsive to the views and complaints of line managers. (13) In fact, this seems to be the view within many firms concerning the behavior of the personnel department. Notice that in this interpretation the bureaucratic and unresponsive nature of the personnel department is part of the efficient design of the firm. Without such a department the firm follows the ex post rather than the ex ante optimal promotion rule, with the subsequent result being a decrease in the firm's long-run profitability.

The discussion provides a useful contrast between the explanation put forth here for why firms frequently favor internal candidates and alternate explanations for the phenomenon. As briefly discussed in the introduction, two alternate explanations are that specific human capital is important in performing the tasks associated with promotions and that lack of information concerning outside candidates makes it difficult to hire outsiders for managerial positions. (14) In both explanations the firm promotes the best candidate, but because of lack of skills or lack of information the best candidate is typically an internal one. In other words, one difference between the explanation put forth here and these alternates concerns the efficient design of the firm. The explanation put forth here is consistent with a personnel department that frequently stops the firm from choosing the best candidate, whereas the alternate explanations provide no reason to have such a department.

The other possibility concerning how a firm limits its own ability to hire outsiders is that the firm does not actually bind its future actions but rather achieves the ex ante optimal promotion rule through reputation considerations. It has been assumed that the firm is sufficiently short-lived that it is unable to establish a reputation for following the ex ante optimal rule. Potentially a longer-lived firm, however, could avoid the time-inconsistency problem through the establishment of the appropriate reputation. For example, suppose the firm was in the market for an infinite number of periods rather than only through period T, and that from an ex ante standpoint the firm's optimal behavior was to never hire an outsider for the managerial position. As long as the firm discounted the future sufficiently slowly, standard arguments yield that never hiring an outsider would in fact be an equilibrium even if the firm had no actual ability to commit. (15)

One qualification to the argument is that even if the firm itself is infinitely lived, it is typically the case that many of the decision makers inside the firm have relatively short time horizons. This suggests that due to standard agency problems between the stockholders and the managers making the promotion decisions, even the infinitely lived firm may find it difficult to follow the ex ante optimal promotion rule. (16) In turn, if this is the case, then it is possible that even an infinitely lived firm may find it advantageous to bind its own future actions through something like a rule-oriented personnel department.

One interesting point is that in terms of thinking about these two possible methods of commitment, the importance of the method probably depends on what type of promotion is being considered. At low levels of the job ladder the personnel department probably has significant ability to block promotions, that is, to block the firm from hiring someone from the outside if not hiring an outsider is the optimal ex ante rule. Thus, for promotions at low levels of the job ladder a bureaucratic and unresponsive personnel department is probably a very important commitment device. However, for promotions to high levels of the job ladder the situation is likely quite different. The personnel department at most firms does not have the ability to block the promotion and hiring preferences of high-level senior managers. Thus, for high-level promotions a bureaucratic and unresponsive personnel department is probably not very important as a commitment device. Rather, for such positions, it is the ability for a firm to establi sh a reputation that potentially allows it to achieve the ex ante as opposed to the ex post optimal promotion rule.

V. CONCLUSION

A firm deciding on its promotion practices will be concerned both with the efficient assignment of workers to tasks and with rewarding prior performance. This can lead to a time-inconsistency problem because from an ex ante standpoint firms would want to take into account both aspects of the promotion decision, but from an ex post standpoint it is only assignment that matters. This article explores this time-inconsistency problem and in particular argues that the common practice of favoring internal candidates for promotion can be understood as a response by firms to the problem of time inconsistency.

There are a number of directions in which the analysis in this article could be extended. First, one could investigate the extent to which other promotion practices are the result of time inconsistency. One potential example is the now outlawed practice of mandatory retirement. Lazear (1979) argued that mandatory retirement is a consequence of imperfect monitoring leading to upward-sloping age-earnings profiles. In his argument there is the potential for workers to shirk, and firms employ upward-sloping age-earnings profiles to avoid the shirking problem. Thus mandatory retirement is needed because old workers would not voluntarily retire at the optimal retirement age because at that age the wage exceeds the reservation wage.

In Lazear's argument there is a reason for an age at which the firm has the option of terminating the employment relationship, but there is no reason why the firm would want to constrain its own ability to retain workers beyond a certain age. This is of interest because, as found by Kittner (1977), for many of the workers who were subject to mandatory retirement the practice was binding on both the worker and the firm. That is, there was both an age at which the firm had the option of forcing the worker to retire and an age (sometimes the same age) at which the firm was prohibited from retaining the worker. This suggests that Lazear's argument is not a complete explanation for the use of mandatory retirement, and that at least part of the motivation for its use involved time inconsistency.

Second, in this article I have explored a relatively simple model so that the logic of the argument is clear. It might be worthwhile investigating the basic idea in a more realistic setting to see whether the conclusions of such an analysis match well with what we know concerning real-world promotion practices. There are a number of ways in which the analysis could be made more realistic. For example, I have focused on a single firm where the firm's external candidates for the managerial position are the self-employed from the previous period. A more realistic approach would be to assume competition in the labor market between firms, where a firm's external candidates are the employees of the firm's rivals. Such a model would be more realistic concerning who fills managerial positions when an internal candidate is not chosen and potentially would also make more realistic predictions concerning the relevant pay of those promoted from within and those hired from the outside.

Another possible change concerns why high-level positions are associated with high wages. I have followed most of the tournament literature and assumed that the only reason that higher-level jobs are associated with high wages is that the firm is attempting to induce high effort among its entry-level workers. But from a real-world standpoint there are other reasons why high-level jobs are associated with high wages. For example, the signaling argument originally developed in Waldman (1984b) and Ricart i Costa (1988) makes this prediction. In that argument promoting a worker serves as a positive signal of the worker's ability with the result that promoted workers receive large wage increases to stop them from being bid away by prospective employers. (17) One interesting direction for future research might be to investigate how alternative reasons for why high-level jobs are associated with high wages influence a firm's incentive to favor internal candidates for promotion.

APPENDIX

Proof of Proposition 1. In the proof that follows I assume that [W.sup.S.sub.t+1] is paid both to skilled workers in period t + 1 in who were entry-level workers in period t and to skilled workers in period t + 1 who were self-employed in period t. However, because in equilibrium both types of workers would be paid the same wage even without this assumption, this assumption is not necessary to prove the proposition, but it does significantly simplify the proof. In the proof t also assume that given a contract for entry-level workers in period t such that there are multiple outcomes for the effort choices in period consistent with equilibrium behavior, the outcome that occurs is the one that maximizes the expected number of entry-level workers in period t who choose high effort. This assumption is also not necessary to prove the proposition but significantly simplifies the proof.

Consider first how the firm forms inferences concerning workers' abilities. There are three possibilities. First, if in period the equilibrium is such that each entry-level worker chooses [e.sup.h] with probability one, then [[theta].sup.E.sub.jt] = [y.sub.jt] - [e.sup.h] for all j. Second, if in period t the equilibrium is such that each entry-level worker chooses [e.sup.l] with probability one, then [[theta].sup.E.sub.jt] = [y.sub.jt] - [e.sup.l]. Third, if in period t the equilibrium is such that entry-level worker j chooses [e.sup.h] with probability p and [e.sup.l] with probability (1 - p), then the firm draws its inferences using Bayes's rule.

The first result is that [W.sup.M.sub.t+1] [greater than or equal to] U because the firm will be unable to staff the managerial position in period t+1 if [W.sup.M.sub.t+1] < U. Similarly, we know [W.sup.S.sub.t+1] [greater than or equal to] U. The second result is that because in period t + 1 the wages for the managerial and skilled positions are fixed, in period t + 1 the workers in these positions provide low effort.

Suppose for now that there exists a contract that satisfies (i)-(v). Let X denote the firm's aggregate wage bill for this contract for the young in period t and the old in period t + 1, and let Y denote the firm's expected aggregate output for the young in period t and the old in period t + 1. Given this, we first focus on the family of contracts for which the managerial position in period t + 1 is restricted to period t entry-level workers. We know that in period t + 1 the firm will want to maximize the aggregate output of its managerial and skilled positions. This in combination with the assumption [delta] > [lambda] > 1 yields that the firm will staff the managerial position in period t + 1 with the period t entry-level worker with the highest value for [[theta].sup.E.sub.jt], and the skilled positions will be staffed by a combination of the remaining period t entry level workers with values for [[theta].sup.E.sub.jt] greater than or equal to [[theta].sup.*] (see note 6) and workers who were self-employed i n period t. In turn, given this contract for which all period entry-level workers provide high effort, the condition translates into (iv) and (v).

Continuing to restrict attention to contracts for which the managerial position in period t + 1 is restricted to period t entry-level workers, if the contract is such that all period t entry-level workers provide high effort, it must be the case that either [W.sup.M.sub.t+1] > U or [W.sup.S.sub.t+1] > U for otherwise all entry-level workers in period t would have an incentive to provide low effort. Further, if [W.sup.S.sub.t+1] = U, then the individual rationality constraint for entry-level workers in period t yields (ii). Given this, consider a contract for which [W.sup.S.sub.t+1] > U and each period t entry-level worker chooses [e.sup.h] with probability one. The firm's expected aggregate output for the young in period t and the old in period t + 1 equals Y. Let [DELTA] = [beta]([W.sup.S.sub.t+1] - U) and let c denote the expected number of skilled positions in period t + 1 that will be filled with outsiders. The individual rationality constraint for entry-level workers in period t now yields that the firm' s aggregate wage bill for the young in period t and the old in period t + 1 must be greater than or equal to X + c[DELTA] The reason is that c[DELTA] of the premium associated with being a skilled worker is not going to period t entry-level workers, and thus the aggregate wage bill must rise by at least c[DELTA] to compensate for this. In turn, because productivity is the same and the wage bill is higher, any such contract is dominated by a contract that satisfies (i)-(v).

We now have that among contracts for which the managerial position in period t + 1 is restricted to period t entry-level workers, if we further restrict attention to contracts for which all entry-level workers in period t provide high effort, then the best contract satisfies (i)-(v). Given this, consider a contract in this family in which each period t entry-level worker does not choose [e.sup.h] with probability one. Let c now denote the expected number of entry-level workers in period t who provide low rather than high effort. For this contract the firm's expected aggregate output for the young in period t and the old in period t + 1 is less than or equal to Y - c([e.sup.h] - [e.sup.l]). Furthermore, the individual rationality constraint for entry-level workers in period t now yields that the firm's aggregate wage bill for the young in period t and the old in period t + 1 is greater than or equal to X - cg([e.sup.h]). The reason is that there is a decrease in the expected aggregate disutility of working at the entry-level positions equal to cg([e.sup.h]), and thus the aggregate wage bill potentially decreases by as much as cg([e.sup.h]). In turn, because [e.sup.h] - [e.sup.l] > g([e.sup.h]), any such contract is dominated by a contract that satisfies (i)-(v).

We now have that among contracts for which the managerial position in period t+1 is restricted to period t entry-level workers, the best contract satisfies (i)-(v). Given this, we now consider contracts for which the managerial position in period t + 1 is not restricted to individuals who were entry-level workers in period t. Consider such a contract for which there is a positive probability that at least one entry-level worker in period chooses [e.sup.h]. For a period t entry-level worker to choose [e.sup.h] with positive probability, it must be that either [W.sup.M.sub.t+1] [greater than or equal to] U + (g([e.sup.h])/[beta]) or [W.sup.S.sub.t+1] [greater than or equal to] U + (g([e.sup.h])/[beta]). Also, for such a contract the firm will staff the managerial position with a worker who was self-employed in period t if [[theta].sup.e.sub.jt] < [[theta].sup.*] for all j. Let v be the probability this occurs and let c again be the expected number of entry-level workers in period t who provide low rather than high effort.

Given a contract of the type described, the firm's expected aggregate output for the young in period t and the old in period t + 1 must be strictly less than Y - c([e.sup.h] - [e.sup.l]) + [beta]v[delta]([[theta].sup.*] - [[theta].sub.L]). The reason is that the firm loses in expected productivity c([e.sup.h] - [e.sup.l]) because on average c entry-level workers provide low rather than high effort and gain something less than [beta]v[delta]([[theta].sup.*] - [[theta].sub.L]) because the managerial position can be staffed with a worker who was self-employed in period t. Suppose [W.sup.M.sub.t+1] [greater than or equal to] U + (g([e.sup.h])/[beta]). Given this and the individual rationality constraint for an entry-level worker in period t, the firm's expected aggregate wage bill for the young in period t and the old in period t + 1 must be greater than or equal to X + vg([e.sup.h]) - cg([e.sup.h]). The reason is that on average, at least vg([e.sup.h]) of the premium associated with being the manager in period t + 1 is not going to period t entry-level workers, and thus the aggregate wage bill must rise by vg([e.sup.h]) to compensate for this, whereas there is a decrease in the expected aggregate disutility of working at the entry-level positions equal to cg([e.sup.h]) and the aggregate wage bill decreases by as much as cg([e.sup.h]) to compensate for this. We now have that expected aggregate output minus the expected aggregate wage bill for the young in period t and the old in period t + 1 must be strictly less than Y - X - c([e.sup.h] - [e.sup.l]) + [beta]v[delta]([[theta].sup.*] - [[theta].sub.L]) - vg([e.sup.h]) + cg([e.sup.h]). Given this and [e.sup.h] - [e.sup.l] > g([e.sup.h]), expected aggregate output minus the expected aggregate wage bill for the young in period t and the old in period t + 1 must be strictly less than Y - X + v([beta][delta]([[theta].sup.*] -[[theta].sub.L]) - g([e.sup.h])). But this means that if [[theta].sup.*] - [[theta].sub.L] is sufficiently small, then expected aggregate output minus the expected aggregate wage bill for the young in period t and the old in period t + 1 must be strictly less than Y - X. Suppose [W.sup.S.sub.t+1] [greater than or equal to] U + (g([e.sup.h])/[beta]). The expected number of skilled positions in period t + 1 that are filled by workers who were self-employed in period t is greater than v. Thus we again have that the firm's expected aggregate wage bill for the young in period t and the old in period t + 1 is greater than or equal to X + vg([e.sup.h]) - cg([e.sup.h]). Hence, if [[theta].sup.*] - [[theta].sub.L] is sufficiently small, then a contract that satisfies (i)-(v) dominates the best contract for which the managerial position in period t + 1 is not restricted to individuals who were entry-level workers in period t and there is a positive probability at least one entry-level worker in period t chooses [e.sup.h].

Within the family of contracts for which the managerial position in period t + 1 is not restricted to individuals who were entry-level workers in period t, consider a contract for which there is a probability one that every entry-level worker in period t chooses [e.sup.l]. Let v again be the probability that the firm staffs the managerial position with a worker who was self-employed in period t. The expected aggregate output for the young in period t and the old in period t + 1 must be less than Y - N([e.sup.h] - [e.sup.l]) + [beta]v[delta]([[theta].sup.*] - [[theta].sub.L]). The reason is that the firm loses in expected productivity N([e.sup.h] - [e.sup.l]) because entry-level workers provide low rather than high effort and gain something less than [beta]v[delta]([[theta].sup.*] - [[theta].sub.L]) because the managerial position can be staffed with a worker who was self-employed in period t. Further, from the individual rationality constraint for the entry-level workers in period t we know that the aggregate wage bill for the young in period t and the old in period t + 1 is greater than or equal to X - Ng([e.sup.h]). The reason is that there is a decrease in the expected aggregate disutility of working at the entry-level positions equal to Ng([e.sup.h]), and thus the aggregate wage bill decreases by as much as Ng([e.sup.h]). Given [e.sup.h] - [e.sup.l] > g([e.sup.h]), if [[theta].sup.*] - [[theta].sub.L] is sufficiently small, then -N([e.sup.h] - [e.sup.l]) + [beta]v[delta]([[theta].sup.*] - [[theta].sub.L]) < -Ng([e.sup.h]). Hence, if [[theta].sup.*] - [[theta].sub.L] is sufficiently small, then a contract that satisfies (i)-(v) dominates the best contract for which the managerial position in period t + 1 is not restricted to individuals who were entry-level workers in period t.

Combining the results, we now have that a contract that satisfies (i)-(v) dominates the best contract that does not satisfy (i)-(v). The final step of the proof, therefore, is to demonstrate that there exists a contract that satisfies (i)-(v). Consider a contract that satisfies (i), results in all entry-level workers in period t providing high effort, and contains a commitment by the firm to staff the managerial position in period t+1 with an individual who was an entry-level worker in period t. From above we know that such a contract will also satisfy the remaining parts of (iii), (iv), and (v). Further, given each entry-level worker has a 1/N probability of becoming the manager in the following period and an (N - 1)/N probability of receiving U in the same period, the individual rationality constraint for entry-level workers in period t yields (ii).

The only result left to show is that, given [W.sup.S.sub.t+1] = U and the firm commits to staff the managerial position in period t + 1 with an individual who was an entry-level worker in period t, there exists a value [W.sup.M.sub.t+1] such that it is equilibrium behavior for every entry-level worker in period t to provide high effort. Suppose all entry-level workers in period t provide high effort. We can show that this is consistent with equilibrium if no worker has an incentive to deviate. Consider worker i. If the worker chooses high effort, then the worker's discounted expected utility over the two periods, denoted U([e.sup.h]), is given by (A1).

(A1) U([e.sup.h]) = [W.sup.E.sub.t] - g([e.sup.h])

+[beta][Pr([e.sup.h])[W.sup.M.sub.t+1] + (1 -Pr([e.sup.h]))U],

where Pr([e.sup.h]) is the probability the firm believes that worker i is the highest-ability worker given the worker chooses [e.sup.h]. If the worker chooses low effort, then the worker's discounted expected utility over the two periods, denoted U([e.sup.l]), is given by (A2).

(A2) U([e.sup.l]) = [W.sup.E.sub.t] + [beta][Pr([e.sup.l])[W.sup.M.sub.t+1] + (1 - Pr([e.sup.l]))U],

where Pr([e.sup.l]) is the probability the firm believes that worker i is the highest-ability worker given the worker chooses [e.sup.l]. From earlier we know that given an equilibrium in which each entry-level worker chooses high effort, the firm promotes the worker with the highest output. This yields Pr([e.sup.h]) = 1/N > Pr([e.sup.l]). In turn, (Al) and (A2) now yield U([e.sup.h]) > U([e.sup.l]) if [W.sup.M.sub.t+1] is sufficiently large. (Note: there is no upper bound on [W.sup.M.sub.t+1] because [W.sup.E.sub.t] can always be lowered sufficiently such that (ii) holds.) Hence, there must exist a contract such that (i) through (v) are satisfied.

Proof of Proposition 2. From the proof of Proposition 1, we know that if [[theta].sup.*] - [[theta].sub.L] is sufficiently small and the firm commits to staff the managerial position in period t + 1 with an individual who was an entry-level worker in period t, then (i)-(v) of Proposition 1 must be satisfied. This in turn implies that if an equilibrium contract does not satisfy (i)-(v) of Proposition 1 and [[theta].sup.*] - [[theta].sub.L] is sufficiently small, then the firm must not restrict the managerial position in period t + 1 to an individual who was an entry-level worker in period t.

Given the above, suppose [W.sup.M'.sub.t+1] < U and the firm does not restrict the managerial position in period t + 1 to an individual who was an entry-level worker in period t. Because the managerial wage in period t + 1 for a manager who was self-employed in period t is less than the utility such a worker would receive in self-employment, the firm would not be able to attract into the managerial position in period t + 1 a worker who was self-employed in period t. Thus, it is as if the firm restricted the managerial position in period t + 1 to an individual who was an entry-level worker in period t, and from the proof of Proposition 1 we therefore know that if [[theta].sup.*] - [[theta].sub.L] is sufficiently small then (i)-(v) of Proposition 1 must be satisfied (see note 12).

Suppose the firm does not restrict the managerial position in period t + 1 to an individual who was an entry-level worker in period t but [W.sup.M'.sub.t+1] is set sufficiently high that the firm never hires into the managerial position in period t + 1 a worker who was self-employed in period t. It is again as if the firm restricted the managerial position in period t + 1 to an individual who was an entry-level worker in period t. Thus, the proof of Proposition 1 again implies that if [[theta].sup.*] - [[theta].sub.L] is sufficiently small then (i)-(v) of Proposition 1 must be satisfied (see again note 12).

In the rest of the proof we assume [W.sup.S.sub.t+1] = U. This follows given arguments similar to those presented in the proof of Proposition 1. Suppose [W.sup.M'.sub.t+1] = U and the firm does not restrict the managerial position in period t + 1 to an individual who was an entry-level worker in period t. There are two possibilities. First, the contract could be such that the managerial position in period t + 1 is always staffed with a worker who was self-employed in period t. For such a contract the entry-level workers in period t will provide low rather than high effort, and the expected output of the manager in period t + 1 will be strictly less than if the managerial position in period t + 1 was restricted to an individual who was an entry-level worker in period t. Thus, the expected aggregate output for the young in period t and the old in period t + 1 must be strictly less than Y - N([e.sup.h] - [e.sup.L]). Furthermore, the individual rationality constraint for the entry-level workers in period t yields that the aggregate wage bill for the young in period t and the old in period t + 1 must be greater than or equal to X - Ng([e.sup.h]). Because [e.sup.h] - [e.sup.l] g([e.sup.h]) we have that this contract is dominated by a contract that satisfies (i)-(v) of Proposition 1.

Second, the contract could be such that there is a strictly positive probability that the managerial position in period t + 1 is filled with an individual who was an entry-level worker in period t and a strictly positive probability that the managerial position in period t + 1 is filled with an individual who was self-employed in period t. This means U [less than or equal to] [W.sup.M.sub.t+1] [less than or equal to] U + [delta]([[theta].sub.H] - [[theta].sup.*]). Given (N - 1)g([e.sup.h]) > [beta][delta]([[theta].sub.H] - [[theta].sub.L]), the incentive compatibility constraint for entry-level workers in period t now yields that not all entry-level workers in period t provide high effort. Rather, the expected number of entry-level workers in period t who provide high effort cannot exceed [beta][delta]([[theta].sub.H] - [[theta].sup.*])/g([e.sup.h]) < N - 1. Let c again be the expected number of entry-level workers in period t who provide low rather than high effort. We know c > 1. We now have that the expected aggregate output for the young in period t and the old in period t + 1 must be strictly less than Y - c([e.sup.h] - [e.sup.l]) + [beta]([[theta].sup.*] - [[theta].sup.L]). Further, the individual rationality constraint for the entry-level workers in period t yields that the expected aggregate wage bill for the young in period t and the old in period t + 1 must be greater than or equal to X - cg([e.sup.h]). We now have that expected aggregate output minus the expected aggregate wage bill for the young in period t and the old in period t + 1 must be strictly less than Y - c([e.sup.h] - [e.sup.l]) + [beta]([[theta].sup.*] - [[theta].sup.L]) - X + cg([e.sup.h]). Given this, c > 1, and [e.sup.h] - [e.sup.l] > g([e.sup.h]), we have that expected aggregate output minus the expected aggregate wage bill for the young in period t and the old in period t + 1 must be strictly less than Y - X - ([e.sup.h] - [e.sup.l]) + g([e.sup.h]) + [beta]([[theta].sup.*] - [[theta].sub.L]). But this means that for [[theta].sup.*] - [[theta].sub.L] sufficiently small this contract is dominated by a contract that satisfies (i) - (v) of Proposition 1.

Suppose [W.sup.M'.sub.t+1] > U and the firm does not restrict the managerial position in period t + 1 to an individual who was an entry-level worker in period t. There are again two possibilities. First, the contract could be such that the managerial position in period t + 1 is always staffed with a worker who was self-employed in period t. For such a contract the entry-level workers in period t will provide low rather than high effort and the expected output of the manager in period t + 1 will be strictly less than if the managerial position in period t + 1 was restricted to an individual who was an entry-level worker in period t. Thus, the expected aggregate output for the young in period t and the old in period t + 1 must be strictly less than Y - N([e.sup.h] - [e.sup.l]). Furthermore, the individual rationality constraint for the entry-level workers in period t combined with [W.sup.M'.sub.t+1] > U yields that the aggregate wage bill for the young in period t and the old in period t + 1 must be greater than X - Ng([e.sup.h]). Because [e.sup.h] - [e.sup.l] > g([e.sup.h]) we have that this contract is dominated by a contract that satisfies (i) - (v) of Proposition 1.

Second, the contract could be such that there is a strictly positive probability that the managerial position in period t + 1 is filled with an individual who was an entry-level worker in period t and a strictly positive probability that the managerial position in period t + 1 is filled with an individual who was self-employed in period t. This means [W.sup.M.sub.t+1] [less than or equal to] [W.sup.M'.sub.t+1] + [delta]([[theta].sub.H] - [[theta].sup.*]). Let c again be the expected number of entry-level workers in period t who provide low rather than high effort, and let v again be the probability that in period t + 1 the managerial position is filled by a worker who was self-employed in period t. Suppose [W.sup.M'.sub.t+1] [greater than or equal to] U + [delta]([[theta].sup.*] - [[theta].sub.L]). We know that the expected aggregate output for the young in period t and the old in period t + 1 must be less than Y - c([e.sup.h] - [e.sup.l]) + v[beta][delta]([[theta].sup.*] - [[theta].sub.L]). Furthermore, the individual rationality constraint for the entry-level workers in period t combined with [W.sup.M'.sub.t+1] [greater than or equal to] U + [delta]([[theta].sup.*] - [[theta].sub.L]) yields that the expected aggregate wage bill for the young in period t and the old in period t + 1 must be greater than or equal to X - cg([e.sup.h]) + v[beta][delta]([[theta].sup.*] - [[theta].sub.L]). Because [e.sup.h] - [e.sup.l] > g([e.sup.h]) we have that this contract is dominated by a contract that satisfies (i) - (v) of Proposition 1. Hence, U < [W.sup.M'.sub.t+1] < U + [delta]([[theta].sup.*] - [[theta].sub.L]).

Suppose U < [W.sup.M'.sub.t+1] < U + [delta]([[theta].sup.*] - [[theta].sub.L]). Given [W.sup.M.sub.t+1] [less than or equal to] [W.sup.M'.sub.t+1] + [delta]([[theta].sub.H] - [[theta].sup.*]) and (N - 1)g([e.sup.h]) > [beta][delta]([[theta].sub.H] - [[theta].sub.L]), the incentive compatibility constraint for entry-level workers in period t now yields c > 1. We know that the expected aggregate output for the young in period t and the old in period t + 1 must be less than Y - c([e.sup.h] - [e.sup.l]) + v[beta][delta]([[theta].sup.*] - [[theta].sub.L]). Furthermore, the individual rationality constraint for the entry-level workers in period t yields that the expected aggregate wage bill for the young in period t and the old in period t + 1 must be greater than X - cg([e.sup.h]). We now have that expected aggregate output minus the expected aggregate wage bill for the young in period t and the old in period t + 1 must be strictly less than Y - c([e.sup.h] - [e.sup.l]) + v[beta][delta]([[theta].sup.*] - [[theta].sup.L]) - X + cg([e.sup.h]). Given this, c > 1, and [e.sup.h] - [e.sup.l] > g([e.sup.h]), we have that expected aggregate output minus the expected aggregate wage bill for the young in period t and the old in period t + 1 must be strictly less than Y - X - ([e.sup.h] - [e.sup.l]) + g([e.sup.h]) + v[beta][delta]([[theta].sup.*] - [[theta].sub.L]). But this means that for [[theta].sup.*] - [[theta].sub.L] sufficiently small this contract is dominated by a contract that satisfies (i) - (v) of Proposition 1.

(1.) Papers concerning thc assignment aspects of the promotion decision include Sattinger (1975), Rosen (1978, 1982), Waldman (1984a), and Gibbons and Waldman (1999a). See Sattinger (1993) for a survey. Papers that investigate promotions as a reward for prior performance include Lazear and Rosen (1981), Malcomson (1984), Rosen (1986), Lazear (1989), and Prendergast (1993). Gibbons and Waldman (1999b) survey both topics.

(2.) One piece of evidence that firms typically favor internal candidates for promotion is that between 1984 and 1994 over 85% of the individuals appointed to the CEO position at a Fortune 500 firm were promoted from within. See Chan (1996) for a discussion.

(3.) In a recent paper Tsoulouhas et al. (2001) extend Chan's analysis to try to explain a number of facts concerning CEO succession. Their model is closer than Chan (1996) to the one considered here in that they do allow for a production stage after the promotion decision takes place.

(4.) Another related analysis is Kahn and Huberman (1988). Kahn and Huberman consider a time-inconsistency problem that arises in environments characterized by a single task. In their analysis, the possibility of future wage increases provides an incentive for workers to invest in specific human capital, but a time-inconsistency problem concerning the probability of receiving a future wage increase causes workers to underinvest. The conclusion is that the employment of up-or-out contracts is one way that the firm can avoid this time-inconsistency problem. See Prendergast (1993) and Zabojnik (1998) for alternative approaches to the problem of workers underinvesting in specific human capital.

(5.) See Novos (1992, 1995) for related analyses.

(6.) To simplify the exposition, it is also assumed that if a worker is indifferent between working at the firm and self-employment, the worker chooses the firm, and that if the firm is indifferent between internal and external candidates in staffing either the managerial position or a skilled position, the firm chooses the internal candidate.

(7.) It can be shown that, as long as [delta] - [lambda] is not too large, the firm never has an incentive to commit to fill the skilled positions in period t + 1 only with workers who were in entry-level positions in period t.

(8.) One drawback of the model described concerns the implicit assumption that promotions are used to reward prior performance, rather than bonuses. Although this is consistent with many if not most employment situations, as argued by Baker et al. (1988), there are theoretical reasons why bonuses should be used to reward prior performance and promotions should be used to assign workers to tasks rather than having promotions used for both purposes. Their reasoning is that using promotions to achieve both goals leads the firm to trade off incentives and assignment, and this trade-off can be avoided by using bonuses rather than promotions to reward prior performance. Fairburn and Malcomson (2001) address the Baker et al. (1988) argument. Fairburn and Malcomson allow for the possibility of bribery between a worker and the manager who evaluates the worker's performance. They show that bribery eliminates the ability of the firm to offer incentives through bonus payments, with the result that the firm uses the possi bility of promotion to a higher-paying job as an incentive for effort. Although I do not formally model the bribery process as in Fairburn and Malcomson, the analysis is consistent with their conclusion that incentives are provided through the possibility of future promotions. I could demonstrate the main results of the paper in a Fairburn and Malcomson--type setting, but such an analysis would be quite involved, and the intuition underlying the time-inconsistency problem (which is the focus here) would be more difficult to follow.

(9.) It is not necessary to specify whether in any period t + 1 the firm is restricted to pay the same wage to a skilled worker who was an entry-level worker in period and to a skilled worker who was self-employed in period t. The reason is that in equilibrium the two types of workers are paid the same wage whether or not there exists such a restriction.

(10.) Notice that even in the ex post case the firm will staff the managerial position with an outsider relatively rarely (only when all the entry-level workers in the previous period have a value for [[theta].sup.E.sub.jt] below [[theta].sup.*]). This is consistent with the discussion in the introduction that one reason a firm may staff higher levels of a job ladder entirely or almost entirely with insiders is that the firm has better information about the abilities of its own workers.

(11.) Note that [W.sup.E.sub.t] and [W.sup.M.sub.t+1] are not uniquely defined.

(12.) As was true for Proposition 1, [W.sup.E.sub.t] and [W.sup.M.sub.t+1] are not uniquely defined. Also, rather than directly specifying that the managerial position in period t + 1 is restricted to individuals who were entry-level workers in period t, the contract could simply specify a value for [W.sup.M'.sub.t+1] sufficiently high or low that the firm never fills the managerial position in period t + 1 with an outsider.

(13.) For this practice to solve the firm's time-inconsistency problem it must be that at the date of the actual promotion decision the firm cannot reverse its earlier action and take authority over the decision away from the personnel department. One reason the firm might be unable to reverse itself and take authority away from the personnel department is if the personnel department has specialized knowledge required for the evaluation of candidates. For example, it might be that only the personnel department has the experience necessary to properly interpret a worker's performance evaluations and references.

(14.) See Becker (1962) for an early discussion of the importance of specific human capital, and Novos (1992, 1995) for analyses that show that lack of information concerning outside candidates reduces the number of outsiders that firms hire. The models considered by Novos build on the earlier work of Greenwald (1986).

(15.) As discussed in the previous subsection, the ex ante optimal promotion rule can be that the firm sometimes hires an outsider, but that it does this less often than would maximize current profits. Such a rule would be difficult to achieve through reputation formation because it would be unclear to a future potential employee whether the hiring of an outsider today was consistent or inconsistent with the firm's ex ante optimal rule. My conjecture is that given an infinitely lived firm facing this situation, there exists an equilibrium in which the firm does better than the ex post optimal promotion rule, but there is no equilibrium in which the firm achieves the best ex ante rule. Formal analysis of this case, however, is beyond the scope of this article.

(16.) One might argue that as long as the managers making the promotion decisions own some stock in the firm, then there will be no agency problems between the stockholders and the managers. Stein (1989) shows that this argument is incorrect in a world in which outsiders are imperfectly informed about the actions of the managers and the current stock price serves as a signal of long-run profitability. He finds that in such a world managers behave myopically, that is, they stress current profits more than is optimal from the standpoint of the long-run profitability of the firm.

(17.) Other articles that consider this signaling argument include Waldman (1990), Bernhardt (1995), and Zabojnik and Bernhardt (2001). Zabojnik and Bernhardt's analysis is particularly relevant in that they incorporate the job assignment signaling story into a tournament-style model. However, their analysis does not consider a firm's incentive to favor internal candidates for promotion over external ones due to time inconsistency.

REFERENCES

Agrawal, A., C. Knoeber, and T. Tsoulouhas. "CEO Succession: Handicapping and the Choice Between Insiders and Outsiders." Mimeo, University of Alabama, 2001.

Baker, G., M. Jensen, and K. Murphy. "Compensation and Incentives: Practice vs. Theory." Journal of Finance, 43(3), 1988, 593-616.

Becker, G. S. "Investment in Human Capital: A Theoretical Analysis." Journal of Political Economy, 70(1), 1962, 9-49.

Bernhardt, D. "Strategic Promotion and Compensation." Review of Economic Studies, 62(2), 1995, 315-39.

Chan, W. "External Recruitment versus Internal Promotion." Journal of Labor Economics, 14(4), 1996, 555-70.

Fairburn, J., and J. Malcomson. "Performance, Promotion, and the Peter Principle." Review of Economic Studies, 68(1), 2001, 45-66.

Gibbons, R., and M. Waldman. "A Theory of Wage and Promotion Dynamics Inside Firms." Quarterly Journal of Economics, 114(4), 1999a, 1321-58.

-----. "Careers in Organizations: Theory and Evidence," in Handbook of Labor Economics, vol. 3, edited by O. Ashenfelter and D. Card. Amsterdam: North-Holland, 1999b, 2373-437.

Greenwald, B. "Adverse Selection in the Labour Market." Review of Economic Studies, 53(3), 1986, 325-47.

Kahn, C., and G. Huberman. "Two-Sided Uncertainty and 'Up-or-Out' Contracts." Journal of Labor Economies, 6(4), 1988, 423-44.

Kittner, D. "Forced Retirement: How Common Is It?" Monthly Labor Review, 10(12), 1977, 60-61.

Lazear, E. "Why Is There Mandatory Retirement?" Journal of Political Economy, 87(6), 1979, 1261-84.

-----. "Pay Equality and Industrial Politics." Journal of Political Economy, 97(3), 1989, 561-80.

Lazear, E., and S. Rosen. "Rank-Order Tournaments as Optimum Labor Contracts." Journal of Political Economy, 89(5), 1981, 841-64.

Malcomson, J. "Work Incentives, Hierarchy, and Internal Labor Markets." Journal of Political Economy, 92(3), 1984, 486-507.

Milgrom, P., and J. Roberts. "An Economic Approach to Influence Activities in Organizations." American Journal of Sociology, 94(Supplement), 1988, S154-S179.

Novos, I. E. "Learning by Doing, Adverse Selection and Firm Structure." Journal of Economic Behavior and Organization, 19(1), 1992, 17-39.

-----. "Imperfections in Labor Markets and the Scope of the Firm." International Journal of Industrial Organization, 13(4), 1995, 387-410.

Parrino, R. "CEO Turnover and Outside Succession: A Cross-Sectional Analysis." Journal of Financial Economics, 46(1), 1997, 165-97.

Prendergast, C. "A Theory of 'Yes Men.'" American Economic Review, 83(4), 1993, 757-70.

Ricart i Costa, J. "Managerial Task Assignments and Promotions." Econometrica, 56(2), 1988, 449-66.

Rosen, S. "Substitution and the Division of Labour." Economica, 45(1), 1978, 235-50.

-----. "Authority, Control, and the Distribution of Earnings." Bell Journal of Economics, 13(2), 1982, 311-23.

-----. "Prizes and Incentives in Elimination Tournaments." American Economic Review, 76(4), 1986, 701-15.

Sattinger, M. "Comparative Advantage and the Distribution of Earnings and Abilities." Econometrica, 43(3), 1975, 455-68.

-----. "Assignment Models and the Distribution of Earnings." Journal of Economic Literature, 31(2), 1993, 831-80.

Stein, J. "Efficient Capital Markets, Inefficient Firms: A Model of Myopic Corporate Behavior." Quarterly Journal of Economics, 104(4), 1989, 655-69.

Tsoulouhas, T., C. Knoeber, and A. Agrawal. "Contests to Become CEO." Mimeo, University of Alabama, 2001.

Waldman, M. "Worker Allocation, Hierarchies, and the Wage Distribution." Review of Economic Studies, 51(1), 1984a, 95-109.

-----. "Job Assignments, Signalling, and Efficiency." Rand Journal of Economics, 15(2), 1984b, 255-67.

-----. "Up-or-Out Contracts: A Signaling Perspective." Journal of Labor Economics, 8(2), 1990, 230-50.

Zabojnik, J. "Sales Maximization and Specific Human Capital." Rand Journal of Economics, 29(4), 1998, 790-802.

Zabojnik, J., and D. Bernhardt. "Corporate Tournaments, Human Capital Acquisition, and the Firm Size-Wage Relation." Review of Economic Studies, 68(3), 2001, 693-716.

MICHAEL WALDMAN *

* I would like to thank Dave Butz, Harold Demsetz, Bruce Fallick, Robert Gibbons, Bengt Holmstrom, Robert Hutchens, Paul Milgrom, Olivia Mitchell, Ian Novos, Mark Rebiek, Jean Rosenthal, Ken Sokoloff, two anonymous referees, the editor William S. Neilson, and participants at workshops at Cornell University, UCLA, and the Stanford Institute for Theoretical Economics for helpful comments.

Waldman: Charles H. Dyson Professor in Management and Professor of Economics, Johnson Graduate School of Management, Cornell University, Ithaca, NY 14853. Phone 1-607-255-8631, Fax 1-607-254-4590, E-mail mw46@cornell.edu