首页    期刊浏览 2024年11月08日 星期五
登录注册

文章基本信息

  • 标题:The political economy of intergenerational income mobility.
  • 作者:Ichino, Andrea ; Karabarbounis, Loukas ; Moretti, Enrico
  • 期刊名称:Economic Inquiry
  • 印刷版ISSN:0095-2583
  • 出版年度:2011
  • 期号:January
  • 语种:English
  • 出版社:Western Economic Association International
  • 摘要:The intergenerational elasticity of income is generally considered one of the best summary measures of the degree to which a society gives equal opportunities of success to all its members, irrespective of their family background. Starting with pioneering work by Solon (1992) and Zimmerman (1992), the economic literature has made important advances on the question of how to measure this parameter using the GaltonBecker-Solon (GBS) regression:
  • 关键词:Endowments;Parent and child;Parent-child relations

The political economy of intergenerational income mobility.


Ichino, Andrea ; Karabarbounis, Loukas ; Moretti, Enrico 等


I. INTRODUCTION

The intergenerational elasticity of income is generally considered one of the best summary measures of the degree to which a society gives equal opportunities of success to all its members, irrespective of their family background. Starting with pioneering work by Solon (1992) and Zimmerman (1992), the economic literature has made important advances on the question of how to measure this parameter using the GaltonBecker-Solon (GBS) regression:

(1) [y.sub.s] = a + [beta][y.sub.f] + [u.sub.s]

where [y.sub.s] is son's log income and [y.sub.f] is father's log income. A lower [beta] denotes a smaller association between father's and son's income and therefore a higher degree of social mobility. As such, a lower [beta] is often interpreted as being a desirable feature of a society.

While we have learned a lot about how to estimate this reduced-form parameter, less progress has been made on understanding its underlying structural determinants. What does [beta] actually measure? Is a lower [beta] necessarily more desirable? Important progress on these questions has been made by Becker and Tomes (1979), who have shown how the intergenerational persistence of income reflects both "nature and nurture." In their model individuals are assigned talent by nature, and parents can add to that talent by privately investing in their children. The intergenerational transmission of income is therefore a combination of exogenous biological factors and endogenous optimizing behavior of parents. However, the Becker and Tomes model generally ignores the role of redistributive policies and their deeper determinants. Redistributive policies have the potential to play an important role in determining how income is transmitted from one generation to the next. For example, public education can significantly affect economic opportunities of individuals who come from disadvantaged socioeconomic backgrounds. At the same time, it can also affect parents' incentives to privately invest in their children human capital, both directly and through the disincentive effect of taxes. More in general, most redistributive policies--including taxation, affirmative action, welfare programs, subsidies that target poor individuals--potentially affect the intergenerational elasticity of income. While some studies have highlighted the role of public policies as a determinant of social mobility, most existing studies take these policies as exogenous.

In this paper we use a model with exogenous talent endowments, endogenous parental investment in children and endogenous redistributive institutions, to identify the structural parameters that govern the intergenerational mobility. Our framework extends the Becker and Tomes framework and clarifies how the interaction between private and collective decisions determines the equilibrium level of social mobility. The model allows for a structural interpretation of the widely studied parameter [beta]. This is important because it allows a better understanding of the deeper politico-economic determinants of intergenerational mobility and the role of public policy. The model also shows how we should interpret and rank differences over time and across countries in [beta]. Since redistributive policies generate a trade-off between insurance and incentives, the optimal [beta] is not necessarily zero for all societies. In addition, international comparisons of intergenerational elasticity of income are shown to be not particularly informative about fairness without taking into account differences in politico-economic institutions. The predictions of the model seem generally consistent with the empirical evidence.

Our framework focuses on how parents transfer economic endowments to their children through private and collective investment in their human capital. Before having children, parents know their own genetic ability but are uncertain about their children's genetic ability. Consistent with Becker and Tomes (1979) and Loury (1981), parents can decide to invest privately in the human capital of their children, given an exogenous degree of transmission of genetic ability. This private investment offsets some of the risk of having low genetic ability, thus reducing the probability that an individual might turn out to have low productivity and therefore low income. Since private investment can offset some but not all of the genetic risk, parents "under the veil of ignorance" have an incentive to collectively create public institutions that provide further insurance against the risk of low genetic ability. A natural example of this type of policy is public education.

We model public education as an insurance system that increases the income of the low talented children, at the expense of lowering the income of the more talented children. We show how and why a more progressive educational policy increases social mobility in equilibrium. The equilibrium level of social mobility depends on the costs and benefits of public education. This trade-off is resolved by two forces: (a) the balance between costly insurance and incentives to privately invest in children's human capital and (b) the political process that aggregates conflicting interests regarding the desired degree of social mobility.

A novel insight of our analysis is to show how political economy forces shape the equilibrium level of social mobility. Even if public education is relatively costless to provide for the average family, it may hurt the interests of the rich dynasties who, in a world of increased social mobility, are more likely to move down the income ladder. As a result, the maximum amount of mobility ([beta] = 0) is not necessarily the equilibrium one, even when public insurance is relatively cheap to provide.

More generally, the model shows that existing differences in [beta] across countries are (at least in part) governed by all those political institutions that affect public education. Therefore, two societies with similar fundamentals (such as the degree of parental altruism, variability in market earnings, degree of biological and cultural transmission of family characteristics, labor market discrimination, asset market incompleteness etc.) may display very different degrees of intergenerational mobility depending on the identity of the politically decisive family.

In the last part of the paper, we use data on a cross section of countries for which reliable estimates of [beta] are available to test the predictions of the model. In general, we find that they are consistent with the empirical evidence. For example, our model predicts that in countries where rich dynasties are more politically active than poor dynasties, social spending for public education should be lower and therefore income mobility should be lower. We find that this appears to be the case in our sample. The difference in the probability of party affiliation between rich and poor appears to be strongly correlated with [beta]. Such difference has five times larger predictive power than the rate of return to education, which is often considered as one of the most prominent determinants of mobility (Solon 1999, 2004; Corak 2006). While causality is obviously unclear, these empirical correlations are at least consistent with our model.

The rest of the paper is organized as follows. Section II discusses the related literature. In Section III we describe the model and examine its positive properties. In Section IV we derive the politico-economic determinants of social mobility and show their relation to the GBS regression. In Section V we present our empirical evidence. Section VI concludes. All omitted derivations are in Appendix A. Appendix B describes the data.

II. RELATED LITERATURE

The objective of our model is to derive the structural politico-economic parameters underlying the intergenerational elasticity of income. This coefficient--[beta] in Equation (1)--has been the main focus of the existing empirical literature: see among others Solon (1992); Zimmerman (1992); Bjorklund and Jantti (1997); Mulligan (1997) and Solon (1999). Our model is also related to a more recent empirical strand of research that examines within-country trends in mobility and compares [beta] over time; see for instance Mazumder (2005, 2007); Lee and Solon (2006); and Aaronson and Mazumder (2008).

Most theoretical work in this area has focused on the role of the genetic transmission of ability, the incentives for parental investment, and the role of the asset market in explaining the intergenerational transmission of income. Our framework builds on the theoretical work of Becker and Tomes (1979), and on extensions of this work by Goldberger (1989); Mulligan (1997); and Solon (2004).

While some studies have highlighted the role of public policies as a determinant of social mobility, most existing studies take these policies as exogenous. Examples of papers that have argued that institutions may be important determinants of mobility, but take these institutions as exogenous include, among others, the original contribution of Becker and Tomes (1979); Glomm and Ravikumar (1992); Checchi, Ichino, and Rustichini (1999); Solon (1999, 2004); Davies, Zhang and Zeng (2005); Mayer and Lopoo (2005); and Hassler, Rodriguez Mora, and Zeira (2007).

In our setting, social mobility depends on public redistributive policies that we model as the outcome of a politico-economic equilibrium. In this sense, our model relates to the equilibrium models of Saint-Paul and Verdier (1993); Alesina and Rodrik (1994); and Persson and Tabellini (1994). These papers show how cross-sectional inequality causes growth, through endogenous public policies. Benabou (1996) further develops this strand of literature and endogenizes the relationship between inequality, social mobility, redistribution and growth as a function of the incompleteness of the financial market. While our model abstracts from (physical) capital accumulation, it emphasizes the endogenous production of human capital (talent) as an intermediate input for the production of final income. Fernandez and Rogerson (1998) analyze a reform from a locally financed to a centralized educational system in a multicommunity model with endogenous choice of location. Relative to their paper, we instead focus on explaining cross country outcomes. In this case, migration becomes a less important determinant of social mobility and differences in political institutions become stronger determinants of social mobility. As in our paper, Bernasconi and Profeta (2007) endogenize institutions in a model with mobility and argue that the politically determined level of public education may reveal the true talent of the children and relax the mismatch of talents to occupations. Relative to this paper, our model includes both economic and political choices.

In a seminal paper, Piketty (1995) explains the emergence of permanent differences in attitudes toward redistribution. Benabou and Ok (2001) show how rational beliefs about one's relative position in the income ladder affect the equilibrium level of redistribution. These papers derive the implications of social mobility for redistributive policies, while we focus on the reverse channel. Specifically, we analyze how endogenously chosen public policies affect the intergenerational mobility.

It is important to note that because the direction of causation in our model differs from the one emphasized in the study of Benabou and Ok (2001), we obtain a different prediction for the relationship between mobility and redistribution in the United States and Europe. In their paper, more mobility is associated with less redistribution because voters who are below the mean oppose redistribution in the rational expectation of income gains in the future. This explanation is intuitive, but cross Atlantic evidence suggests that the United States is less mobile and less redistributive than continental Europe. (1) In our paper, political economy forces that constrain the development of public education also lead to a lower degree of social mobility. Thus, our model predicts a positive correlation between social mobility and redistribution of income across countries.

III. A SIMPLE MODEL OF THE INTERGENERATIONAL TRANSMISSION OF INCOME

We first set up the model and derive the intergenerational transmission equation for income and talent. Then, we derive the first and second moments of income and talent distributions and discuss how these moments evolve in response to more progressive public policies.

A. Setup of the Model

We consider an infinite horizon overlapping generations economy populated by a measure one of dynasties, i [member of] [0, 1]. In each period t = 0, 1, 2, ... two generations are alive, fathers and sons. In each generation, earnings (which we also call "output" or "income" interchangeably) are produced according to the production function [Y.sub.i,t] = f([[micro].sub.t], [[THETA].sub.i.t], [U.sub.i.t). The parameter [[mu].sub.t] represents the public policy; [[THETA].sub.i,t] is father's human capital or basic skill (e.g., IQ) which we call "talent"; and [U.sub.i,t] denotes a random and inelastic factor of production which represents "market luck." Specifically, we assume that the production function is given by:

(2) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

where [[micro].sub.t] [member of] (0, 1] and [alpha] [greater than or equal to] 0.

Figure 1 shows the production function graphically. Public policy and its effects are characterized by two parameters, [[mu].sub.t] and [alpha]. The parameter [[mu].sub.t] characterizes the amount of redistribution in the economy. A lower [[mu].sub.t] implies a more progressive public policy, but also more distortions. This is shown visually in the left panel of Figure 1, where for a given amount of talent and market luck, a lower [[mu].sub.t] is associated with less output for the talented or lucky families, but with more output for the less talented or unlucky families. The most natural example of the public policy represented by [[mu].sub.t] is public education. In Section V we offer evidence in line with this interpretation of [[mu].sub.t]. (2) Henceforth, a lower [[mu].sub.t] is called a more progressive public policy or a more progressive educational system.

The parameter [alpha] characterizes the efficiency of public education. For a given [[mu].sub.t], a higher [alpha] implies that a smaller fraction of talents [[THETA].sub.i,t] gains from progressivity because the system creates disincentives for high talented agents. In the fight panel of Figure 1, the area to the left of the intersection of the production function with the 45[degrees] line measures the gains from progressivity. As [alpha] increases, this area becomes smaller relative to the area to the fight of the intersection of the production function with the diagonal, which measures the efficiency costs of progressivity. (3) Henceforth, a higher [alpha] denotes more distortions.

In each period t the following events take place:

1. Fathers produce output [Y.sub.i,t] according to Equation (2), given the predetermined talent [[THETA].sub.i,t], market luck [U.sub.i,t] and public policy [[mu].sub.t].

2. Fathers choose the policy for their sons, [[mu].sub.t+1], according to the institution or political process P.

3. Sons are born with a random family endowment [V.sub.i, t+1]. The random factor of production [U.sub.i,t+1] is realized.

4. Fathers observe [V.sub.i,t+1] and [U.sub.i,t+1] and choose investment [I.sub.i,t] to maximize the dynastic utility, given resources [Y.sub.i,t]. Investment produces son's talent according to the production function [[THETA].sub.i,t+1] = g([I.sub.t], [h.sub.i] [V.sub.i,t+1]).

5. Fathers die, sons become fathers and the process repeats ad infinitum.

For this section we treat Ix as an exogenous parameter. In Section IV we endogenize it. Son i is born with random family endowment [V.sub.i,t+1], which, following Becker and Tomes (1979), is assumed to follow a "Galtonian" AR(1) process:

(3) [v.sub.i,t+1] = (1 - [[rho].sub.1])[[rho].sub.0] + [[rho].sub.1] + [v.sub.i,t] + [[epsilon].sub.i,t+1]

where v = In V (small caps denote logs of corresponding variables throughout the paper). For every dynasty i, [[epsilon].sub.i,t+1] is a white noise process with expected value E([[epsilon].sub.i,t]) = 0, variance Var([[epsilon].sub.i,t]) = [[sigma].sup.2.sub.v] and zero autocorrelations. We have 0 [less than or equal to] [[rho].sub.1] < 1 and therefore the logarithm of family endowment regresses toward the mean, has stationary expectation E([v.sub.i,t]) = [[rho].sub.0], and has stationary variance Var([v.sub.i,t]) = [[sigma].sup.2.sub.v]/(1 - [[rho].sup.2.sub.1]). The parameter [[rho].sub.1] characterizes the cultural or genetic inheritance of traits related to talent and income, and is assumed identical across families i.

[FIGURE 1 OMITTED]

A second random component is represented by market luck, [U.sub.i,t+1], whose logarithm is a white noise process, has variance [[sigma].sup.2.sub.u], and is independent to [[epsilon].sub.i,t]. The difference between [U.sub.i,t] and [[THETA].sub.i,t] is that the latter is an elastic factor of production. As a result, talent is affected by the inefficiencies associated with the policy [mu].

Fathers care about the quality of their children. They observe [V.sub.i,t+1] and [U.sub.i,t+1] and decide how to allocate their predetermined income [Y.sub.i,t] into consumption [C.sub.i,t] and investment [I.sub.i,t] in order to maximize the dynastic utility:

(4) In [C.sub.i,t] + 1/[gamma] ln [Y.sub.i,t+1]

subject to the budget constraint:

(5) [C.sub.i,t] + [I.sub.i,t] = [Y.sub.i,t]

where [Y.sub.i,t+1] is children's income. (4) The parameter [gamma] > 0 captures the degree of parental altruism, with higher values denoting smaller altruism. Parental investment [I.sub.i,t] can be thought as an private educational input (e.g., tuition fees) that increases a child's talent.

Sons' talent is produced with the following production function:

(6) [[THETA].sub.i,t+1] = ([h.sub.i][V.sub.i,t+1])[I.sub.i,t]

where [h.sub.i] is a family-specific time-invariant ability effect which allows dynasties to be ex ante heterogeneous. This heterogeneity captures long-run differences in market incomes, for instance due to labor market discrimination against certain racial, ethnic, or religious groups. We assume that [h.sub.i] is distributed according to the density function [[PHI].sub.h] with bounded support H [subset] [R.sub.++], and is orthogonal to the disturbances [[epsilon].sub.i,t+1 ]and [u.sub.i,t+1].

B. The Transmission of Income Across Generations

In this Section we restrict attention to steady-state public policies, that is we set [[mu].sub.t+1] = [[mu].sub.t] = [mu] for all t. Under this assumption, income and talent are stochastic processes with well defined and easy to analyze unconditional stationary moments. We generalize our analysis in Section IV, where we endogenize the choice of [mu]. Solving the problem in Equations (4)-(5), using the production functions (2) and (6), and taking logs, we obtain the equation that describes the intergenerational transmission of income in family i:

(7) [y.sub.i,t+1] = [[delta].sub.0,i] + [delta].sub.1][y.sub.i,t] + [[delta].sub.2][v.sub.i,t] + [[delta].sub.3] [u.sub.i,t+1]

where:

(8) [[delta].sub.0,i] = [[delta].sub.0] + [[delta].sub.i]

(9) [[delta].sub.0] = [mu] ln ([mu]/[mu] + y) + [alpha] ln[mu]

(10) [[delta].sub.i] = [mu] ln [h.sub.i]

(11) [[delta].sub.1] = [mu]

(12) [[delta].sub.2] = [mu]

(13) [[delta].sub.3] = [mu]

The intercept [[delta].sub.0,i] can be decomposed into two parts. [[delta].sub.0] is a common effect across all dynasties i, and [[delta].sub.i] is the dynasty-specific time-invariant effect due to [h.sub.i]. Our autoregressive coefficient, [[delta].sub.i], is different from the one described by Becker and Tomes (1979) because we assume multiplicative (in levels) production functions for output and talent. (5) While the previous literature has focused on the role of private incentives for the intergenerational mechanism, our [[delta].sub.1] coefficient emphasizes instead the role of public policies. Specifically, the novel element of our model is that the slope [[delta].sub.i] is collectively decided by the fathers of each dynasty. Therefore, our mechanism maps collective action outcomes to equilibrium levels of intergenerational transmission of income. In the Appendix we present the intergenerational transmission of talent.

C. The Trade-Off Between Equity and Efficiency

Expectations. From Equation (7) we take the unconditional, stationary expectation of income ("long-run income") for family i:

(14) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

for all t. In Equation (14), the expectation is conditioned only on [h.sub.i] to denote the dependency of long-run income on long-run family ability [h.sub.i]. There are four ways through which the public policy [mu] affects long-run income.

1. Distortions in Private Investment: This is captured by the ln ([mu]/[mu]+[gamma]) term. When public policy becomes more progressive (lower [mu]), the marginal propensity to invest in human capital, [mu]/([mu] + [gamma]), is lower and as a result the long-run level of income tends to decline. This effect is identical for every dynasty i.

2. Direct Distortions in Output: This effect is shown in the [alpha]ln [mu] term, and is associated with the shifter [[mu].sup.[alpha]] in the production function for income in Equation (2). The effect of [mu] on output is more adverse when the parameter [alpha] increases.

3. Social Insurance or Benefits of Public Education: The [mu] term that multiplies the bracket in the numerator of Equation (14) captures the exponent of the term [[THETA].sup.u], in Equation (2). For low ability dynasties (low [h.sub.i]), a more progressive public educational system increases long-run income. The opposite happens for sufficiently high ability families. The intuition is shown in Figure 1.

4. Intertemporal Insurance or Social Mobility: This effect is given by the denominator 1 - [mu] and is associated with the slope [[delta].sub.1] of the intergenerational transmission of income in Equation (7). For sufficiently low ability dynasties (low [h.sub.i]), the numerator is negative and the prospect of upward mobility (lower [mu]) increases long-run income. For high ability dynasties, the numerator is positive and increased mobility decreases their long-run income.

We can write father i's conditional (on the state of the system) expectation for son's income as the sum of the long-run level of income in Equation (14) and the transitory deviation of current income and current family endowment from their long-run levels:

(15) [E.sub.t]([y.sub.i,t+1] |[h.sub.i] ) = E([y.sub.i,t+1] | [h.sub.i]) + [mu] ([y.sub.i,t] - E([y.sub.i,t+1] | [h.sub.i])) + [mu][[rho].sub.1] ([v.sub.i,t] - [[rho].sub.0] )

where the time subscript in the left hand side denotes conditioning on the information set as of period t (which is summarized by father's income, [y.sub.i,t], and family endowment, [v.sub.i,t]). As we show in Section IV.A, fathers take into account how progressivity affects this conditional expectation when voting for [mu].

This analysis highlights two important points. First, there is a trade-off between equity and efficiency. Second, there is political conflict over the equilibrium level of social mobility. In particular, as we discuss more formally in Section IV.B, fathers with higher ability [h.sub.i] or with favorable shocks in their market activity, [u.sub.i,t], or in their family endowment, [v.sub.i,t], prefer less progressive policies. It is this heterogeneous effect of [mu] on dynastic welfare that makes the political economy aspect of the model interesting and supports our argument that politico-economic determinants may be significantly associated with mobility outcomes.

Variances. To understand the implication of our model for inequality, we first consider the stationary, unconditional variability that a given dynasty [h.sub.i] faces in its income process. From Equation (7) this is:

(16) Var([y.sub.i,t+1] | [h.sub.i]) = [[mu].sup.2]/1 - [[mu].sup.2] 1 + [[rho].sub.1][mu]/1 - [[rho].sub.1][mu] [[sigma].sup.2.sub.v] /1 - [[rho].sup.2.sub.1]

+ [[mu].sup.2]/1 - [[mu].sup.2] [[sigma].sup.2.sub.u]

Inequality across generations occurs because the disturbances [[epsilon].sub.i,t+1] and [u.sub.i,t+1] have different realizations across time for a given family i. From inspection of Equation (16), we see that a more progressive system (lower [mu]) reduces the variability of income. In addition, it lowers the fraction of variability attributed to family luck [v.sub.i,t+1]. Intuitively, market luck [u.sub.i,t+1] matters only for the final production of income, while family luck [v.sub.i,t+1] affects both the production of talent directly, and the production of final output indirectly (through talent). As a result, more progressive public policies reduce the relative importance of the latter in the intergenerational variance of income.

If all families were identical, then the variance that families face across generations in Equation (16) coincides with the stationary inequality in the cross section of families. More in general, with heterogeneous families, the ex post or cross-sectional variance of income can be decomposed in two parts: (6)

(17) Var([y.sub.i,t+1]) = Var([y.sub.i,t+1] | [h.sub.i]) + Var(E([y.sub.i,t+1] | [h.sub.i]))

The second term in Equation (17) represents the variance "under the veil of ignorance," which from Equation (14) equals:

(18) Var(E([y.sub.i,t+1] |[h.sub.i])) = [[mu].sup.2]Var(ln [h.sub.i])/[(1 - [mu]).sup.2]

To summarize, in Equation (17) the stationary total inequality in the cross section of families is decomposed into the dynastic variability in the process for income--common to all families i--and the inequality that arises because heterogeneous families have different levels of long-run income. It is immediate to see that a more progressive educational system reduces all inequalities. In the Appendix we also discuss the variance of talent.

Covariances. Consider now the intergenerational correlation of income. This summary statistic is what the literature calls social mobility, inequality across generations or "equality of opportunity." Conditioning on [h.sub.i], we distinguish the intergenerational correlation of income within family, Corr([y.sub.i,t+1], [y.sub.i,t] |[h.sub.i]), from the correlation we may observe in the data when families are heterogeneous, Corr([y.sub.i,t+1], [y.sub.i,t]). The latter is discussed in Section IV.C in relation to the GBS regression. Consider the time series of output and talent for some family i with time-invariant ability level [h.sub.i]. Given that we are in a stationary state with Var([y.sub.i,t+1] |[h.sub.i] ) = Var([y.sub.i,t] | [h.sub.i]), we can derive the dynastic intergenerational correlation of income:

(19) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

For talent, the correlation Corr([[theta].sub.i,t+1], [[theta].sub.i,t] |[h.sub.i]) is given in Appendix A.

D. Summary

In Proposition 1 we summarize how a more progressive public policy (lower [mu]) affects the moments of income and talent.

PROPOSITION 1. Effects of Progressivity on Income and Talent: In any stationary state, with a time-invariant public policy 0 < [[mu].sub.t+1] = [[mu].sub.t] = [mu] [less than or equal to]< 1 we have:

1. A more progressive system (lower [mu]) decreases/increases long-run income and talent for sufficiently high/low [h.sub.i] families. A more progressive system favors families with temporarily, low output, [y.sub.i,t] < E([y.sub.i,t+1] |[h.sub.i]), and it favors families with temporarily low family endowment, [v.sub.i,t] < [[rho].sub.0].

2. The dynastic variance of income, Var ([y.sub.i,t+1] |[h.sub.i]), and the dynastic variance of talent, Var([[theta].sub.i,t+1] |[h.sub.i]), are increasing in [mu]. Var( [y.sub.i,t +1] |[h.sub.i]) / Var([[theta].sub.i,t+1] |[h.sub.i]), that is, the intra-family ratio of intergenerational inequalities, is bounded above by 1, and is increasing in [mu].

3. The cross-sectional inequality of income Var([y.sub.i,t+1]) and the cross-sectional inequality of talent Var([[theta].sub.i,t+1]) increase in [mu]. Their ratio is bounded above by 1 and also increases in [mu].

4. The dynastic intergenerational correlation of income Corr([y.sub.i,t+1], [y.sub.i,t] |[h.sub.i]) is increasing in [mu]. The ratio Corr([y.sub.i,t+1], [y.sub.i,t] |[h.sub.i])/Corr([[theta].sub.i,t+1], [[theta].sub.i,t] |[h.sub.i]) is smaller than 1, and increases in [mu].

Proposition 1 shows how a more progressive public policy decreases the dynastic and cross-sectional inequalities of income and talent, and also decreases the within-dynasty intergenerational correlation of income. These two predictions are consistent with the general equilibrium effects of educational subsidies as derived recently by Hassler, Rodriguez Mora, and Zeira (2007). They also tend to imply a positive comovement of the cross-sectional and the intergenerational inequality, as discussed by Solon (2004). Finally, our model predicts that in a society with no public policy ([mu] = 1), the ratio of variances and intergenerational correlations of income over talent take their maximum value (unity). As public policy becomes more progressive these ratios decrease. Intuitively, when the progressivity of public education increases, a given amount of variation in the production of talent across time or across families matters less for final earnings in the market. (7) In Section V we offer some evidence in line with this prediction.

IV. THE POLITICAL ECONOMY OF SOCIAL MOBILITY

First, we define the politico-economic equilibrium. Then, we derive the equilibrium choice of the public policy [mu] in terms of deeper political, economic, cultural, and genetic parameters. Finally, we show the relationship between the equilibrium level of [mu] and the slope of the GBS regression, [beta].

A. Politico-Economic Equilibrium

In period t, father i observes and takes as given the realization of last period's output, [y.sub.i,t], and endowment, [v.sub.i,t]. However, fathers do not know the realization of children's endowment [v.sub.i,t+1] and market luck [u.sub.i,t+1] before they vote for [[mu].sub.t+1] and they need to form rational expectations. Father i's preferences over public policies [[mu].sub.t+1] are ordered according to the conditional expectation of Equation (4):

(20) W([[mu].sub.t+1]; [h.sub.i], [y.sub.i,t], [v.sub.i,t], s) = ln [C.sub.i,t] + 1/[gamma] [E.sub.t]([y.sub.i,t+1] |[h.sub.i])

where s is the vector of structural parameters, and the conditional expectation, [E.sub.t] ([y.sub.i,t+1] |[h.sub.i]), is given by Equation (15). [C.sub.i,t] is the optimal level of consumption:

(21) [C.sub.i,t] = [gamma]/[[mu].sub.t+1] + [gamma] [Y.sub.i,t]

which is a function of the public policy. Note that we reinstate the time subscript in [mu].

An important simplification for deriving the equilibrium in our model is that sons are born after fathers have chosen the public policy [[mu].sub.t+1]. As a result, sons do not affect the choice of [mu]. Under this assumption, preferences of fathers over current policies are independent of future policies, and there is no need to explicitly consider dynasties' expectations about future policy outcomes. (8) This assumption is intuitive in the context of intergenerational mobility. As we discuss in Section V in a cross section of OECD countries, it is public spending on education--rather than other forms of government activity--that strongly correlates with social mobility. Since public education is regarded as highly redistributive at the primary level, that is, before sons' political rights are extended, our assumption captures this realistic feature of the intergenerational transmission.

The policy that maximizes Equation (20) is called the "most preferred policy for dynasty i":

(22) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

The most preferred policy for every father reflects various trade-offs. First, it reflects the four channels that affect the long-run value of income in Section III.C. In addition, transitory deviations from long-run income and transitory deviations from long-run family endowment also affect the most preferred policy, as shown in Equation (15). Finally, public policy allocates resources intertemporally and creates a trade-off across generations. The consumption-investment ratio for every father is [gamma]/[[mu].sub.t+1]. A less progressive system (higher [[mu].sub.t+1]) distorts less the incentive of parents to privately invest in their children talent and therefore when [[mu].sub.t+1] decreases parents transfer more resources to the next generation.

To solve the model we define a relevant family-specific summary of the system which we call "income potential," [Q.sub.i,t]. Income potential therefore summarizes the history of all relevant market and family shocks. Our functional form assumptions--log preferences and multiplicative production functions--imply that income potential for family i at time t is the log-sum of three terms: life-long ability level In [h.sub.i], current log income, [y.sub.i,t], plus a term proportional to log family endowment, [v.sub.i,t].

(23) [Q.sub.i,t] = ln [h.sub.i] + [y.sub.i,t] + [[rho].sub.1] [v.sub.i,t]

PROPOSITION 2. Preferences over Public Policy:

1. Induced preferences over [[mu].sub.i,t+1] as described by W(*) in Equation (20) are single-peaked if (but not only if) [alpha] > 1 for any [Q.sub.i,t].

2. The most preferred policy [[mu].sub.i,t+1] is strictly increasing in [Q.sub.i,t].

The first part of the Proposition establishes a sufficient condition for the indirect utility W to be single-peaked. The second part shows that families with higher income potential prefer less progressive public policies. Families with high income potential may be families from advantaged groups (high [h.sub.i]) or families that face favorable economic ([y.sub.i,t] > E([y.sub.i,t+1] | [h.sub.i])) or cultural ([v.sub.i,t] > [[rho].sub.0]) shocks. Therefore, in our model families from disadvantaged social groups (low [h.sub.i]) may still prefer less progressive public policies, if their last generations experienced good luck in the market or in the production of talent.

Because transitory shocks affect preferences for public policies, in general the equilibrium policy will not be time invariant, as assumed for simplicity in Section III. The easiest but most restrictive way to proceed is to assume a precommitment institution in which the initial generation of fathers observe {[y.sub.i,0], [v.sub.i,0], [h.sub.i]} and choose once and for all a time-invariant system [mu], which by assumption remains active in all future periods. A second possibility is to consider the stochastic steady state of the model, in which the distribution of income potentials in the population is stationary. In this case, the optimal [mu] remains constant in time, but the identity of the decisive family is allowed to vary, since in the steady-state families are hit by different market and family shocks. Under both these cases, the analysis for the long-run moments in Section III applies, and the time-invariant coefficients for the stochastic processes are given by the optimal stationary [mu]. Finally, we can apply our comparative statics to the most general case, when the dynastic variance and the public policy depend on calendar time along the transitional dynamics in a period-by-period decision making process. Under this setting, the equilibrium public policy (yet to be defined) will in general depend on the current state [Q.sub.i,t] of the decisive father. (9)

Let the distribution of income potential in the cross section of dynasties at time t be [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]. We define the political institution in terms of the equilibrium outcome that it implies.

DEFINITION 1. Institution P: An institution P results in the public policy [[mu].sup.e.sub.t+1] mostly preferred by the dynasty in the 100pth percentile of the income potential distribution [[PHI].sub.t], that is, the family with an income potential such that P = [[PHI].sub.t]([Q.sub.i,t]). We denote the decisive dynasty as [Q.sub.p,t].

Our definition encompasses some commonly used institutions, both in the optimal policy and in the political economy literature. Let the average income potential be [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]. Then if p = [[PHI].sub.t]([[bar.Q].sub.t]), one obtains the utilitarian social rule that maximizes the welfare of the average father or the welfare "behind the veil of ignorance" for [Q.sub.i]:

(24) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

In reality, however, public policies are determined by the aggregation of known, conflicting political interests. The leading choice in the political economy literature is the one person, one vote democratic institution. If [alpha] > 1, then by Proposition 2 induced preferences over policies are single-peaked. As a result, the father with the median most preferred policy is the decisive voter. By the second part of the same Proposition, this is the father with the median income potential, [Q.sub.50,t]. Note that this formulation allows both the identity and the income or the family endowment of the decisive father to vary over time. Since the median father's vote is decisive, it follows that p = 1/2 is the unique equilibrium outcome of the pure majority rule game (i.e., the Condorcet winner).

More in general, we can allow for p > 1/2, capturing campaign contributions or more active political participation of the rich fathers. Alternatively, a higher p may parameterize the ideologically diverse preferences for parties of the poor fathers, as in the probabilistic voting model. If p < 1/2, then social preferences are averse to inequality and can be thought to internalize the ex ante variance given in Equation (18). From a political economy point of view, a lower p may capture the bargaining power of socialist parties or labor organizations in unionized economies. In the limit, p = 0 leads to the "Rawlsian institution" that maximizes the welfare of the least well-off dynasty. Henceforth, we parameterize political preferences with p. In Section V we show how to measure this key parameter in the data.

B. Politico-Economic Determinants of Social Mobility

Given this definition, the properties of the equilibrium level of the public policy, [[mu].sup.e.sub.t+1], are given in the following Proposition.

PROPOSITION 3. Equilibrium Public Policy: The equilibrium policy [[mu].sup.e.sub.t+1] is increasing both in [alpha] and in p. It increases in [h.sub.p], [y.sub.p,t], [v.sub.p,t] and in [[rho].sub.0], it decreases in [gamma], and it does not depend on [[sigma].sup.2.sub.v] and [[sigma].sup.2.sub.u]. It increases in [[rho].sub.1] if and only if [v.sub.i,p] - [[rho].sub.0] > 0.

This Proposition shows how public education becomes less progressive (higher [[mu].sub.t+1]) when output costs [alpha] increase, but more progressive as the position of the decisive dynasty in the income potential distribution p decreases. Our result shows that, as long as optimally chosen public policies have the potential to affect intergenerational mobility (which in our model is shown in Section IV.C), there is no reason to expect that a collective action of fathers transmits a perfectly mobile society to their sons ([[mu].sup.e] = [beta] = 0). It is important to note that for the refusal of this proposition, one would need to show both that the costs of progressive public policies are negligible and that institutions favor the low ability families. This is an interesting point, because empirically it may be difficult to find evidence for the magnitude of [alpha] or in reality some public reforms may entail small efficiency costs (Lindert 2004). On the other hand, a recent strand of research in political economy points out that various politico-economic outcomes can be simply explained by the fact that rich families have a larger "say" in the political equilibrium, that is, that the political system is wealth-biased (Benabou 1996; Campante 2007; Alesina and Giuliano 2009; Karabarbounis 2010).

What is the novelty of our results? Most of the existing literature following the initial Becket and Tomes (1979) contribution has attributed to the reduced-form coefficient in Equation (1) a specific meaning for social mobility, namely that equality of opportunity is desirable. (10) If equality of opportunity is however costly for private incentives, more of it is not necessarily desirable. (11) Relative to these views, our model emphasizes--in addition to standard incentive costs--political economy constraints that may further limit or enhance the extent of social mobility. For instance, in our model perfect social mobility may be optimal under a utilitarian institution (if [alpha] is very small), but not politically sustainable if rich families and business interests restrict the development of the welfare state and the provision of public education (i.e., if p is sufficiently high). To put it differently, two societies with similar dynastic fundamentals may display very different degrees of intergenerational mobility depending on which is the decisive dynasty selected by the existing political institutions.

The politico-economic trade-off behind our model can be conceptualized by a decline in the position of the decisive voter p. Societies in which families with lower income potential have a larger "say" for the equilibrium outcome, choose more progressive systems, expect higher mobility and lower inequality. However, progressivity results in a lower long-run level of income for sufficiently high ability families, and may even lower average income. (12) In our model, if the distribution of income potential [[PHI].sub.t] is fight skewed ([Q.sub.50] < [bar.Q])--perhaps because the ability distribution [[PHI].sub.h] is skewed--then a majority voting of fathers chooses a more progressive public policy relative to the utilitarian optimum. Holding average income potential [bar.Q] constant, an increase in the (right) skewness of the distribution of income potentials, leads the majority of fathers to demand more progressive policies and higher social mobility.

Interestingly, the effects of a higher ex ante inequality in abilities, Var(ln h), due for instance to market discrimination against ethnically or racially diverse groups, depend on the political process p. If p is low, then higher Var(ln h) could be associated with more skewness and hence a poorer decisive voter which results in more progressive policies. On the other hand, if de facto political power is ultimately related to income potential and hence p is relatively high, a higher ex ante variability could be associated with more powerful elites, less progressivity and lower social mobility. In Section V we offer some suggestive evidence in favor of the second effect. In the Appendix we discuss in more detail the intuition behind the other comparative statics of our model.

C. Structural Politico-Economic Interpretation of the Galton-Becker-Solon (GBS) Regression

Our theoretical framework offers a structural interpretation for the log-linear intergenerational earnings model which is estimated in the empirical literature cited in Section II. The literature typically focuses on the GBS regression:

(25) [y.sub.i,t+1] = a + [beta][y.sub.i,t] + [[epsilon].sub.i]

where [y.sub.t+1] and [y.sub.t] denote son's and father's lifelong log earnings in the population. Previous models have recognized that [beta] is a function of genetic and cultural inheritance, altruism, technological parameters and the structure of the asset market. However, we show that this coefficient also depends on political economy variables which determine the institutions that a generation puts in place to insure its offspring from adverse shocks.

PROPOSITION 4. Population Slope of the GBS Regression: The slope in the population regression of son's on father's income, [beta], also known as the intergenerational elasticity of income is given as follows. (13)

1. If the economy is in a stationary state with [[mu].sup.e.sub.t+1] = [[mu].sup.e.sub.t] = [[mu].sup.e], then the intergenerational elasticity equals the intergenerational correlation of incomes and is given by:

(26) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

where the variance in the denominator refers to the cross sectional variance in Equation (17) and [[mu].sup.e] is the equilibrium public policy defined in Proposition 3. The intergenerational elasticity [beta] increases in [[mu].sup.e] and in p.

2. If the economy is for a long time in the steady-state [[mu].sup.e.sub.t] = [[mu].sup.e.sub.t-1] = ..., but in t + 1 an unexpected structural break in the political institution p happens, then the intergenerational elasticity is given by:

(27) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

In this case [[beta].sub.t+1]/[[beta].sub.t] = [[mu].sup.3.sub.t+1]/[[mu].sup.e.sub.t] and the ratio is increasing in [p.sub.t+1]/[p.sub.t].

The first part of the proposition refers to the special case in which the economy is in a steady state with constant intergenerational mobility (the coefficient [beta]) and cross-sectional variances. The second part considers instead the case in which a political shock at time t determines a change of the decisive dynasty such that intergenerational mobility changes with respect to previous periods ([[beta].sub.t+1][not equal to][[beta].sub.t-1] for i [greater than or equal to] 0) and cross-sectional income variances may differ across generations. In principle, analogous formulas can be obtained for other shocks affecting the intergenerational elasticity of incomes and cross-sectional variances, but given the focus of this paper here we study the case of a political shock.

Under the assumption that the advanced economies for which an estimate of [beta] is available are essentially characterized by a fairly similar set of economic and biological fundamentals, differences in the estimated [beta] for these countries should correlate with differences in the dynasty that has decisive power in the political process. To put it differently, if economic and biological fundamentals are more similar than political equilibria across these advanced economies, we should observe more mobility in countries in which the position of the decisive dynasty is lower in the hierarchy of dynastic income potentials. The empirical exercise in the next section should be interpreted as a suggestive assessment of the extent to which political economy variables that proxy for the decisive dynasty capture the cross-country variation in [beta].

However, Equations (26) and (27) emphasize also other more traditional determinants that might explain the cross-country variability in [beta]. For example, in steady state and for given decisive dynasty, social mobility increases ([beta] decreases) with market luck variability (higher [[sigma].sup.2.sub.u]), and decreases with ex ante heterogeneity (higher Var(ln [h.sub.i])). It decreases with output costs (higher [alpha]), with the ability of the decisive family (higher [Q.sub.p]), with the long-run family endowment (higher [[rho].sub.0]) and with the degree of altruism (higher [gamma]). Greater market variability increases cross-sectional inequality and makes the position of children highly uncertain, thereby increasing social mobility. For the same reason, the comparative static with respect to [[sigma].sup.2.sub.v] and [[rho].sub.1] is theoretically ambiguous, [alpha], p, [h.sub.p], [[rho].sub.0], and [gamma] affect social mobility indirectly, through the equilibrium level of [mu] (see Appendix for these comparative statics). Finally, note that our model predicts that ex ante heterogeneity Var(ln[h.sub.i]) affects positively [beta] only conditional on Ix. Higher ex ante heterogeneity may operate also indirectly through public policy, and it may increase (if it is associated with smaller p) or decrease (under higher p) social mobility.

V. EMPIRICAL EVIDENCE ON THE POLITICO-ECONOMIC DETERMINANTS OF MOBILITY

In this section we turn to the empirical evidence on the predictions of the model. Specifically, we present evidence on the relationship between political variables that our model indicates as important determinants of social mobility and observed measures of mobility across countries or within a country over time. We stress that this evidence needs to be interpreted only as suggestive and descriptive. The number of countries for which we have data is limited, and the available data are not sufficiently informative to identify causal relationships. Nevertheless, the evidence is generally consistent with the predictions of our model and in particular, it supports a positive cross-country and within-country correlation between proxies for p and estimates of [beta].

We consider first an interesting case study which represents a salient example of a political shock as described in Equation (27). Over the past few decades, the United Kingdom has experienced a tremendous decline in social mobility. In particular, Blanden, Gregg and Macmillan (2007) document a 50% decline in social mobility between the 1958 and the 1970 cohorts. Such decline has generated widespread concern among the public and prompted the government in 2009 to issue a White Book that addresses the causes and implications of the decline in mobility. Blanden, Gregg and, Macmillan (2007) argue that the main cause of the decline is represented by changes in educational attainment of different income groups.

Their evidence is consistent with our model. However, our model goes further and indicates that educational policies are likely to be an endogenous outcome. According to our framework, the ultimate determinant of the decline in social mobility should be a change in societal preferences for redistribution. Indeed, this prediction is consistent with the sharp change in political preferences which led Margaret Thatcher to become Prime Minister in 1979. The Thatcher revolution was caused by a clear move toward the fight by the UK electorate, as indicated by the fact that public expenditure for education fell, the power of the unions declined, regressive value added taxes (VAT) increased, and more progressive corporate and income taxes declined. (14)

Turning to cross-country evidence, credible estimates of [beta] are available only for a limited number of countries. We use estimates from Corak's (2006) meta-analysis conducted for nine OECD countries and complement these with three more observations. In the Appendix we discuss more in detail the construction of our dataset and the sources. (15)

In Figure 2, the vertical axis shows estimates of [beta] for a cross section of advanced democratic OECD countries. Consistent with what has long been documented in the existing literature on mobility, the United Kingdom, the United States, and France are the least mobile, while Northern European countries appear the most mobile. Canada is the most mobile Anglo-Saxon country, and Sweden is the least mobile among the Nordic countries. The existing literature has mostly focused on the left panel of the Figure (e.g., Corak 2006), which shows a positive bivariate association between [beta] and the private return to schooling. The right panel, which is more novel, depicts a negative association between [beta] and public expenditure on education. The Figure shows that the correlation between social mobility and public expenditures for education is at least as strong as the correlation between the private internal return to education and mobility. (16) When we divide public expenditure in education per student as a percentage of per capita GDP at the primary, secondary and tertiary levels, we find that all are negatively correlated with [beta]. Notably and consistent with our model, the correlation is stronger at the primary level, where public expenditures are arguably more redistributive. (17)

[FIGURE 2 OMITTED]

To obtain a direct measure of political preferences (the parameter p in the model), we use data from the World Value Surveys (WVS). (18) We focus on the differential in political participation between low income voters and middle and upper income voters. The income classification follows the WVS and is standardized by country. As Table 1 shows, on average, around 33% of the population is classified as "poor" (low income). Variation across countries is not large. (19) Political participation can be measured with a variety of variables. In Figure 3 we measure political participation with membership in political parties. The vertical axis in the figure measures inequality in party affiliation, defined as the fraction of middle and upper income voters who are members of political parties divided by the fraction of low income voters who are members of political parties. A lower value for this index denotes a relatively more politically active class of low income families and hence a lower p. Note that we are not interested in the political participation of the poor per se, but in their participation relative to that one of other income groups in the same country. Our measure of relative participation therefore holds constant other country-specific factors that may affect political participation.

[FIGURE 3 OMITTED]

The correlations in Figure 3 are consistent with the model. The bivariate correlations of the political inequality index with public spending and intergenerational elasticity are, respectively, -0.49 and 0.79. (20) When we use an alternative measure of the gap in political participation that compares participation by high income voters to participation by low income voters (thus excluding middle income voters), the correlation is even stronger.

The existing literature has argued that one of the most important empirical determinants of social mobility is the rate of return to human capital (see e.g., Solon 1999 and 2004; Corak 2006). When we regress [beta] on estimates of the return to schooling, we find that the return to schooling explains only 8% of the cross-country variation. Notably, and consistent with our model, our measure of inequality between rich and poor families in political affiliation explains 42% of the variation in social mobility. (21) We have repeated this exercise with four other measures of political participation: participation in labor unions, interest in politics, signing petitions, and participating in lawful demonstrations. We find that the patterns are similar to those presented, with the bivariate correlations ranging from 0.43 to 0.63. (Results available upon request.)

In Figure 4, we investigate the relationship between the degree of heterogeneity in a society, public education, and social mobility. Our model predicts that higher ex ante heterogeneity (higher Var(ln[h.sub.i])) should be associated with more public spending and therefore higher social mobility if p is low. If p is high, more heterogeneity should be associated with higher talent ([h.sub.p]) for the decisive family, and less progressivity. Our empirical proxy for heterogeneity is an index of ethnolinguistic fragmentation measured in 1961. (22) The upper left panel shows that more diverse countries are associated with less public spending on education. Our model explains this positive correlation only if p is relatively high, which as discussed above is consistent with recent theoretical and empirical literature. The bottom panel shows that the predicted link between heterogeneity and mobility is also supported by the data. The bivariate correlation is 0.26. Excluding the very heterogeneous and mobile Canada, the correlation increases to 0.67.

Another prediction of the model has to do with the strength of cultural transmission [[rho].sub.1]. As a proxy, we use an index of weak family ties. (23) Weaker family ties proxy for a lower [[rho].sub.1] in our model. In Figure 4, weaker family ties are associated with more public provision of education and more mobility. This lends support to the view that strong family ties and strong social policies are substitutes.

[FIGURE 4 OMITTED]

We conclude with a final piece of evidence. Becker and Tomes' (1979) original contribution aimed at explaining within a unified economic model the degree of cross-sectional inequality, and its relation with intergenerational inequality. We proxy for cross-sectional inequality in earnings, Var([y.sub.i,t]), with the Gini coefficient for gross earnings. The variance in talent or skills, Var([[theta].sub.i,t]), is proxied by the Gini coefficient for factor income. (24) In our sample the bivariate association between cross-sectional gross earnings inequality and intergenerational inequality is around 0.72. Within the context of our model, market variability, [[sigma].sup.2.sub.u], explains the lack of perfect correlation. Higher variability increases cross-sectional inequality to a degree that ultimately raises social mobility. (25)

Proposition 1 implies that the ratio of gross earnings over factor inequality should decline when the progressivity of the educational system increases ([mu] decreases). Figure 5 shows a strong association between the ratio of the Gini coefficients and public expenditure in education. It also shows the direct relationship between the deeper determinant p and the ratio of inequalities Var ([y.sub.i.t])/Var([[theta].sub.i,t]) that can rationalize this association. In particular, our model predicts that in societies where the poor participate more in political parties, redistributive public education takes place and therefore the ratio of income over talent inequality decreases. The right panel of the figure is consistent with this prediction.

[FIGURE 5 OMITTED]

VI. CONCLUSION

Intergenerational mobility emerges from "nature," "nurture," and endogenous public policies. While the previous literature has derived social mobility as a function of the optimizing behavior of utility-maximizing families, in this paper we generalize the structural log-linear social mobility model and endogenize the political process that aggregates conflicting preferences for intergenerational mobility.

Our model provides a structural interpretation of the widely studied GBS reduced-form coefficient [beta]. This is important because it allows a better understanding of the deeper economic determinants of intergenerational mobility and the role of public policy. We show that public policies generate a trade-off between insurance and incentives. Our model adds to this knowledge by pointing out that even if insurance is relatively costless to provide, a less than perfectly mobile society is possible because of political economy constraints in a world of heterogeneous interests. In other words, two societies may have the same set of dynastic fundamentals such as parental altruism, level of GDP, asset markets, ethnic fragmentation, and cultural traits, but different political institutions, in which case social mobility outcomes will differ.

We conclude with some empirical evidence that lends support to our claim that politico-economic variables are likely to be important determinants of cross-country differences in social mobility.

ABBREVIATIONS

ELF: Ethnolinguistic Fractionalization

GBS: Galton-Becker-Solon

GDP: Gross Domestic Product

VAT: Value Added Tax

WVS: World Value Surveys

doi: 10.1111/j.1465-7295.2010.00320.x

APPENDIX A: DERIVATIONS AND PROOFS

Derivation of Income and Talent Transmission Equations

First, forward the production function for output, Equation (2), one period and solve for [[THETA].sub.i,t+1]:

(A1) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

Substitute Equation (A1) into the production function for talent, Equation (6), and solve for investment:

(A2) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

If we insert this equation into the budget constraint, [C.sub.i,t] = [Y.sub.i,t] - [I.sub.i,t], we see that the budget is concave for [[mu].sub.t+1][less than or equal to] 1, strictly when [[mu].sub.t+1] < 1. Since the utility function (4) is strictly concave, the solution to the problem is unique and interior and is characterized by the first-order condition:

(A3) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

Substituting [C.sub.i,t] back in the budget constraint, we take the solution for children's income:

(A4) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

Taking logs and letting [[mu].sub.t+1] = [mu] in Equation (A4) yields the income transition Equation (7) in the text, for the coefficients defined in Equations (8)-(I3). From Equation (A2) and the budget constraint we can also take the solution for investment and consumption:

(A5) [I.sub.i,t] = ([[mu].sub.t+1]/[[mu].sub.t+1] + [gamma])[Y.sub.i,t]

(A6) [C.sub.i,t] = ([gamma]/[[mu].sub.t+1] + [gamma]) [Y.sub.i,t]

To derive the intergenerational transmission equation for talent, we first substitute the production function (2) into the solution (A4). This yields a relationship between sons' income and fathers' talent:

(A7) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

Next, substitute Equation (A7) into Equation (Al) to obtain the solution for talent:

(A8) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

Taking logs and setting [[mu].sub.t+1] = [[mu].sub.t] = [mu] gives the transmission equation for talent:

(A9) [[theta].sub.i,t+1] = [[lambda].sub.0,i] + [[lambda].sub.1][[theta].sub.i,t] + [[lambda].sub.2][v.sub.i,t+1] + [[lambda].sub.3][u.sub.i,t]

where:

(A10) [[lambda].sub.0,i] = [[lambda].sub.0] + [[lambda].sub.i]

(A11) [[lambda].sub.0] = ln ([mu]/[mu] + [gamma]) + [alpha] ln [mu]

(A12) [[lambda].sub.i] = ln[h.sub.i]

(A13) [[lambda].sub.1] = [mu]

(A14) [[lambda].sub.2] = 1

(Al5) [[lambda].sub.3] = [mu]

The talent transmission equation differs from the income transmission equation due to the coefficients [[lambda].sub.2] and [[lambda].sub.i] (as opposed to the coefficients [[delta].sub.2] and [[delta].sub.i] in the text). These coefficients measure the effects of cultural and genetic endowment on talent and output, respectively. For the case of talent, these effects do not depend on [mu], since public policies are imposed on final output.

Expected Income and Talent

First we show that given a stationary [mu], income and talent are stationary processes. Subtracting [[rho].sub.1][y.sub.i,t] from both sides of the income transmission equation (7), using the definition for [v.sub.i,t+1] in (3), and substituting in the resulting expression the fact that [[rho].sub.1]([[delta].sub.2][v.sub.i,t] - [y.sub.i,t]) = -[[rho].sub.1] ([[delta].sub.0,i] + [[delta].sub.1] [y.sub.i,t-1] + [[delta].sub.3][u.sub.i,t]), we can express the income process in Equation (7) as the sum of an ARMA(2,1 ) process plus an independent white noise:

(Al6) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

The process is stationary if the roots of the characteristic equation, 1 - ([[delta].sub.1] + [[rho].sub.1])x - (-[[delta].sub.1][[rho].sub.1])[x.sup.2] = 0, lie outside the unit circle. The two roots are given by [[phi].sub.1] = -1/[[rho].sub.1] and [[phi].sub.2] = -1/[[delta].sub.1] = 1/[mu]. Therefore, the log income process is stationary for every family i, if [rho] < 1 and [mu] < 1. A similar reasoning applies for the talent process.

The unconditional expectation of log income for family i in Equation (14) in the text is easy to compute by setting E([y.sub.i,t+1] ) = E([y.sub.i,t]) = E([y.sub.i,t-1]) in Equation (A16) or (7). All comparative statics for this expectation follow from inspection. From the talent transmission equation (A9), we take the unconditional expectation of log talent for family i:

(Al7) E([[theta].sub.i,t+1]|[h.sub.1]) = [[rho].sub.0] + ln([h.sub.i][mu]/[mu]+[gamma])+[alpha] ln [mu]/1 - [mu]

From the income transmission equation (7) we can compute the conditional expectation of income:

(Al8) [E.sub.t]([y.sub.i,t+1]|[h.sub.i]) = E([y.sub.i,t+1]|[h.sub.i]) + [mu]([y.sub.i,t] - E([y.sub.i,t+1]|[h.sub.i])) +[mu][[rho].sub.1]([v.sub.i,t] - [[rho].sub.0])

where the state of the system includes {[y.sub.i,t], [[theta].sub.i,t], [v.sub.i,t], [u.sub.i,t]}, and E([y.sub.i,t+1][h.sub.i]) is the unconditional expectation given in Equation (14). Similarly for talent we have:

(A19) [E.sub.t]([[theta].sub.i,t+1]|[h.sub.i]) = E([[theta].sub.i,t+1]|[h.sub.i]) + [mu]([[theta].sub.i,t] - E([[theta].sub.i,t+1]|[h.sub.i])) + [[rho].sub.1]([v.sub.i,t] - [[rho].sub.0])

Variance of Income and Talent

To derive the unconditional, stationary variance Var ([y.sub.i,t+1]|[h.sub.i]) for dynasty i, we impose stationarity in Equation (7) and recall that [u.sub.i,t+1] is independent from [v.sub.i,t+1] and [y.sub.i,t]:

(A20) (1 - [[mu].sup.2])Var([y.sub.i,t+1]|[h.sub.i]) = [[mu].sup.2]Var([v.sub.i,t+1])

+2[[mu].sup.2]Cov([y.sub.i,t], [v.sub.i,t+1]|[h.sub.i])

+[[mu].sup.2]Var([u.sub.i,t+1])

To compute the covariance term, we use the stationarity of the process, the properties of [[epsilon].sub.i,t+1] and the properties of the covariance to take:

(A21) Cov([y.sub.i,t], [v.sub.i,t+1]|[h.sub.i]) = [[rho].sub.1][mu][[sigma].sup.2.sub.v]/(1 - [[rho].sub.1][mu])(1 - [[rho].sup.2.sub.1])

Substituting Equation (A21) into Equation (A20), using the definitions of the variances for [v.sub.i,t+1] and [u.sub.i,t+1] and rearranging we obtain the expression given in the text, Equation (16). The same reasoning yields the variance of talent for family i:

(A22) Var([[theta].sub.i,t+1]|[h.sub.i]) = 1/1 - [[mu].sup.2] 1 + [[rho].sub.1][mu]/1 - [[rho].sub.1][mu][[sigma].sup.2.sub.v]/1 - [[rho].sup.2.sub.1] + [[mu].sup.2]/1 - [[mu].sup.2] [[sigma]/sup.2.sub.u]

which is also increasing in [mu]. Taking the ratio of income's over talent's variance we obtain:

(A23) Var([y.sub.i,t]|[h.sub.i])/Var([[theta].sub.i,t]/[h.sub.i] = [kappa] + [[sigma].sup.2.sub.u]/[kappa]/[[mu].sup.2] + [[sigma].sup.2.sub.u]

where we define:

[kappa]([mu], [[rho].sub.1]) = 1 + [[rho].sub.1][mu]/1 - [[rho].sub.1][mu] [[sigma].sup.2.sub.v]/1 - [[rho].sup.2.sub.1]

Because [mu] < 1, the denominator exceeds the numerator in Equation (A23), and the ratio is smaller than unity as claimed in Proposition 1. Next we prove the claim in Proposition 1 that this ratio is increasing in [mu]. The derivative of the ratio with respect to [mu] is proportional to:

(A24) [[sigma].sup.2.sub.u] [[[kappa].sub.1](1 - 1/[[mu].sup.2]) + 2 [kappa]/[[mu].sup.3]] + 2 [[kappa].sup.2]/[[mu].sup.3]

where [[kappa].sub.1] is the derivative of K with respect to It. If the first term in Equation (A24) is positive, then our claim is proven. After some algebra, the sufficient condition reads as:

(A25) g([mu], [[rho].sub.1]) = [mu]([[mu].sup.2] - 1 - [mu][[rho].sup.2.sub.1]) > - 1

Because the function g has minimum at -1, ([[rho].sub.1] = 1 and [mu] = 1), the sufficient condition holds and the claim is proven.

Finally, we consider the inequality in the cross section of families. From Equation (16) it is obvious that Var([y.sub.i,t+1]) increases in [mu]. For talent we have:

(A26) Var([[theta].sub.i,t+1]) = Var([[theta].sub.i,t+1]|[h.sub.i]) + 1/[(1 - [mu]).sup.2]Var(ln[h.sub.i])

where the first term in the right hand side of this equation is given by Equation (A22), and the last term equals the variance of the unconditional expectation of talent (the variance of Equation (Al7)). It is straightforward to see that Var([[theta].sub.i,t+1]) also increases in [mu]. From Equations (16) and (A26), consider the ratio of income over talent inequality in the cross section of families:

(A27) Var([y.sub.i,t])/Var([[theta].sub.i,j]) = [kappa] + [[sigma].sup.2.sub.u] + 1+[mu]/1-[mu] Var(ln[h.sub.i])/[kappa]/[[mu].sup.2] + [[sigma].sup.2.sub.u] + 1/[[mu].sup.2] 1+[mu]/1-[mu] Var(ln [h.sub.i])

where [kappa] is defined above. To prove the claim in Proposition 1 that this ratio also increases in [mu], let us define [tau] = 1+[mu]/1-[mu], with [tau]' = 2[tau]/1-[[mu].sup.2]. Then after some tedious but straightforward algebra, the partial derivative of Equation (A27) with respect to ix is proportional to the following term:

(A28) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

The first two terms of this expression are positive, as shown in Equations (A24) and (A25). The term 2[tau]Var(ln[h.sub.i])(2[kappa] + [tau]Var(ln[h.sub.i]))/[[mu].sup.3] is also positive. Therefore, after factoring out the term [[sigma].sup.2.sub.u]Var(ln[h.sub.i]), it suffices to show that:

(A29) [tau]'(1 - 1/[[mu].sup.2]) + 2 [tau]/[[mu].sup.3] > 0

Plugging in the definitions of [tau] and [tau]' and using the fact that [mu] < 1, we can verify the above inequality.

Intergenerational Correlation of Income and Talent

In this part, we consider the intergenerational correlation within one dynasty i and treat [h.sub.i] as a time-invariant fixed effect. Because the variance is stationary, the stationary intergenerational correlation in income is equal to:

(A30) Corr([y.sub.i,t+1], [y.sub.i,t]|[h.sub.i]) = Cov([y.sub.i,t], [y.sub.i,t]|[h.sub.i])/ Var([y.sub.i,t]|[h.sub.i]) =[mu] + [mu] Cov([y.sub.i,t],[v.sub.i,t+1]|[h.sub.i]/Var([y.sub.i,t]|[h.sub.i])

where we have used Equation (7) and the properties of [u.sub.i,j+1]. To obtain the expression (19) in the text, we insert the variance from Equation (16) and the covariance from Equation (A21) into the correlation shown in Equation (A30). We can differentiate Equation (19):

(A31) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

Because all terms are positive, the correlation is increasing in IX and the claim in Proposition 1 is proven. A similar reasoning shows that the stationary intergenerational correlation of talent is:

(A32) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

Differently from income, the intergenerational correlation of talent has ambiguous comparative static in [mu]. A more progressive policy decreases both the covariance and the variance of income and talent. For income, the rate of decrease in the variance is smaller than that of the covariance and the comparative static is unambiguous. But in the case of talent, the covariance is not sufficiently decreasing because talent is not directly affected by [mu]. We can show that the intergenerational correlation in talent is increasing in [mu] provided that [[sigma].sup.2.sub.u] is not too large relative to [[sigma].sup.2.sub.u].

Finally, we prove the claim in Proposition 1 that the ratio of intergenerational correlations is smaller than one and increasing in [mu]. First, consider the ratio:

(A33) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

The difference between the last term in the denominator and the numerator is [[sigma].sup.2.sub.v] [[sigma].sup.2.sub.u](1 - [[rho].sup.2.sub.1])(1 - [[rho].sub.1][mu])[[rho].sub.1][([mu] - 1).sup.2] This difference is positive because [[sigma].sup.2.sub.v] > 0, [[sigma].sup.2.sub.u] > 0 and [[rho].sub.1] < 1. As a result, the expression in Equation (A33) is smaller than unity, strictly when [mu] < 1, as claimed in Proposition 1. In addition, the ratio is increasing in [mu]. To see this, rewrite the ratio as:

(A34) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

Denote by N the numerator and by D the denominator of this expression. The ratio of correlations increases in IX if and only if the derivate N'D - D'N is positive. Since the denominator exceeds the numerator, D > N, it suffices to show that N' > D' > 0. From Equation (A34) it is evident that both terms increase in [mu]. From N- D = -[[sigma].sup.2.sub.v] [[sigma].sup.2.sub.u] 1-[[rho].sup.2.sub.1]/1-[[rho].sub.1][mu][[rho].sub.1][([mu]-1).sup.2], we take N'-D' = -[[sigma].sup.2.sub.v] [[sigma].sup.2.sub.u](1 - [[rho].sup.2.sub.1])[[rho].sub.1]([mu]-1)(2-[rho][mu]-[rho])/[(1 - [[rho].sub.1][mu]).sup.2] 0, which proves the claim.

Proof of Proposition 2

Using the consumption function in Equation (21) and the conditional expectation of income in Equation (15), we can express the indirect utility function as:

(A35) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

where [Q.sub.i,t] = [y.sub.i,t] + [[rho].sub.1][v.sub.i,t] + ln[h.sub.i] is family i's income potential at time t. Differentiating W with respect to [[mu].sub.t+1] we take:

(A36) [partial derivative]W/[partial derivative][[mu].sub.t+1] = [W.sub.1] + 1/[gamma][[W.sub.2] + [W.sub.3] + [W.sub.4] + [W.sub.5] + [Q.sub.i,t]]

In this expression, the term [W.sub.1] = 1/[[mu].sub.t+1] + [gamma] < 0 captures the intertemporal trade-off, [W.sub.2] = ln([[mu].sub.t+1]/[[mu].sub.t+1] + [gamma]) < 0 measures the beneficial insurance effects of public policy, [W.sub.3] = [gamma]/[[mu].sub.t+1] + [gamma] > 0 is the term associated with the distortions in investment, [W.sub.4] = [alpha]/[[mu].sub.t+1] > 0 is the direct output cost, [W.sub.5] = [[rho].sub.0](1 - [[rho].sub.1]) > 0 shows that insurance is less beneficial the higher is the long-run level of the endowment [v.sub.i,t], and [Q.sub.i,t] is defined above. Differentiating Equation (A36) with respect to [[mu].sub.t+1] we have:

(A37) [partial derivative][W.sup.2]/[partial derivative][[mu].sup.2.sub.t+1][varies]1/[gamma] + [[mu].sub.t+1] - [alpha]/[[mu].sub.t+1][gamma]

A sufficient condition for single-peaked preferences is the strict concavity of the indirect utility. This requires that [[mu].sub.t+1][gamma]/[[mu].sub.t+1][gamma] < [alpha]. Since the left hand side of this inequality is bounded above by 1, the first part of the claim in Proposition 2 follows. For the second part of the Proposition, set [partial derivative]W/[partial derivative][[mu].sub.t+1] equal to zero, and use the implicit function theorem and the concavity of W in an interior optimum:

(A38) [partial derivative][[mu].sub.t+1]/[partial derivative][Q.sub.i,t][varies][partial derivative][W.sup.2]/[partial derivative][[mu].sub.t+1][partial derivative][Q.sub.i,t] = 1/[gamma] > 0

Proof of Proposition 3

If 0 < [[mu].sub.i,t+1] < 1 is the most preferred public policy for a dynasty with parameter [Q.sub.i,t], then it necessarily satisfies the first-order condition, [partial derivative]W/[partial derivative][[mu].sub.t+1] = 0, where the derivative is given by Equation (A36). In addition, if [alpha] > 1, then W is globally concave, and hence any solution to the first order condition will be the unique optimum. Since the implicit function theorem applies, the comparative static [partial derivative][[mu].sub.t+1]/[partial derivative]Z has the same sign as the cross partial [[partial derivative].sup.2]W([[mu].sub.t+1]([h.sub.i]))/[partial derivative]([[mu].sub.t+1])[partial derivative]z.

Therefore, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII].

Since the most preferred policy [[mu].sub.t+1] of low [Q.sub.i,t] families is lower, it follows that when the position of the decisive agent p decreases, [[mu].sub.t+1] also decreases. For the parameter that expresses the degree of parental altruism, after some algebra and using the first-order condition at optimum, we have:

(A39) [[partial derivative].sup.2]([[mu].sub.t+1]/[partial derivative] [[mu].sub.t+1][partial derivative][gamma] = 1/[gamma] < 0

We briefly discuss the remaining comparative statics. First, social mobility is lower in societies with higher long-run income (higher [[rho].sub.0]). At first glance, this may appear counterfactual, since the conjecture is that in less developed economies, social mobility is lower (Solon, 2002). However, this could be because less developed economies have poorer tax collection technologies (high [alpha]) and limited expansion of voting rights (high p).

Second, in the original Becket and Tomes (1979) model, altruistic parents invest more in the human capital of their children, which strengthens the intergenerational transmission and lowers social mobility. This result also holds in our model, but it takes place through a different mechanism. (26) Because a lower [mu] distorts fathers' investment decisions, a higher [mu] (less progressivity) redistributes resources in favor of the future generation. Hence, more altruistic fathers transfer more resources to the next generation by choosing a higher [mu].

Third, if the decisive voter is temporarily well-endowed in family ability ([v.sub.i,t] > [[rho].sub.0]), then cultural persistence decreases the progressivity of the public policy. This result is consistent with the hypothesis that stronger family ties offer insurance and therefore "crowd out" the scope for social insurance.

Fourth, given the income potential [Q.sub.p,t], the parameters [[sigma].sup.2.sub.v] and [[sigma].sup.2.sub.u] do not affect the optimal [mu]. Because of the assumed log--log specification, substitution and income effects cancel off, and consumption and investment are constant fractions of output, independently of the properties of the shocks. In a more general specification of preferences, the scope for insurance will increase when endowment and market luck become more variable. Nevertheless, the properties of the two shocks can matter indirectly for [mu], through the evolution of the income potential in the next period [Q.sub.p,t+1]. Therefore, the persistence and volatility of the equilibrium [mu] are affected by cultural, genetic, and market randomness.

Proof of Proposition 4

First, we examine a stationary state with [[mu].sub.t+1] = [[mu].sub.t]. The population coefficient vector is defined as the argument that minimizes the least squares problem in the population:

(A40) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

The well known formula for the population slope is given by:

(A41)

[beta] = Cov([y.sub.i,t+1], [y.sub.i,t])/Var([y.sub.i,t]) = Corr([y.sub.i,t+1], [y.sub.i,t]) =Cov([[delta].sub.0] + [[mu].sub.t+1](ln[h.sub.i] + [y.sub.i,t] + [v.sub.i,t+1] + [[mu].sub.i,t+1]), [y.sub.i,t])/ Var([y.sub.i,t])

which, from the imposed stationarity Var([y.sub.i,t+1]) = Var([y.sub.i,t]) also equals the cross-sectional intergenerational correlation, Corr([y.sub.i,t+1] [y.sub.i,t]). Recalling the properties of [[mu].sub.i,t+1] and [[epsilon].sub.i,t+1], we have:

(A42) [beta] = [[mu].sub.t+1](1 + Cov([v.sub.i,t+1], [y.sub.i,t]) + Cov(ln[h.sub.i], [y.sub.i,t])/Var([y.sub.i,t]))

The first covariance in the numerator is given by Equation (A21), because the fixed effect [h.sub.i] is orthogonal to the [[epsilon].sub.i,t+1] and hence the [v.sub.i,t+1] process. The stationary covariance between the family-fixed effect and income is given by:

(A43) Cov(ln [h.sub.i], [y.sub.i,t]) = [[mu].sub.t+1]/1 - [[mu].sub.t+1]Var(ln [h.sub.i])

Putting all pieces together and setting [[mu].sub.t+1] = [[mu].sub.t] = [mu], yields the expression for [beta] in Proposition 4.

Next, we show that [beta] is increasing in [mu]. Using the variances in Equations (16)-(18) yields:

(A44) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

(A45) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

Consider the last term in the numerator and the denominator. Because 1-[[rho].sub.1][mu]/1 - [mu] is increasing in [mu], this term also increases in [mu]. So, adding the same, increasing in [mu], term both in the numerator and the denominator, tends, holding constant all other terms, to produce an increasing [beta], because the numerator is smaller than the denominator. Furthermore, [beta] will increase more in [mu] due to this last term, when Var(ln [h.sub.i]) is higher. Hence, consider Var(ln [h.sub.i]) = O. In this case (A45) collapses to the dynastic correlation in Equation (19). Previously in this Appendix, we showed that this correlation is increasing in [mu], which completes the proof of the claim that [beta] increases in [mu].

Differentiating Equation (A45) with respect to [[sigma].sub.u], we can show that:

(A46) [partial derivative][beta]/[partial derivative] [[sigma].sup.2.sub.u][varies]([[mu].sup.2] - 1) x([[rho].sub.1][[sigma].sup.2.sub.v]+(1-[[rho].sup.2.sub.1) (1+[mu])1-[[rho].sub.1][mu]/1 - [mu]Var(ln[h.sub.i])) [less than or equal to] 0

as claimed in Proposition 4. Differentiating Equation (A45) with respect to Var(ln [h.sub.i]), we obtain

(A47) [partial derivative][beta]/[partial derivative]Var (ln [h.sub.i])[varies](1 - [[mu].sup.2]) ((1 - [[rho].sub.1])[[sigma].sup.2.sub.v] + (1 - [[rho].sup.2.sub.1])(1 - )[[sigma].sup.2.sub.u]) [greater than or equal to] 0

The comparative statics of [beta] with respect to [alpha], p, [Q.sub.p,t], [[rho].sub.1] and [gamma] follow from Proposition 3 and the result [partial derivative][beta]/[partial derivative][mu] > 0. Finally, we have verified numerically that [mu] is non-monotonic in [[rho].sub.1] and [[sigma].sup.2.sub.v] for various combinations of parameters.

Finally, for the second part of the Proposition we use the new equilibrium [[mu].sub.t+1] in the AR(1) process for income in Equation (7). Var([y.sub.i,t] is given by Equation (17) in the text for policy [[mu].sub.t]. The formulas for Cov(ln[h.sub.i], [y.sub.i,t]) and Cov([v.sub.i,t+1], [y.sub.i,t]) are taken by assuming that before the structural break the economy is in a steady state with [[mu].sub.t], = [[mu].sub.s], for all s < t + 1.

APPENDIX B: DATA

Social Mobility: Data for the intergenerational earnings elasticity is taken from http://www.iza.org/index_html?lang= en&mainframe=http%3A//www.iza.org/en/webcontent/ personnel/photos/index_html%3Fkey%3D83&topSelect= personnel&subSelect=fellows Corak's (2006) meta-analysis. For Australia we use estimates from Leigh (2007). For Japan we use estimates from Lefranc, Ojima and Yoshida (2008). For Spain we use estimates from d'Addio (2007).

Private Return to Education: Taken from http://www.olis. oecd.org/olis/2007doc.nsf/LinkTo/NT000059E2/$FILE/ JT03238193.PDF Boarini and Strauss (2007), Table 3. Calculated as the simple average in every country for the years available (males and females).

Total Government Spending and Social Welfare Spending: Government spending denotes central government consumption and investment. Social Welfare denotes consolidated government spending on social services as percentage of GDP. This data is taken from http://www.igier.unibocconi.it/whos.php?vedi= 1169&tbn=albero&id_ folder= 177 Persson and Tabellini (2003). The variables are averaged over the 1960-1998 period.

Public Education: Data taken from http://www.oecdwash. org/PUBS/ELECTRONIC/epels.htm#edustat OECD's Online Education Database. The series extracted are Public education expenditure as % of GDP, Public education expenditure per student (% of p.c. GDP), at all levels, and Public education expenditure per student (% of p.c.GDP), at the primary, secondary, and tertiary level. For every country we average the series for all available years in periods 1970-2007.

Ethnolinguistic fractionalization (ELF): Taken from http:// weber.ucsd.edu/proeder/elf.htm Roeder (2001). The ELF index is defined as one minus the probability that two randomly chosen persons from a population belong to the same ethnic, linguistic or racial group. A higher ELF index denotes a more heterogeneous population. The value taken refers to the year 1961.

Gini Coefficient: The Gini coefficients at the factor and the gross earnings level are taken from http://www.sciencedirect. com/science/article/B6V97-40X8GBB-1/2/ a10cf47920052f9f940852f3781bc71c Milanovic (2000) and are averaged across all available periods for any given country.

Weak Family Ties: Taken from http://ftp.iza.org/dp2750. pdf Alesina and Giuliano (2007).

Political Inequality Variables: Taken from the Four Wave http://www.worldvaluessurvey.org/World Values Survey. The political participation variables that we use are recoded in binary form as follows: Interested in Politics (WVS code: E023; recoded as 1 for responders that answered 1 or 2, and 0 otherwise); Belong to Political Party (A068; already binary): Sign Petitions (E025; 1 if the responder answered yes and 0 otherwise); Participation in Lawful Demonstration (E027; 1 if the responder answered 1 or 2, 0 otherwise); Belong to Labor Union (A067; already binary). The income classification follows the variable X047R; see also Table 1.

REFERENCES

Aaronson, D., and B. Mazumder. "Intergenerational Economic Mobility in the US: 1940 to 2000." Journal of Human Resources, 43(1), 2008, 139-72.

Alesina, A., and P. Giuliano. "The Power of the Family." NBER Working Paper, No. 13051, 2007.

--. "Preferences for Redistribution." NBER Working Paper, No. 14825, 2009.

Alesina, A., and E. Glaeser. Fighting Poverty in the US and Europe: A World of Difference. Oxford, UK: Oxford University Press, 2004.

Alesina, A., and D. Rodrik. "Distributive Politics and Economic Growth." Quarterly Journal of Economics, 109, 1994, 465-90.

Atkeson, A., and R. Lucas. "On Efficient Distribution with Private Information." Review of Economic Studies, 59, 1992, 427-53.

Becker, G., and N. Tomes. "An Equilibrium Theory of the Distribution of Income and Intergenerational Mobility." Journal of Political Economy, 87, 1979, 1153-89.

--. "Human Capital and the Rise and Fall of Families." Journal of Labor Economics, 4, 1986, S1-39.

Benabou, R. "Inequality and Growth," in NBER Macroeconomics Annual, edited by B. Bernanke and J. Rotemberg. Cambridge: MIT Press, 1996,

11-73.

--. "Unequal Societies: Income Distribution and the Social Contract." American Economic Review, 90, 2000, 96-129.

Benabou, R., and Ok, E. "Social Mobility and the Demand for Redistribution: The POUM Hypothesis." Quarterly Journal of Economics, 116(2), 2001, 447-87.

Bernasconi, M., and P. Profeta. "Redistribution or Education'? The Political Economy of the Social Race." CESifo Working Paper No. 1934, 2007.

Bjorklund, A., and M. Jantti. "Intergenerational Income Mobility in Sweden Compared to the United States." American Economic Review, 87, 1997, 1009-18.

Blanden, J., P. Gregg, and L. Macmillan. "Accounting for Intergenerational Income Persistence: Noncognitive Skills, Ability and Education." Economic Journal, 117(1), 2007, C43-60.

Boarini, R., and H. Strauss. "The Private Internal Rates of Return to Tertiary Education: New Estimates for 21 OECD Countries." OECD Economics Department Working Papers, No. 591, 2007.

Campante, F. "Redistribution in a Model of Voting and Campaign Contributions." Mimeo, Harvard University, 2007.

Checchi, D., A. Ichino, and A. Rustichini. "More Equal But Less Mobile? Education Financing and Intergenerational Mobility in Italy and the US." Journal of Public Economics, 74, 1999, 351-93.

Corak, M. "Do Poor Children Become Poor Adults? Lessons From a Cross Country Comparison of Generational Earnings Mobility." Research on Economic Inequality, 13(1), 2006, 143-88.

Corak, M., and A. Heitz. "The Intergenerational Earnings and Income Mobility of Canadian Men: Evidence from Longitudinal Income Tax Data." Journal of Human Resources, 34(3), 1999, 504-33.

d'Addio, A. C. "Intergenerational Transmission of Disadvantage: Mobility or Immobility Across Generations?" OECD Social Employment and Migration Working Papers, No. 52, 2007.

Davies, J., J. Zhang, and J. Zeng. "Intergenerational Mobility under Private vs. Public Education." Scandinavian Journal of Economics, 107(3), 2005, 399-417.

Farhi. E., and I. Werning. "Inequality and Social Discounting." Journal of Political Economy, 115(3), 2007, 365-402.

Fernandez, R., and R. Rogerson. "Public Education and Income Distribution: A Dynamic Quantitative Evaluation of Education-Finance Reform." American Economic Review, 88(4), 1998, 813-33.

Glomm, G., and B. Ravikumar. "Public vs. Private Investment in Human Capital: Endogenous Growth and Income Inequality." Journal of Political Economy, 100, 1992, 818-34.

Goldberger, A. "Economic and Mechanical Models of Intergenerational Transmission." American Economic Review, 79(3), 1989, 504-13.

Hassler, J., J.V. Rodriguez Mora, K. Storesletten, and F. Zilibotti. "The Survival of the Welfare State." American Economic Review, 93(1), 2003, 87-112.

Hassler, J., J. V. Rodriguez Mora, and J. Zeira. "Inequality and Mobility." Journal of Economic Growth, 12, 2007, 235-59.

Karabarbounis, L. "One Dollar, One Vote." Mimeo, University of Chicago Booth School of Business. 2010.

Krusell, P., V. Quadrini, and J.-V. Rios-Rull. "Politico-Economic Equilibrium and Economic Growth." Journal of Economic Dynamics and Control, 21, 1997, 243-72.

Lee, C.-I., and G. Solon. "Trends in Intergenerational Income Mobility." NBER Working Paper, No. 12007, 2006.

Lefranc, A., F. Ojima, and T. Yoshida. "The Intergenerational Transmission of Income and Education: A Comparison of Japan and France." Working Paper, EUI, 2008.

Leigh, A. "Intergenerational Mobility in Australia." B.E. Journal of Economic Analysis and Policy, 7(2), Article 6, 2007.

Lindert, P. Growing Public: Social Spending and Economic Growth Since the Eighteenth Century, Vol. 1 and 2. New York, NY: Cambridge University Press, 2004.

Loury, G. "Intergenerational Transfers and the Distribution of Earnings." Econometrica, 49(4), 1981, 843-7.

Mayer, S., and L. Lopoo. "Has the Intergenerational Transmission of Economic Status Changed?" Journal of Human Resources, 40(1), 2005, 169-85.

Mazumder. B. "Fortunate Sons: New Estimates of Intergenerational Mobility in the U.S. Using Social Security Earnings Data." Review of Economics and Statistics, 87(2), 2005, 235-55.

Mazumder, B. "Trends in Intergenerational Mobility." Industrial Relations, 46(1), 2007, 1-6.

Milanovic, B. "The Median Voter Hypothesis, Income Inequality and Income Redistribution: An Empirical Test with the Required Data." European Journal of Political Economy, 16, 2000, 367-410.

Mulligan, C. Parental Priorities and Economic Inequality. Chicago, IL: The University of Chicago Press, 1997.

Pekkarinen, T., R. Uusitalo, and S. Kerr. "School Tracking and Intergenerational Income Mobility: Evidence from the Finnish Comprehensive School Reform." Mimeo, Helsinki School of Economics, 2008.

Persson, T., and G. Tabellini. "Is Inequality Harmful for Growth?" American Economic Review, 84(3), 1994, 600-21.

--. The Economic Effects of Constitutions. Cambridge, MA: MIT Press, 2003.

Phelan, C. "Opportunity and Social Mobility." Review of Economic Studies, 73, 2006, 487-504.

Piketty, T. "Social Mobility and Redistributive Politics." Quarterly Journal of Economics, 110(3), 1995, 551-84.

--. "Theories of Persistent Inequality and Intergenerational Mobility," in Handbook of Income Distribution, Vol. 1, edited by A. B. Atkinson and F. Bourguignon. North-Holland, 2000, 429-76.

Saint-Paul, G., and T. Verdier. "Education, Democracy and Growth." Journal of Development Economics, 42, 1993, 399-407.

Solon, G. "Intergenerational Income Mobility in the United States." American Economic Review, 82, 1992, 393-408.

--. "Intergenerational Mobility in the Labor Market," in Handbook of Labor Economics, Vol. 3B, edited by O. C Ashenfelter and D. Card. New York, NY: Elsevier Science Press, 1999, 1762-96.

--. "Cross-Country Differences in Intergenerational Earnings Mobility." Journal of Economic Perspectives, 16(3), 2002, 59-66.

--. "A Model of Intergenerational Mobility Variation over Time and Place," in Generational Income Mobility in North America and Europe, edited by M. Corak. Cambridge University Press, 2004, 38-47.

Zimmerman, D. "Regression Toward Mediocrity in Economic Stature." American Economic Review, 82, 1992, 409-29.

(1.) See the evidence in Section V. See also Alesina and Glaeser (2004) for more on this point.

(2.) Pekkarinen, Uusitalo, and, Kerr (2008) show how the major Finnish educational reform in the 1970s decreased the intergenerational elasticity of income from 0.30 to 0.23. Their finding is consistent with our interpretation of [[mu].sub.t].

(3.) We do not restrict [[THETA].sub.i,t] to be smaller than unity. If in some period [[THETA].sub.i,t] [less than or equal to] 1 for all families i, we can think the special case with [alpha] = 0 as a growth-enhancing reform that benefits every family, with the least talented families gaining relatively more.

(4.) We assume that fathers cannot borrow against their son's future income. See Loury (1981); Becket and Tomes (1986) and Mulligan (1997), for an analysis of the relationship between social mobility and borrowing constraints. See also Benabou (1996, 2000).

(5.) Goldberger (1989) explains in detail the difference between the additive production function (as in the Becker and Tomes model) and the multiplicative production function. We also note that in our specific Cobb-Douglas environment, the degree of parental altruism ([gamma]) does not enter into the intergenerational transmission equation directly, that is, for given policy [mu] (see Solon 2004, for a similar result).

(6.) In Equation (16) the variance is not indexed by i and as a result [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]. The variance of income is common to all families i because [h.sub.i] enters multiplicatively into the production of talent (6). The same comment applies for the intergenerational correlation of incomes below. In a more general version of our model, we could allow for heterogeneity in the returns to investment (e.g., with a production function of the form: [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]. Under this specification the slope of the regression ([[delta].sub.1]) in Equation (7) depends on i.

(7.) This result reflects the difference between the coefficients [[delta].sub.2] and [[lambda].sub.2] (or [[delta].sub.i] and [[lambda].sub.i]) in the two intergenerational transmission equations. See Appendix for the details.

(8.) That is, the indirect utility W in Equation (20) depends only on the current choice variable, [[mu].sub.t+1], and not on future public policies, [[mu].sub.t+2],.... As a result, we do not have to consider the policy-fixed point problem that arises when current policies depend on expectations of future policies but also affect future policies through the optimal consumption and investment choices and the resulting intergenerational transmission of income and talent. Our setup resembles the equilibrium in the models of Persson and Tabellini (1994); Benabou (1996); and Fernandez and Rogerson (1998), with "one period-ahead commitment to policy." Krusell, Quadrini, and Rios-Rull (1997) show how to formulate and numerically solve for time-consistent politico-economic equilibria in a general class of models. Hassler et al. (2003) solve closed-form the Markov perfect equilibrium in a nontrivial dynamic voting game under the assumption of risk neutrality.

(9.) As a result, income and talent become regime switching stochastic processes, that is, with time varying coefficients. One interesting and realistic case occurs if there is an adjustment cost associated with an educational reform that aims to switch [mu]. In this case, the process for output would be a threshold ARMA(2,1) process, where the thresholds are defined by the distribution of [Q.sub.i,t] in the cross section of families. For instance, suppose that the fixed costs of expanding the public schooling infrastructure are too prohibitive and therefore [mu] can take only two values: 0 < [[mu].sub.l] < [[mu].sub.h] < 1. Assuming that in period t - 1, [[mu].sub.t] = [[mu].sub.h] was the optimal grandfather's choice, a majority of fathers support a switch of regime to [[mu].sub.t+1] = [[mu].sub.l], if [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] ln [[mu].sub.h]/[[mu].sub.l] + [alpha]/[[mu].sub.l] - [[mu].sub.h] ln [[mu].sub.h]/[[mu].sub.l] - [[rho].sub.0](1 - [[rho].sub.1]) is a constant, [[phi].sub.t] denotes the probability distribution of income potential in the cross section of dynasties as of the beginning of period t and [Q.sub.l,t] is the lowest realized income potential. We index the distribution by t to show the possible dependency on [[mu].sub.t] and hence on calendar time. Under this setting, the expectations, variances, and intergenerational correlations derived in Section III hold within each educational regime.

(10.) Becket and Tomes (1979; abstract and page 1182) argue that

"Intergenerational mobility measures the effect of a family on the well-being of its children."

(emphasis added). Another influential contribution is that of Mulligan (1997, page 25), who in defining social mobility notes that

"The degree of intergenerational mobility is [...] an index of the degree of 'equality of opportunity'. Equality of opportunity is often seen as desirable because, with little correlation between the incomes of parents and children, children from rich families do not enjoy much of a 'head start' on children from poor families."

The same presumption may be implied by the introductory paragraph in the study of Solon (1999).

(11.) Piketty (2000) and Corak (2006) make this point. In an influential paper, Atkeson and Lucas (1992) have shown the optimality of zero mobility. Recently, Phelan (2006) and Farhi and Werning (2007) challenge this result based on the social discount rate exceeding the private one.

(12.) We have not explicitly considered the growth-enhancing effects of public education. However, if average ability [bar.h] is sufficiently low, then in the steady state the stationary average income in the cross section of the dynasties, [[integral].sub.H] E([y.sub.i,t+1] |[h.sub.i])d[[PHI].sub.h] (h), is decreasing in [mu], and the progressivity increases long-run income, which implicitly may be capturing this realistic feature of public education.

(13.) Note that in both cases [beta] is expressed only as a function of the deeper parameters of the model.

(14.) VAT taxes rose around 15%, and each of the corporate tax rate and the top marginal income tax decreased by 17%. Public expenditure for education as a percentage of gross domestic product (GDP) decreased by 25% between 1975 and 1985 and by 30% by the end of the 1980s.

(15.) In the following Figures we use Corak's most preferred estimate, but we have verified the robustness of our results using the median estimate found in the literature. The nine countries are Denmark, Norway, Finland, Canada, Sweden, Germany, France, the United States, and the United Kingdom. We also add Japan, Spain, and Australia. Some recent papers have estimated the intergenerational income elasticity in Italy, but: (a) the estimates are based on heroic assumptions needed to use intergenerational income data of low quality; (b) the estimates are especially high and (c) even using a more conservative value, Italy is most of the times a major outlier of which we cannot be really confident. The only variable that seems to explain satisfactorily Italy's low degree of mobility is the strength of family ties (high [[rho].sub.1]).

(16.) Conditioning on both determinants, the latter turns out to be much more strongly associated with mobility than the former (correlation of -0.43 vs. 0.15).

(17.) In contrast, the correlation of [beta] with total government spending is -0.05, and the correlation of [beta] with spending on social expenditures is -0.11. The weakness of these correlations illustrates that it is educational expenditure, rather than other forms of government spending (e.g., unemployment insurance, assistance to poor families, welfare benefits, etc.), that may matter for social mobility.

(18.) In a previous version of the paper we used voter turnout in elections and union density as additional proxies for p. For all cases we find correlations between p, [mu], and [beta] that are consistent with our model.

(19.) Sweden and Germany are the two outliers.

(20.) This finding is robust to the exclusion of outliers.

(21.) One of the few studies that attribute cross-country differences in mobility to public policies is Corak and Heitz (1999). The authors conjecture that Canada's progressivity can explain its higher mobility relative to the United States.

(22.) The index is defined as one minus the probability that two random persons in some country belong to the same ethnic, linguistic or racial group.

(23.) The index is due to Alesina and Giuliano (2007). We thank the authors for providing us with their data.

(24.) These statistics come from Milanovic (2000).

(25.) Bjorklund and Jaintti (1997) hypothesize that common causes may explain United States's higher intergenerational and cross-sectional inequality relative to Sweden's. Recently, Hassler, Rodriguez Mora, and Zeira (2007) argue that inequality and mobility may be positively correlated if labor market institutions differ significantly across countries or negatively correlated if educational subsidies drive the cross-country variation.

(26.) In our model, altruistic fathers invest more in their children's human capital, holding constant [mu]. However, because of the log-linear specification, altruism does not enter directly in the intergenerational transmission equation. See Solon (2004) for a similar result.

ANDREA ICHINO, LOUKAS KARABARBOUNIS and ENRICO MORETTI *

* We would like to thank Alberto Alesina, Giacomo Calzolari, Ed Glaeser, John Hassler, Larry Katz, Mattia Nardotto, Sevi Rodriguez-Mora, and seminar participants at C6-Capri, ESSLE-CEPR 2008, Edinburgh University, Harvard and Sorbonne for useful discussions. We also thank two anonymous referees for their helpful suggestions.

Ichino: Professor of Economics, University of Bologna, Department of Economics, Bologna, Italy. Phone (+39) 051-20-98-878 E-mail andrea.ichino@unibo.it

Karabarbounis: Assistant Professor of Economics and Neubauer Family Faculty Fellow, University of Chicago, Booth School of Business, Chicago, IL. Phone (+01) 773-834-8327 E-mail loukas.karabarbounis@ chicagobooth.edu

Moretti: Professor of Economics, University of California-Berkeley, Department of Economics, Berkeley, CA. Phone (+01) 510-642-6649 E-mail moretti@econ. berkeley.edu
TABLE 1
Classification in Poor, Middle, and Rich Per
Country: WVS Data

 Poor (%) Middle (%) Rich (%)

Australia 29 34 37
Canada 31 36 33
Denmark 31 41 28
Finland 33 33 34
France 33 37 30
Germany 39 33 28
Japan 32 36 32
Norway 35 40 25
Spain 30 44 26
Sweden 26 44 31
The United Kingdom 35 35 30
The United States 35 36 29
Average 33 37 30

Notes: Percentages are rounded to sum to 100. The
numbers refer to the full sample from the Four Wave WVS
Data. Actual percentages used in the empirical results may
differ slightly depending on the political variable used.
联系我们|关于我们|网站声明
国家哲学社会科学文献中心版权所有