Two puzzles of asset pricing and their implications for investors.
Campbell, John Y.
1. Introduction
The tradeoff of risk and return is becoming ever more important for
individuals, institutions, and public policy. In fact, Bernstein (1996)
suggests that the rational analysis of risk is a defining characteristic
of the modern age.
This paper explores risk and return in aggregate stock market
investment. It is based on several earlier expositional and research
pieces, notably Campbell (1999, 2000, 2003), Campbell and Cochrane
(1999), Campbell and Shiller (2001), and Campbell and Viceira (2002).
The comparison of the stock market with the money market is
startling. For example, if we look at log real returns on US stocks and
Treasury bills over the period 1947.2-1998.4, we find, first, that the
average stock return is 8.1%, while the average bill return is 0.9%; and
second, that the volatility of the stock return is 15.6% while the
volatility of the ex post real bill return is only 1.8%. (1)
These facts lead to two puzzles of asset pricing. The first was
christened the equity premium puzzle by Mehra and Prescott (1985): Why
is the average real stock return so high (in relation to the average
short-term real interest rate)? The second might be called the equity
volatility puzzle: Why is the volatility of stock returns so high (in
relation to the volatility of the short-term real interest rate)? The
classic reference to this second puzzle is Shiller (1981).
Economists have tried to resolve these puzzles by linking asset
prices to aggregate consumption. This is a natural approach because
consumption is the most obvious determinant of marginal utility (in
simple models, the only determinant). Hence, covariance with consumption
measures risk. Also, consumption can be thought of as the dividend on
the portfolio of aggregate wealth. It is natural to model stocks as
claims to the stream of consumption.
Unfortunately, aggregate consumption has several properties that
deepen the puzzles of asset pricing. First, real consumption growth is
very stable, with an annualized standard deviation of 1.1%. Second, the
correlation of consumption growth and stock returns is weak (0.23 at a
quarterly frequency, and 0.34 at an annual frequency). Third, stock
prices have very little ability to forecast consumption growth. The
[R.sup.2] statistic of a regression of consumption growth on the log
dividend-price ratio is never greater than 4% at horizons from 1 to 4
years.
Economists also try to link stock prices to the behavior of
dividends, without assuming that dividends equal consumption. Here too
there are puzzles. Quarterly dividend volatility is high (28%), but this
is due to strong seasonality in dividends. Annual dividend volatility is
only about 6%. This volatility is much larger than consumption growth
(1%), but much smaller than stock returns (16%). Stock returns are
somewhat more strongly correlated with dividends than with consumption,
but the maximum correlation at any horizon up to 4 years is only 0.34 at
a 1-year horizon. Finally, the dividend-price ratio has little ability
to forecast dividend growth. The [R.sup.2] statistic of a regression of
dividend growth on the log dividend-price ratio is never greater than 8%
at horizons from 1 to 4 years.
These features of US financial data are also apparent in other
countries. Campbell (2003) summarizes stock market data from Morgan
Stanley Capital International (MSCI) and macroeconomic data from the
International Financial Statistics (IFS) of the International Monetary
Fund for 11 developed countries. He also reports results for long-term
annual data from Sweden (starting in 1920), the UK (starting in 1919),
and the US (starting in 1891). He shows that stock markets have
delivered average real returns of 5% or better in almost every country
and time period. The exceptions to this occur in short-term quarterly
data, and are concentrated in markets that are particularly small
relative to GDP (Italy) or that predominantly represent claims on
natural resources (Australia). Short-term debt, on the other hand, has
rarely delivered an average real return above 3%. Stock markets are
volatile in every country, while aggregate consumption is smooth and
aggregate dividends have an intermediate volatility.
These numbers show that the equity premium and equity volatility
puzzles are not unique to the United States but characterize many other
countries as well. Recently a number of authors have suggested that
average excess returns in the US may be overstated by sample selection
or survivorship bias. If economists study the US because it has had an
unusually successful economy, then sample average US stock returns may
overstate the true mean US stock return. The international data suggest
that this is not a serious problem. (2)
The organization of this paper is as follows. Section 2 presents
the equity premium and equity volatility puzzles. Section 3 argues that
the equity volatility puzzle is the harder of the two to resolve, and
presents several possible explanations. Section 4 discusses implications
for investors.
2. The equity premium puzzle and the equity volatility puzzle
I now state the equity premium puzzle using the stochastic discount
factor (SDF) paradigm. This approach to asset pricing, which has its
roots in the work of Rubinstein (1976), Breeden (1979), Grossman and
Shiller (1981), and Shiller (1982), has become increasingly influential
since the work of Hansen and Jagannathan (1991). Cochrane (2001)
provides a unified textbook treatment of asset pricing in these terms.
Consider the intertemporal choice problem of an investor, indexed
by k, who can trade freely in some asset i and can obtain a gross simple
rate of return (1 + [R.sub.ij+1]) on the asset held from time t to time
t + 1. If the investor consumes [C.sub.kt] at time t and has
time-separable utility with discount factor [delta] and period utility
U([C.sub.kt]), then his first-order condition is
U'([C.sub.kt]) = [delta][E.sub.t] [(1 + [R.sub.i,t+1])U'
([C.sub.k,t+1])]. (1)
The left hand side of (1) is the marginal utility cost of consuming
one real dollar less at time t; the right hand side is the expected
marginal utility benefit from investing the dollar in asset i at time t,
selling it at time t + 1, and consuming the proceeds. The investor
equates marginal cost and marginal benefit, so (1) must describe the
optimum.
Dividing (1) by U'([C.sub.kt]) yields
1 = [E.sub.t] [(1 + [R.sub.i,t+1] [delta]
U'([C.sub.k,t+1])/U'([C.sub.kt])] = [E.sub.t] = [(1 +
[R.sub.i,t+1]) [M.sub.k,t+1]], (2)
where [M.sub.k,t+1] =
[delta]U'([C.sub.k,t+1])/U'([C.sub.kt]) is the intertemporal
marginal rate of substitution of the investor, also known as the
stochastic discount factor or SDF. Since marginal utility must always be
positive, the SDF must always be positive.
The derivation just given for equation (2) assumes the existence of
an investor maximizing a time-separable utility function, but in fact
the equation holds more generally. The existence of a positive
stochastic discount factor is guaranteed by the absence of arbitrage in
markets in which non-satiated investors can trade freely without
transactions costs. In general there can be many such stochastic
discount factors--for example, different investors k whose marginal
utilities follow different stochastic processes will have different
[M.sub.k,t+1] -- but each stochastic discount factor must satisfy
equation (2). It is common practice to drop the subscript k from this
equation and simply write 1 = [E.sub.t] [(1 + [R.sub.i,t+1])
[M.sub.t+1]]. In complete markets the stochastic discount factor
[M.sub.t+1] is unique because investors can trade with one another to
eliminate any idiosyncratic variation in their marginal utilities.
To understand the implications of (2) in a simple way, I follow
Hansen and Singleton (1983) and assume that the joint conditional
distribution of asset returns and the stochastic discount factor is
lognormal and homoskedastic. While these assumptions are not literally
realistic--stock returns in particular have fat-tailed distributions
with variances that change over time--they do make it easier to discuss
the main forces that should determine the equity premium.
The assumption of lognormality implies that the log riskless
interest rate satisfies
[r.sub.f,t+1] = -[E.sub.t][m.sub.t+1] - [[sigma].sup.2.sub.m]/2.
(3)
The log riskless interest rate is negatively related to the
conditional expectation of the log SDF. When the SDF is expected to be
high, marginal utility in the future is expected to be high relative to
the present; the investor has an incentive to save, and this depresses
the equilibrium riskless interest rate. The log riskiess interest rate
also depends negatively on the conditional volatility of the log SDF.
Volatility produces a precautionary savings motive, which depresses the
riskless interest rate.
Also, the expected excess return on risky assets over the riskless
rate satisfies
[E.sub.t][r.sub.i,t+1] - [r.sub.f,t+1] + [[sigma].sup.2.sub.i]/2 =
-[[sigma].sub.im]. (4)
The variance term on the left hand side of (4) is a Jensen's
Inequality adjustment arising from the fact that we are describing
expectations of log returns. This term effectively converts the return
difference from a geometric average to an arithmetic average. It would
disappear if we rewrote the equation in terms of the log expectation of
the ratio of gross simple returns: log [E.sub.t][(1 + [R.sub.i,t+1])/(1
+ [R.sub.f,t+1])] = -[[sigma].sub.im].
The right hand side of (4) says that the risk premium is the
negative of the covariance of the asset with the stochastic discount
factor. An asset with a high expected return must have a low covariance
with the stochastic discount factor. Such an asset tends to have low
returns when investors have high marginal utility. It is risky in that
it fails to deliver wealth precisely when wealth is most valuable to
investors. Investors therefore demand a large risk premium to hold it.
The covariance [[sigma].sub.im] can be written as the product of
the standard deviation of the asset return [[sigma].sub.i], the standard
deviation of the stochastic discount factor [[sigma].sub.m], and the
correlation between the asset return and the stochastic discount factor
[[rho].sub.im]. Since [[rho].sub.im] [greater than or equal to] - 1,
-[[sigma].sub.im] [less than or equal to]
[[sigma].sub.i][[sigma].sub.m]. Substituting into (4),
[[sigma].sub.m] [greater than or equal to] [E.sub.t][[r.sub.i,t+1]
- [r.sub.f,t+1]] + [[sigma].sup.2.sub.i]/2/[[sigma].sub.i]. (5)
This inequality was first derived by Shiller (1982); a multi-asset
version was derived by Hansen and Jagannathan (1991). The right hand
side of (5) is the excess return on an asset, adjusted for Jensen's
Inequality, divided by the standard deviation of the asset's
return--a logarithmic Sharpe ratio for the asset. (5) says that the
standard deviation of the log stochastic discount factor must be greater
than this Sharpe ratio for all assets i, that is, it must be greater
than the maximum possible Sharpe ratio obtainable in asset markets.
Table 1 uses the data of Campbell (2003) and equation (5) to
illustrate the equity premium puzzle. For each country and sample period
the first column of the table reports the average excess return on stock
over short-term debt, adjusted for Jensen's Inequality by adding
one-half the sample variance of the excess log return to get a sample
estimate of the numerator in (5). This adjusted average excess return is
multiplied by 400 to express it in annualized percentage points. The
second column of the table gives the annualized standard deviation of
the excess log stock return, a sample estimate of the denominator in
(5). The third column gives the ratio of the first two columns,
multiplied by 100; this is a sample estimate of the lower bound on the
standard deviation of the log stochastic discount factor, expressed in
annualized percentage points. In the postwar US data the estimated lower
bound is a standard deviation greater than 50% a year; in the other
quarterly data sets it is below 10% for Italy, between 15% and 20% for
Australia and Canada, and above 30% for all the other countries. In the
annual data sets the lower bound on the standard deviation exceeds 30%
for all three countries. These are extraordinarily high volatilities
considering that the stochastic discount factor [M.sub.t+1] is a random
variable with a mean close to one that must always be positive.
2.1 The equity premium puzzle and consumption-based asset pricing
To understand why these numbers are disturbing, I now follow
Rubinstein (1976), Lucas (1978), Breeden (1979), Grossman and Shiller
(1979), Mehra and Prescott (1985) and other classic papers on the equity
premium puzzle and assume that there is a representative agent who
maximizes a time-separable power utility function defined over aggregate
consumption [C.sub.t]:
U([C.sub.t]) = [C.sup.1-[gamma].sub.t]-1 - [gamma], (6)
where [gamma] is the coefficient of relative risk aversion.
The assumption of power utility is not an arbitrary one. A
scale-independent utility function is required to explain the fact that
over the past two centuries, as wealth and consumption have grown
manyfold, riskiess interest rates and risk premia do not seem to have
trended up or down. Power utility is one of the few utility functions
that have this property. (3) Related to this, if different investors in
the economy have different wealth levels but the same power utility
function, then they can be aggregated into a single representative
investor with the same utility function as the individual investors.
Power utility implies that marginal utility U'([C.sub.t]) =
[C.sup.-[gamma].sub.t], and the stochastic discount factor [M.sub.t+1] =
[delta][([C.sub.t+1]/[C.sub.t]).sup.-[gamma]]. The assumption made
previously that the stochastic discount factor is conditionally
lognormal will be implied by the assumption that aggregate consumption
is conditionally lognormal (Hansen and Singleton 1983). Making this
assumption for expositional convenience, the log stochastic discount
factor is [m.sub.t+1] = log([delta]) - [gamma][DELTA][c.sub.t+1], where
[c.sub.t] = log([C.sub.t]).
Equation (3) now becomes
[r.sub.f,t+1] = -log [delta] + [gamma][E.sub.t][DELTA][c.sub.t+1] -
[[gamma].sup.2][[sigma].sup.2.sub.c]/2. (7)
Here [[sigma].sup.2.sub.c] denotes the unconditional variance of
log consumption innovations Var([c.sub.t+1] - [E.sub.t][c.sub.t+1]).
This equation says that the riskless real rate is linear in expected
consumption growth, with slope coefficient equal to the coefficient of
relative risk aversion. The conditional variance of consumption growth
has a negative effect on the riskless rate by stimulating precautionary
savings.
Equation (4) becomes
[E.sub.t][[r.sub.i,t+1] - [r.sub.f,t+1]] + [[sigma].sup.2.sub.i]/2
= [gamma][[sigma].sub.ic], (8)
where [[sigma].sub.ic]. denotes the unconditional covariance of
innovations Cov([r.sub.i,t+1] - [E.sub.t][r.sub.i,t+1], [c.sub.t+1] -
[E.sub.t][C.sub.t=1]). The log risk premium on any asset is the
coefficient of relative risk aversion times the covariance of the asset
return with consumption growth. Intuitively, an asset with a high
consumption covariance tends to have low returns when consumption is
low, that is, when the marginal utility of consumption is high. Such an
asset is risky and commands a large risk premium.
Table 1 uses (8) to illustrate the equity premium puzzle. As
already discussed, the first column of the table reports a sample
estimate of the left hand side of (8), multiplied by 400 to express it
in annualized percentage points. The second column reports the
annualized standard deviation of the excess log stock return, the fourth
column reports the annualized standard deviation of consumption growth,
the fifth column reports the correlation between the excess log stock
return and consumption growth, and the sixth column gives the product of
these three variables which is the annualized covariance
[[sigma].sub.ic] between the log stock return and consumption growth.
Finally, the table gives two columns with implied risk aversion
coefficients. The column headed RRA(l) uses (8) directly, dividing the
adjusted average excess return by the estimated covariance to get
estimated risk aversion. (4) The column headed RRA(2) sets the
correlation of stock returns and consumption growth equal to one before
calculating risk aversion. While this is of course a counterfactual
exercise, it is a valuable diagnostic because it indicates the extent to
which the ,equity premium puzzle arises from the smoothness of
consumption rather than the low correlation between consumption and
stock returns. The correlation is hard to measure accurately because it
is easily distorted by short-term measurement errors in consumption, and
Campbell (2003) shows that empirically it is quite sensitive to the
measurement horizon. By setting the correlation to one, the RRA(2)
column indicates the extent to which the equity premium puzzle is robust
to such issues. A correlation of one is also implicitly assumed in the
volatility bound for the stochastic discount factor, (5), and in many
calibration exercises such as Mebra and Prescott (1985) or Campbell and
Cochrane (1999).
Table 1 shows that the equity premium puzzle is a robust phenomenon
in international data. The coefficients of relative risk aversion in the
RRA(l) (8) column are generally extremely large. They are usually many
times greater than 10, the maximum level considered plausible by Mebra
and Prescott (1985). In a few cases the risk aversion coefficients are
negative because the estimated covariance of stock returns with
consumption growth is negative, but in these cases the covariance is
extremely close to zero. Even when one ignores the low correlation
between stock returns and consumption growth and gives the modelits best
chance by setting the correlation to one, the RRA(2) column still has
risk aversion coefficients above 10 in most cases.
2.2 Could the equity premium puzzle be spurious?
The risk aversion estimates in Table 1 are point estimates and are
subject to sampling error. No standard errors are reported for these
estimates. However authors such as Cecchetti, Lam, and Mark (1993) and
Kocherlakota (1996), studying the long-run annual US data, have found
small enough standard errors that they can reject risk aversion
coefficients below about 8 at conventional significance levels.
Of course, the validity of these tests depends on the
characteristics of the data set in which they are used. Rietz (1988) has
argued that there may be a peso problem in these data. A peso problem
arises when there is a small positive probability of an important event,
and investors take this probability into account when setting market
prices. If the event does not occur in a particular sample period,
investors will appear irrational in the sample and economists will
misestimate their preferences. While it may seem unlikely that this
could be an important problem in 100 years of annual data, Rietz (1988)
argues that an economic catastrophe that destroys almost all
stock-market value can be extremely unlikely and yet have a major
depressing effect on stock prices.
One difficulty with this argument is that it requires not only a
potential catastrophe, but one which affects stock market investors more
seriously than investors in short-term debt instruments. Many countries
that have experienced catastrophes, such as Russia or Germany, have seen
very low returns on short-term government debt as well as on equity. A
peso problem that affects both asset returns equally will affect
estimates of the average levels of returns but not estimates of the
equity premium. The major example of a disaster for stockholders that
did not negatively affect bondholders is the Great Depression of the
early 1930s, but of course this is included in the long-run annual data
for Sweden, the UK, and the US, all of which display an equity premium
puzzle.
Also, the consistency of the results across countries requires
investors in all countries to be concerned about catastrophes. If the
potential catastrophes are uncorrelated across countries, then it
becomes less likely that the data set includes no catastrophes; thus the
argument seems to require a potential international catastrophe that
affects all countries simultaneously.
Even if the equity premium puzzle is not entirely spurious, there
are several reasons to think that stock returns exceeded their true
long-run mean in the late 20th Century. Dimson, Marsh, and Staunton
(2002) present comprehensive international data for the whole 20th
Century and find that returns were generally higher in the later part of
the century. Siegel (1998) reports similar results for US data going
back to the early 19th Century. Fama and French (2002) point out that
average US stock returns in the late 20th Century were considerably
higher than accountants' estimates of the return on equity for US
corporations. Thus if one uses average returns as an estimate of the
true cost of capital, one is forced to the implausible conclusion that
corporations destroyed stockholder value by retaining and reinvesting
earnings rather than paying them out.
Unusually high stock returns in the late 20th Century could have
resulted from unexpectedly favorable conditions for economic growth. But
they could also have resulted from a correction of historical
mispricing, a one-time decline in the equity premium. Several economists
have recently argued that the equity premium is now far lower than it
was in the early 20th Century (Jagannathan, McGrattan, and Scherbina
2000, McGrattan and Prescott 2000). (5)
2.3 Could risk aversion be higher than we thought?
It is possible that the equity premium puzzle has an extremely
simple solution, namely that the coefficient of relative risk. aversion [gamma] is higher than economists traditionally thought. After all, it
is hard to get evidence about risk aversion from any other source than
asset markets. Experimental evidence is of very little use because it is
almost impossible to design experiments involving significant stakes,
and people should be almost indifferent with respect to small gambles.
(6) One might think that "thought experiments," or
introspection, would be sufficient to rule out very large values of
[gamma], but Kandel and Stambaugh (1991) point out that introspection
can deliver very different estimates of risk aversion depending on the
size of the gamble considered. This suggests that introspection can be
misleading or that some more general model of utility is needed.
The assumption of a high [gamma], however, leads to a second
puzzle. Equation (7) implies that the unconditional mean riskless
interest rate is
[Er.sub.f,t+1] = -log[delta] + [gamma]g -
[[gamma].sup.2][[sigma].sup.2.sub.c]/2, (9)
where g is the mean growth rate of consumption. Since g is
positive, as shown in Table 2, high values of [gamma] imply high values
of [gamma]g. Ignoring the term [[gamma].sup.2][[sigma].sup.2.sub.c]/2,
for the moment, this can be reconciled with low average short-term real
interest rates, shown in Table 2, only if the discount factor [delta] is
close to or even greater than one, corresponding to a low or even
negative rate of time preference. This is the riskfree rate puzzle
emphasized by Weil (1989).
Intuitively, the riskfree rate puzzle is that if investors are
risk-averse then with power utility they must also be extremely
unwilling to substitute intertemporally. Given positive average
consumption growth, a low riskiess interest rate and a high rate of time
preference, such investors would have a strong desire to borrow from the
future to reduce their average consumption growth rate. A low riskless
interest rate is possible in equilibrium only if investors have a low or
negative rate of time preference that reduces their desire to borrow.
(7)
Of course, if the risk aversion coefficient [gamma] is high enough
then the negative quadratic term [[gamma].sup.2][[sigma].sup.2.sub.c]/2,
in equation (9) dominates the linear term and pushes the riskless
interest rate down again. The quadratic term reflects precautionary
savings; risk-averse agents with uncertain consumption streams have a
precautionary desire to save, which can work against their desire to
borrow. But a reasonable rate of time preference is obtained only as a
knife-edge case.
Table 2 illustrates the riskfree rate puzzle in international data.
The table first shows the average riskfree rate, the mean consumption
growth rate, and the standard deviation of consumption growth. These
moments and the risk aversion coefficients calculated in Table 1 are
substituted into equation (9), and the equation is solved for an implied
time preference rate. The time preference rate is reported in percentage
points per year; it can be interpreted as the riskless real interest
rate that would prevail if consumption were known to be constant forever
at its current level, with no growth and no volatility. Risk aversion
coefficients in the RRA(2) range imply negative time preference rates in
every country except Switzerland, whereas larger risk aversion
coefficients in the RRA(1) range imply time preference rates that are
often positive but always implausible and vary wildly across countries.
The riskfree rate puzzle can be mitigated by use of the recursive preferences suggested by Epstein and Zin (1991) and Weil (1989). These
preferences allow the elasticity of intertemporal substitution to be a
free parameter, independent of the coefficient of relative risk
aversion, whereas power utility forces one to be the reciprocal of the
other. The riskfree rate puzzle is caused by a low elasticity of
intertemporal substitution rather than a high coefficient of relative
risk aversion. Direct evidence on the elasticity of intertemporal
substitution (Hall 1988, Campbell and Mankiw 1989) suggests that it is
fairly low, certainly well below one, although possibly higher than the
reciprocal of risk aversion.
2.4 The equity volatility puzzle
So far I have asked why average stock returns are so high, given
their volatility (and behavior of aggregate consumption). Now I ask
where the volatility itself comes from.
In order to understand the second moments of stock returns, it is
essential to have a framework relating movements in stock prices to
movements in expected future dividends and discount rates. The present
value model of stock prices is intractably nonlinear when expected stock
returns are time-varying, and this has forced researchers to use one of
several available simplifying assumptions. The most common approach is
to assume a discretestate Markov process either for dividend growth
(Mehra and Prescott 1985) or, following Hamilton (1989), for
conditionally expected dividend growth. The Markov structure makes it
possible to solve the present value model, but the derived expressions
for returns tend to be extremely complicated and so these papers usually
emphasize numerical results derived under specific numerical assumptions
about parameter values.
An alternative framework, which produces simpler closed-form
expressions and hence is better suited for an overview of the
literature, is the loglinear approximation to the exact present value
model suggested by Campbell and Shiller (1988). Campbell and
Shiller's loglinear relation between prices, dividends, and returns
provides an accounting framework: High prices must eventually be
followed by high future dividends or low future returns, and high prices
must be associated with high expected future dividends or low expected
future returns. Similarly, high returns must be associated with upward
revisions in expected future dividends or downward revisions in expected
future returns.
The loglinear approximation starts with the definition of the log
return on some asset i, [r.sub.i, t + 1] [equivalent to] log([P.sub.it +
1] + [D.sub.i, t+1) - log([P.sub.it]). The log return is a nonlinear
function of log prices [p.sub.it], and [p.sub.it + 1] and and log
dividends [d.sub.i,t + 1], but it can be approximated around the mean
log dividend-price ratio, ([d.sub.it] - [p.sub.it]), using a first-order
Taylor expansion. The resulting approximation is a stochastic difference
equation that can be solved forward to an infinite horizon if one is
willing to impose the terminal condition that [lim.sub.j [right
arrow][infinity]][[rho].sup.j][p.sub.i,t + j] = 0, effectively ruling
out explosive behavior of stock prices relative to dividends (the
"rational bubbles" of Blanchard and Watson (1982)). (8)
Finally, Campbell and Shiller take expectations to find that
[d.sub.it] - [p.sub.it] = -k/1 - [rho] + [E.sub.t] [summation over
([infinity]/j=0)] [[rho]'.sup.j] [[r.sub.i, t+1=j] -
[DELTA][d.sub.i, t+1+j]]. (10)
This equation says that the log dividend-price ratio is high when
dividends are expected to grow slowly, or when stock returns are
expected to be high. The equation should be thought of as an accounting
identity rather than a behavioral model; it has been obtained merely by
approximating an identity, solving forward subject to a terminal
condition, and taking expectations. Intuitively, if the stock price is
high today, then from the definition of the return and the terminal
condition that the stock price is non-explosive, there must either be
high dividends or low stock returns in the future. Investors must then
expect some combination of high dividends and low stock returns if their
expectations are to be consistent with the observed price.
Equation (10) describes the log dividend-price ratio rather than
the log price itself. This is a useful way to write the model because in
many data sets dividends appear to follow: a loglinear unit root
process, so that log dividends and log prices are nonstationary. In this
case changes in log dividends are stationary, so from (10) the log
price-dividend ratio is stationary provided that the expected stock
return is stationary. Thus log stock prices and dividends are
cointegrated, and the stationary linear combination of these variables
involves no unknown pararneters since it is just the log ratio.
Equation (10) can also be understood as a dynamic generalization of
the famous formula, usually attributed to Myron Gordon (1962) but
probably due originally to John Burr Williams (1938), that applies when
the discount rate is a constant R and the expected dividend growth rate
is a constant G:
D/P = R - G. (11)
So far I have written asset prices as linear combinations of
expected future dividends and returns. Campbell (1991) shows that it is
also possible to write asset returns as linear combinations of revisions
in expected future dividends and returns, but I do not pursue this
approach further here.
I now use this accounting framework to illustrate the stock market
volatility puzzle. The intertemporal budget constraint for a
representative agent, [W.sub.t+1] = (1 + [R.sub.p, t] + 1)([W.sub.t] -
[C.sub.t]), implies that aggregate consumption is the dividend on the
portfolio of all invested wealth, denoted by subscript w:
[d.sub.wt] = [c.sub.i] (12)
Many authors, including Grossman and Shiller (1981), Lucas (1978),
and Mehra and Prescott (1985), have assumed that the aggregate stock
market, denoted by subscript e for equity, is equivalent to the wealth
portfolio and thus pays consumption as its dividend. Here I follow
Campbell (1986) and Abel (1999) and make the slightly more general
assumption that the dividend on equity equals aggregate consumption
raised to a power [lambda]. In logs, we have
[d.sub.et] = [lambda][c.sub.i]. (13)
The coefficient [lambda] can be interpreted as a measure of
leverage. When [lambda] > 1, dividends and stock returns are more
volatile than the returns on the aggregate wealth portfolio. This
framework has the additional advantage that a riskless real bond with
infinite maturity--an inflation-indexed consol, denoted by subscript
b--can be priced merely by setting [lambda] = 0. The relative volatility of dividends and consumption suggests that [lambda] = 5 or 6 might be a
reasonable assumption.
The representative-agent asset pricing model with power utility,
conditional log-normality, and homoskedasticity implies that
[E.sub.i][r.sub.e,t+1] = [[mu].sub.e] +
[gamma][E.sub.t][DELTA][c.sub.t+1], (14)
where [[mu].sub.e] is an asset-specific constant term. The expected
log return on equity, like the expected log return on any other asset,
is just a constant plus relative risk aversion times expected
consumption growth. (9)
Substituting equations (13) and (14) into equation (10), I find
that
[d.sub.et] - [p.sub.et] = [k.sub.e]/1 - [rho] + ([gamma] -
[lambda]) [E.sub.t] [summation over ([infinity]/j=0)]
[[rho].sup.j][DETLA][c.sub.t+1+j]. (15)
Expected future consumption growth has offsetting effects on stock
prices. It has a direct positive effect by increasing expected future
dividends [lambda]-for-one, but it has an indirect negative effect by
increasing expected future real interest rates [gamma]-for-one.
These offsetting effects make it almost impossible for the standard
power utility model to explain the volatility of stock returns and their
positive correlation with consumption growth. We already know that the
coefficient of relative risk aversion must be large to explain the
equity premium puzzle. If [lambda] < [gamma], then good news about
future consumption drives down stock prices because the interest-rate
effect overwhelms the dividend effect. In this case positively
autocorrelated consumption growth implies that stock returns are
negatively correlated with consumption. If [lambda] = [gamma], then the
dividend-price ratio is constant and the volatility of stock returns is
just [lambda] times the volatility of consumption growth. Only if
[lambda] > [gamma] can we get stock returns to be positively
correlated with consumption growth, and an implausibly large is [lambda]
required to match the observed volatility of stock returns.
2.5 Do stock prices forecast dividend or earnings growth?
Of course, all these calculations are dependent on the assumption
made at the beginning of this subsection, that the log dividend on
stocks is a multiple [lambda] of log aggregate consumption. More general
models, allowing separate variation in dividends and consumption, can in
principle generate volatile stock returns from predictable variation in
dividend growth without creating offsetting variation in real interest
rates. But this explanation for stock market volatility requires that
the stock market forecasts dividend growth.
Campbell and Shiller (2003) present a simple graphical analysis
that makes it clear that stock prices have very little forecasting power
for future dividend growth. They point out that if a valuation ratio,
such as the dividend-price ratio, is stationary, then when the ratio is
at an extreme level either the numerator or the denominator of the ratio
must move in a direction that restores the ratio to a more normal level.
Something must be forecastable based on the ratio, either the numerator
or the denominator. In the case of the dividend-price ratio, a high
ratio must forecast either slow dividend growth or rapid price growth.
(10)
Does the dividend-price ratio forecast future dividend movements or
future price movements? To answer this question, Campbell and Shiller
use annual US data from 1872 to 2000, and present a pair of scatter
plots shown in Figure 1. Each scatterplot has the dividend-price ratio,
measured as the previous year's dividend divided by the January
stock price, on the horizontal axis. (The horizontal axis scale is
logarithmic but the axis is labeled in levels for ease of reference.)
Over this period the historical mean value for the dividend-price ratio
was 4.65%.
In the top part of the figure the vertical axis is the growth rate
of real dividends (measured logarithmically as the change in the natural
log of real dividends) over a time interval sufficient to bring the
dividend-price ratio back to its historical mean of 4.65%. More
precisely, the dividend growth rate is measured from the year preceding
the year shown until the year before the dividend-price ratio again
crossed 4.65%. Because dividends enter the dividend-price ratio with a
one-year lag, this is the appropriate way to measure growth in dividends
from the base level embodied in a given year's dividend-price ratio
to the level that prevailed when the dividend-price ratio next crossed
its historical mean.
Since 1872, the dividend-price ratio has crossed its mean value 29
times, with intervals between crossings ranging from one year to twenty
years (the twenty-year interval being between 1955 and 1975). The
different years are indicated on the scatter diagram by two-digit
numbers; a * after a number denotes a 19th Century date. The last year
shown is 1983, since this is the last year that was followed by the
dividend-price ratio crossing its mean. (The ratio has been below its
mean ever since.) A regression line is fit through these data points,
and a vertical line is drawn to indicate the dividend-price ratio at the
start of the year 2000. The implied forecast for dividend growth,
starting in the year 2000, is the horizontal dashed line marked where
the vertical line intersects the regression line.
It is obvious from the top part of Figure 1 that the dividend-price
ratio has done a poor job as a forecaster of future dividend growth to
the date when the ratio is again borne back to its mean value. The
regression line is nearly horizontal, implying that the forecast for
future dividend growth is almost the same regardless of the
dividend-price ratio. The R2 statistic for the regression is 0.25%,
indicating that only one-quarter of one percent of the variation of
dividend growth is explained by the initial dividend-price ratio.
It must follow, therefore, that the dividend-price ratio forecasts
movements in its denominator, the stock price, and that it is the stock
price that has moved to restore the ratio to its mean value. In the
lower part of Figure 1 the vertical axis shows the growth rate of real
stock prices (measured logarithmically as the change in log real stock
prices) between the year shown and the next year when the dividend-price
ratio crossed its mean value. The scatterplot shows a strong tendency
for the dividend-price ratio to predict future price changes. The
regression line has a strongly positive slope, and the R2 statistic for
the regression is 63%. This answers the question: It is the denominator
of the dividend-price ratio that brings the ratio back to its mean, not
the numerator.
There are several reasons to be cautious in interpreting the
results of Figure 1. First, the behavior of the dividend-price ratio can
be altered by shifts in corporate financial policy. A permanent shift
towards the use of share repurchases, for example, can reduce current
dividends but permanently increase the growth rate of dividends per
share by creating a steady decline in the number of shares outstanding.
This may have happened in recent years, in which case the low current
dividend-price ratio does not necessarily forecast low returns. To
address this concern, Campbell and Shiller (2001) look at earnings as
well as dividends. To eliminate the effects of short-run cyclical
variation in earnings, they average earnings over 10 years as
recommended in the classic investment text of Graham and Dodd (1934).
They find that the ratio of prices to smoothed earnings predicts price
variation rather than earnings variation, consistent with the results
just reported for dividends.
Second, the different points in the scatter diagram are not
independent of one another. There are not 120 independent observations
over 120 years; rather, there are only 29 independent observations
corresponding to the 29 occasions on which the dividend-price ratio
crossed its mean. If one uses a fixed horizon of 10 years, as Campbell
and Shiller do elsewhere in their study, there are only 12 independent
observations. Even allowing for this fact, however, the results appear
statistically significant in a Monte Carlo study reported by Campbell
and Shiller.
Third, the movements of the dividend-price and price-earnings ratio are extremely persistent. This can create serious statistical problems
with standard tests for predictability of returns. Campbell and Yogo
(2002) and Lewellen (2003), however, present modified tests that are
appropriate when predictor variables have near unit roots, and find that
these tests still deliver some evidence for predictability of returns.
Finally, the runup in stock prices in the late 1990's
diminished the statistical evidence that valuation ratios predict stock
returns. For several years in the late 1990's. the stock market
delivered high returns despite record low dividend-price ratios. This
evidence is not reflected in Figure 1 because the dividend-price ratio
has not yet returned to its mean. On the other hand, it is extremely
hard to rationalize the runup in prices using a model with a fixed
discount rate, because the implied dividend growth forecasts appear
wildly optimistic (Heaton and Lucas 1999); also the predictability of
dividend growth from the dividend-price ratio does not seem to have
increased. For these reasons I believe that the experience of the late
1990's is either an extreme version of previous swings in the stock
market, or possibly a one-time structural change to a permanently lower
equity premium; in either case it does not alter the overall message of
Figure 1. In the next part of this paper, I will discuss alternative
explanations of equity volatility, and their implications for portfolio
management.
3. Explaining equity volatility
In the previous part of this paper, I discussed two puzzles of
asset pricing, the equity premium puzzle and the equity volatility
puzzle. Several solutions to the equity premium puzzle are potentially
available. For example, investors may have higher risk aversion than
economists used to think; returns may have been unusually high in the
late 20th Century; and these high returns may have been caused in part
by a one-time correction of historical equity mis-pricing. In this case
future returns will tend to be lower than historical returns, and the
equity premium will diminish as a focus of academic attention.
The situation is not so favorable with respect to the equity
volatility puzzle. This puzzle raises fundamental questions about the
relationship between aggregate consumption and aggregate wealth. Since
consumption is ultimately financed by wealth (broadly defined to include
human wealth), any model with stationary asset returns implies that the
ratio of consumption to wealth must be stationary. Since consumption and
wealth appear individually to have unit roots, this implies that
consumption and wealth are cointegrated. In the very long run, then, the
annualized growth rates of consumption and wealth must be almost
identical; in particular, they must have identical volatilities. The
difficulty is that in the short run, the volatility of consumption
growth is far smaller than the volatility of wealth growth. (11)
Consumption is very smooth, while wealth is very volatile.
How can we reconcile the observed short-run properties of
consumption and wealth with the properties we know they must have in the
long run? There are only two possibilities. First, it may be that the
annualized volatility of consumption growth increases with the horizon
over which it is measured, so that ultimately it reaches the high
volatility of wealth growth. This would require that consumption is not
a random walk, but has positive serial correlation in growth rates.
Second, it may be that the annualized volatility of wealth growth
decreases with the horizon over which it is measured, so that ultimately
it reaches the low volatility of consumption growth. This would require
that wealth is not a random walk, but has negative serial correlation in
growth rates. These two possibilities represent fundamentally different
views of the world. Is the world safe as suggested by consumption, or
risky as suggested by the stock market?
Recent work of Lettau and Ludvigson (2001) suggests that
consumption, not wealth, accurately represents long-term risk. Lettau
and Ludvigson use US Flow of Funds data to construct a proxy for total
asset wealth, including not only equities but also other assets such as
real estate. They use labor income to proxy for human wealth, arguing
that labor income and human wealth should be cointegrated. They analyze
the three aggregate time series for consumption, labor income, and asset
wealth, and find that the three are cointegrated (even though no two of
them are cointegrated). The stationary linear combination of these
variables forecasts wealth, not consumption or labor income. In their
data consumption is extremely close to a random walk. Thus Lettau and
Ludvigson find that wealth is mean-reverting and adjusts over long
horizons to match the smoothness of consumption. A satisfactory model of
equity volatility must be consistent with this finding.
The loglinear asset pricing framework of Campbell and Shiller
(1988) and Campbell (1991) allows us to divide explanations for equity
volatility into several categories. First, equity volatility might be
caused by predictable variation in dividend growth (equivalent to
predictable variation in consumption growth if equities are modeled as
consumption claims, that is, as proxies for aggregate wealth). The
empirical difficulty with this explanation is that stock prices are not
good forecasters of consumption or dividend growth. There is also a
theoretical difficulty that predictable variation in consumption growth
should cause offsetting movements in real interest rates that dampen the
effect on stock prices.
If equity volatility is not caused by predictable variation in cash
flows, then it must be caused by variation in discount rates. The first
and most obvious component of the equity discount rate is the riskless
real interest rate. There is some variation in the real interest rate;
unfortunately it is not large enough to cause big swings in the stock
market as pointed out by Campbell (1991). Also, the timing of real
interest rate movements seems to be quite different from the timing of
stock market movements. The 1970's, for example, saw low real rates
and a depressed stock market, whereas the 1980's saw much higher
real rates and a buoyant stock market. For this reason stock prices are
not good forecasters of real interest rates (Campbell 2003).
The remaining component of the equity discount rate is the equity
premium, the expected excess return on stocks over short-term debt.
Stock market valuation ratios have historically predicted stock returns
over long horizons, consistent with the view that stock market movements
are driven by movements in the equity premium itself.
The equity premium can be thought of as volatility times the reward
for bearing volatility, or the quantity of risk times the price of risk.
Equity volatility does move over time, and does correlate positively
with return forecasts, rising during recessions and stock market
declines. However these movements of volatility are not proportional to
movements in returns as pointed out by Campbell (1987) and Harvey
(1989). Thus we are forced inexorably to the conclusion that the price
of risk itself must be moving over time. Since stock prices tend to
increase when the economy is strong and consumption is growing rapidly,
the price of risk must be countercyclical, moving opposite to
consumption growth. I now explore alternative structural models that can
generate countercyclical time-variation in the price of risk.
One class of models works within a representative-investor
framework and asks what preferences might generate countercyclical risk
aversion. Models of habit formation, such as Constantinides (1990) and
Campbell and Cochrane (1999), have this property, and I discuss these
models in detail in the next section. Countercylical risk aversion also
arises naturally in behavioral finance models that combine the prospect
theory of Kahneman and Tversky (1979) with the "house money
effect" of Thaler and Johnson (1990), that is, the tendency of
investors to worry less about losses that offset prior gains (Barberis,
Huang, and Santos 2001).
A second class of models emphasizes the aggregation of
heterogeneous agents. Each individual agent might have constant risk
aversion, yet they might interact in such a way that the representative
agent has time-varying risk aversion. Different models emphasize
different types of heterogeneity. There might be heterogeneous
constraints, so that some investors are constrained from stock market
participation or are prevented from diversifying their stock portfolios
(Constantinides, Donaldson, and Mehra 2002, Heaton and Lucas 1999,
Vissing-Jorgensen 2002). A relaxation of such constraints allows equity
risk to be shared more broadly, driving down the equilibrium price of
risk. This story might explain a one-time decline in the equity premium
in the late 20th Century, but is less suitable for explaining recurring
cyclical variation in the price of risk. (12)
Investors might also have heterogeneous uninsurable labor income
(Constantinides and Duffie 1996). Variation in the degree of
idiosyncratic risk can cause a high and possibly time-varying equity
premium. Heterogeneous risk aversion may also be important (Wang 1996,
Chan and Kogan 2002). In this case high stock returns would tend to
increase the wealth of risk-tolerant investors, increasing their weight
in the aggregate and driving down the risk-aversion of the
representative investor.
A third class of models emphasizes irrational expectations on the
part of at least some investors. Hansen, Sargent, and Tallarini (1999)
have emphasized that pessimism about long-run growth prospects can
explain both the equity premium and riskfree rate puzzles. (13) The
extrapolation of shocks to growth rates ("irrational
exuberance" if the shocks are positive and irrational gloom if they
are negative) can generate a time-varying and counter-cyclical price of
risk (Barsky and De Long 1993, Barberis, Shleifer, and Vishny 1998,
Cecchetti, Lam, and Mark 2000, Shiller 2000).
3.1 Habit formation
Sundaresan (1989) and Constantinides (1990) have argued for the
importance of habit formation, a positive effect of today's
consumption on tomorrow's marginal utility of consumption.
Two modeling issues arise at the outset. Writing the period utility
function as U([C.sub.t], [X.sub.t]), where [X.sub.t], is the
time-varying habit or subsistence level, the first issue is the
functional form for U(*). Abel (1990) has proposed that U(*) should be a
power function of the ratio C/[X.sub.t], while most other researchers
have used a power function of the difference [C.sub.t] - [X.sub.t]. The
second issue is the effect of an agent's own decisions on future
levels of habit. In standard "internal habit" models such as
those in Constantinides (1990) and Sundaresan (1989), habit depends on
an agent's own consumption and the agent takes account of this when
choosing how much to consume. In "external habit" models such
as those in Abel (1990) and Campbell and Cochrane (1999), habit depends
on aggregate consumption which is unaffected by any one agent's
decisions. Abel calls this "catching up with the Joneses."
Similar results can be obtained in either class of model, but external
habit models are genera lly easier to work with.
The choice between ratio models and difference models of habit is
important because ratio models have constant risk aversion whereas
difference models have time-varying risk aversion. In Abel's (1990)
ratio model, external habit adds a term to the equation describing the
riskless interest rate, but does not change the equation that describes
the excess return of risky assets over the riskless interest rate. The
effect on the riskless interest rate has to do with intertemporal
substitution. Holding consumption today and expected consumption
tomorrow constant, an increase in consumption yesterday increases the
marginal utility of consumption today. This makes the representative
agent want to borrow from the future, driving up the real interest rate.
This instability of the riskless real interest rate is a
fundamental problem for habit formation models. Time-nonseparable
preferences make marginal utility volatile even when consumption is
smooth, because consumers derive utility from consumption relative to
its recent history rather than from the absolute level of consumption.
But unless the consumption and habit processes take particular forms,
time-nonseparability also creates large swings in expected marginal
utility at successive dates, and this implies large movements in the
real interest rate. I now present an alternative specification in which
it is possible to solve this problem, and in which risk aversion varies
over time.
Campbell and Cochrane (1999) build a model with external habit
formation in which a representative agent derives utility from the
difference between consumption and a time-varying subsistence or habit
level. They assume that log consumption follows a random walk with mean
g and innovation [[member of].sub.t+1] This is a fairly good
approximation for U.S. data. The utility function of the representative
agent is a time-separable power utility function, with curvature [gamma], of the difference between consumption [C.sub.t] and habit
[X.sub.t] Utility is only defined when consumption exceeds habit.
It is convenient to capture the relation between consumption and
habit by the surplus consumption ratio [S.sub.t] defined by
[S.sub.t] [equivalent to] [C.sub.t] - [X.sub.t]/[C.sub.t]. (16)
The surplus consumption ratio is the fraction of consumption that
exceeds habit and is therefore available to generate utility. The SDF in
this model is given by
[M.sub.t+1] = [delta] [([S.sub.t+1]/[S.sub.t]).sup.-[gamma]]
[([C.sub.t+1]/[C.sub.t]).sup.-[gamma]]. (17)
The SDF is driven by proportional innovations in the surplus
consumption ratio, as well as by proportional innovations in
consumption. If the surplus consumption ratio is only a small fraction
of-consumption, then small shocks to consumption can be large shocks to
the surplus consumption ratio; thus the SDF can be highly volatile even
when consumption is smooth.
Even more important, the volatility of the SDF is itself
time-varying since it depends on the level of the surplus consumption
ratio. Shocks to consumption have a larger proportional effect on
[S.sub.t] when [S.sub.t] is small than when it is large:
C/S dS/dC = 1 - S/S. (18)
Hence investors are more averse to consumption risk when [S.sub.t]
small. If habit [X.sub.t] is held fixed as consumption [C.sub.t] varies,
the local coefficient of relative risk aversion is
-[Cu.sub.CC]/[u.sub.C] = [gamma]/[S.sub.t], (19)
where [u.sub.C] and [u.sub.CC] are the first and second derivatives
of utility with respect to consumption. Risk aversion rises as the
surplus consumption ratio [S.sub.t] declines, that is, as consumption
approaches the habit level. Note that [gamma], the curvature parameter
in utility, is no longer the coefficient of relative risk aversion in
this model.
To complete the description of preferences, one must specify how
the habit [X.sub.t] evolves over time in response to aggregate
consumption. Campbell and Cochrane suggest an AR(1) model for the log
surplus consumption ratio, s, [equivalent to] log(S,):
[s.sub.t+1] = (1 - [phi])s + [phi][s.sub.t] +
[lambda]([s.sub.t])[[member of].sub.t+1]. (20)
The parameter governs the persistence of the log surplus
consumption ratio, while the "sensitivity function"
[lambda]([s.sub.t]) controls the sensitivity of [s.sub.t+1] and thus of
log habit [x.sub.t + 1] to innovations in consumption growth [[member
of].sub.t + 1]. This modeling strategy ensures that the habit process
implied by a process for [s.sub.t + 1] always lies below consumption.
The logic of Hansen and Jagannathan (1991) implies that the largest
possible Sharpe ratio is given by the conditional standard deviation of
the log SDF. This is [gamma][sigma](1 + [lambda]([s.sub.t])), so a
sensitivity function that varies inversely with [s.sub.t] delivers a
timevarying, countercyclical Sharpe ratio.
The same mechanism helps to stabilize the riskless real interest
rate. When the surplus consumption ratio falls, investors have an
intertemporal-sub stitution motive to borrow from the future, but this
is offset by an increased precautionary savings motive created by the
volatility of the SDF. Campbell and Cochrane parameterize the model so
that these two effects exactly cancel. This makes the riskless real
interest rate constant, a knife-edge case that helps to reveal the pure
effects of time-varying risk aversion on asset prices. With a constant
riskless rate, real bonds of all maturities are also riskless and there
are no real term premia. Thus the equity premium is also a premium of
stocks over long-term bonds.
When this model is calibrated to fit the first two moments of
consumption growth, the average riskless interest rate, and the Sharpe
ratio on the stock market, it also roughly fits the volatility,
predictability, and cyclicality of stock returns. The model does not
resolve the equity premium puzzle, since it relies on high average risk
aversion, but it does resolve the stock market volatility puzzle.
The Campbell-Cochrane model assumes random walk consumption and
implies negative autocorrelation of stock returns. The Constantinides
(1990) model of habit formation, by contrast, assumes UD asset returns
and implies positive autocorrelation of consumption growth. Thus these
two models take different stands on the question of whether wealth or
consumption accurately represents long-run risk. The Constantinides
model fits the equity premium with low risk-aversion, but it achieves
this success at the cost of a positively serially correlated consumption
process that contradicts the empirical findings of Lettau and Ludvigson
(2001).
3.2 Heterogeneous labor income
The heterogeneity of utility-maximizing stock market investors may
have important effects. For example, if investors are subject to large
idiosyncratic risks in their labor income and can share these risks only
indirectly by trading a few assets such as stocks and Treasury bills,
their individual consumption paths may be much more volatile than
aggregate consumption. Even if individual investors have the same power
utility function, so that any individual's consumption growth rate
raised to the power-[gamma] would be a valid stochastic discount factor,
the aggregate consumption growth rate raised to the power-[gamma] may
not be a valid stochastic discount factor.
This problem is an example of Jensen's Inequality. Since
marginal utility is nonlinear, the average of investors' marginal
utilities of consumption is not generally the same as the marginal
utility of average consumption. The problem disappears when
investors' individual consumption streams are perfectly correlated
with one another as they will be in a complete markets setting. Grossman
and Shiller (1982) point out that it also disappears in a
continuous-time model when the processes for individual consumption
streams and asset prices are diffusions.
Constantinides and Duffie (1996) have provided a simple framework
within which the effects of heterogeneity can be understood.
Constantinides and Duffie postulate an economy in which individual
investors k have different consumption levels [C.sub.kt]. The
cross-sectional distribution of individual consumption is lognormal, and
the change from time t to time t + 1 in individual log consumption is
cross-sectionally uncorrelated with the level of individual log
consumption at time t. All investors have the same power utility
function with time discount factor [delta] and coefficient of relative
risk aversion [gamma].
In this economy each investor's own intertemporal marginal
rate of substitution is a valid stochastic discount factor. Hence the
cross-sectional average of investors' intertemporal marginal rates
of substitution is a valid stochastic discount factor. I write this as
[M.sup.*.sub.t+1] [equivalent to] [sigma][E.sup.*.sub.t+1]
[[([C.sub.k,t+1]/[C.sub.kt]).sup.-[gamma]]], (21)
where [E.sup.*.sub.t] denotes an expectation taken over the
cross-sectional distribution at time t. That is, for any
cross-sectionally random variable [X.sub.kt], [E.sup.*.sub.t][X.sub.kt]
[equivalent to] [lim.sub.k[right arrow][infinity]] (1/K) [summation over
(K/k=1)] [X.sub.kt], the limit as the number of cross-sectional units
increases of the cross-sectional sample average of [X.sub.kt]. Note that
[E.sup.*.sub.t][X.sub.kt] will in general vary over time and need not be
lognormally distributed conditional on past information.
An economist who knows the underlying preference parameters of
investors but does not understand the heterogeneity in this economy
might attempt to construct a representative-agent stochastic discount
factor, [M.sup.RA.sub.t+1], using aggregate consumption:
[M.sup.RA.sub.t+1] [equivalent to] [delta]
[([E.sup.*.sub.t+1][[C.sub.k,t+1]]/[E.sup.*.sub.t][[C.sub.kt]]).sup.-
[gamma]]. (22)
Using the assumptions on the cross-sectional distribution of
consumption, the difference between the valid log stochastic discount
factor [m.sup.*.sub.t+1] and the invalid log representative-agent
stochastic discount factor [m.sup.RA.sub.t+1] can be written as
[m.sup.*.sub.t+1] - [m.sup.RA.sub.t+1] = [gamma]([gamma] + 1)/2
[Var.sup.*.sub.t+1][DELTA][C.sub.k,t+1], (23)
where [Var.sup.*.sub.t] is defined analogously to [E.sup.*.sub.t]
as [Var.sup.*.sub.t][X.sub.kt] = [lim.sub.K[right arrow][infinity]]
(1/K) [summation over (K/k = 1)] [([X.sub.kt] - [E.sup.*.sub.t]
[X.sub.kt]).sup.2], and like [E.sup.*.sub.t] will in general vary over
time.
The time series of this difference can have a nonzero mean, helping
to explain the riskfree rate puzzle, and a nonzero variance, helping to
explain the equity premium puzzle. If the cross-sectional variance of
log consumption growth is negatively correlated with the level of
aggregate consumption, so that idiosyncratic risk increases in economic
downturns, then the true stochastic discount factor [m.sup.*.sub.t+1]
will be more strongly countercyclical than the representative-agent
stochastic discount factor constructed using the same preference
parameters; this has the potential to explain the high price of risk
without assuming that individual investors have high risk aversion.
Mankiw (1986) makes a similar point in a two-period model. It is also
possible that the correlation between idiosyncratic risk and aggregate
consumption itself moves over time in such a way that the price of risk
is time-varying.
An important unresolved question is whether the heterogeneity we
can measure has the characteristics that are needed to help resolve the
asset pricing puzzles. In the Constantinides-Duffie model the
heterogeneity must be large to have important effects on the stochastic
discount factor; a cross-sectional standard deviation of log consumption
growth of 20 percent, for example, is a cross-sectional variance of only
0.04, and it is variation in this number over time that is needed to
explain the equity premium puzzle. Interestingly, the effect of
heterogeneity is strongly increasing in risk aversion since
[Var.sup.*.sub.t+1] [DELTA][c.sub.k,t + 1] is multiplied by
[gamma]([gamma] + 1)/2 in (23). This suggests that heterogeneity may
supplement high risk aversion but cannot altogether replace it as an
explanation for the equity premium puzzle.
Cogley (1998) looks at consumption data and finds that measured
heterogeneity has only small effects on the SDF. Lettau (2002) reaches a
similar conclusion by assuming that individuals consume their income,
and calculating the risk-aversion coefficients needed to put model-based
stochastic discount factors inside the Hansen-Jagannathan volatility
bounds. This procedure is conservative in that individuals trading in
financial markets are normally able to achieve some smoothing of
consumption relative to income. Nevertheless Lettau finds that high
individual risk aversion is still needed to satisfy the
Hansen-Jagannathan bounds.
These conclusions may not be surprising given the Grossman-Shiller
(1982) result that the aggregation problem disappears in a
continuous-time diffusion model. In such a model, the cross-sectional
variance of consumption is locally deterministic and hence the false SDF
[M.sup.RA.sub.t+1] correctly prices risky assets. In a discrete-time
model the cross-sectional variance of consumption can change randomly
from one period to the next, but in practice these changes are likely to
be small. This limits the effects of consumption heterogeneity on asset
pricing.
It is also important to note that idiosyncratic shocks are assumed
to be permanent in the Constantinides-Duffie model. Heaton and Lucas
(1996) calibrate individual income processes to micro data from the
Panel Study of Income Dynamics (PSID). Because the PSID data show that
idiosyncratic income variation is largely transitory, Heaton and Lucas
find that investors can minimize its effects on their consumption by
borrowing and lending. This prevents heterogeneity from having any large
effects on aggregate asset prices.
To get around this problem, several recent papers have combined
heterogeneity with constraints on borrowing. Heaton and Lucas (1996) and
Krusell and Smith (1997) find that borrowing constraints or large costs
of trading equities are needed to explain the equity premium.
Constantinides, Donaldson, and Mebra (2002) focus on heterogeneity
across generations. In a stylized three-period overlapping generations
model young agents have the strongest desire to hold equities because
they have the largest ratio of labor income to financial wealth. If
these agents are prevented from borrowing to buy equities, the
equilibrium equity premium is large.
Heterogeneity in preferences may also be important. Several authors
have recently argued that trading between investors with different
degrees of risk aversion or time preference, possibly in the presence of
market frictions or portfolio insurance constraints, can lead to
time-variation in the market price of risk (Dumas (1989), Grossman and
Zhou (1996), Wang (1996), Chan and Kogan (2002)). Intuitively,
risk-tolerant agents hold more risky assets so they control a greater
share of wealth in good states than in bad states; aggregate risk
aversion therefore falls in good states, producing effects similar to
those of habit formation.
3.3 Irrational expectations
A number of papers have explored the consequences of relaxing the
assumption that investors have rational expectations and understand the
behavior of dividend and consumption growth. In the absence of
arbitrage, there exist positive state prices that can rationalize the
prices of traded financial assets. These state prices equal subjective
state probabilities multiplied by ratios of marginal utilities in
different states. Thus given any model of utility, there exist
subjective probabilities that produce the necessary state prices and in
this sense explain the observed prices of traded financial assets. The
interesting question is whether these subjective probabilities are
sufficiently close to objective probabilities, and sufficiently related
to known psychological biases in behavior, to be plausible.
Many of the papers in this area work in partial equilibrium and
assume that stocks are priced by discounting expected future dividends
at a constant rate. This assumption makes it easy to derive any desired
behavior of stock prices directly from assumptions on dividend
expectations. Barsky and De Long (1993), for example, assume that
investors believe dividends to be generated by a doubly integrated
process, so that the dividend growth rate has a unit root. These
expectations imply that rapid dividend growth increases stock prices
more than proportionally, so that the price-dividend ratio rises when
dividends are growing strongly. If dividend growth is in fact
stationary, then the high price-dividend ratio is typically followed by
dividend disappointments, low stock returns, and reversion to the
long-run mean price-dividend ratio. Under this assumption of stationary
dividend growth, Barsky and DeLong's model produces overreaction of
stock prices to dividend news, and this accounts for the equity
volatility pu zzle and the predictability of stock returns. (14)
Another potentially important form of irrationality is a failure to
understand the difference between real and nominal magnitudes.
Modigliani and Cohn (1979) argued that investors suffer from inflation
illusion, in effect discounting real cash flows at nominal interest
rates. Ritter and Warr (2002) and Sharpe (1999) argue that inflation
illusion may have led investors to bid up stock prices as inflation has
declined since the early 1980s. An interesting issue raised by this
literature is whether misvaluation is caused by a high level of
inflation (in which case it is unlikely to be important today) or
whether it is caused by changes in inflation from historical benchmark
levels (in which case it may contribute to high current levels of stock
prices).
A limitation of these models is that they do not consider general
equilibrium issues, in particular the implication of irrational beliefs
for aggregate consumption. Using for simplicity the fiction that
dividends equal consumption, investors' irrational expectations
about dividend growth should be linked to their irrational expectations
about consumption growth. Interest rates are not exogenous, but like
stock prices, are determined by investors' expectations. Thus it is
significantly harder to build a general equilibrium model with
irrational expectations.
To see how irrationality can affect asset prices in general
equilibrium, consider first a static model in which log consumption
follows a random walk with drift. Investors understand that consumption
is a random walk, but they underestimate its drift. Such irrational
pessimism lowers the average riskfree rate, increases the equity
premium, and has an ambiguous effect on the price-dividend ratio. Thus
pessimism has the same effects on asset prices as a low rate of time
preference and a high coefficient of risk aversion, and it can help to
explain both the riskfree rate puzzle and the equity premium puzzle
(Hansen, Sargent, and Tallarmni 1999).
To explain the volatility puzzle, a more complicated model of
irrationality is needed. Suppose now that log consumption growth follows
an AR(1) process, but that investors overestimate the persistence of
this process. In this model the equity premium falls when consumption
growth has been rapid, and rises when consumption growth has been weak.
This model, which can be seen as a general equilibrium version of Barsky
and De Long (1993) or Shiller (2000) fits the apparent cyclical
variation in the market price of risk. One difficulty with this story is
that it has strong implications for bond market behavior. When investors
become irrationally exuberant, their optimism should lead to a strong
desire to borrow from the future, which should drive up the riskless
interest rate even while it drives down the equity premium. Cecchetti,
Lam, and Mark (2000) handle this problem by allowing the degree of
investors' irrationality itself to be stochastic and time-varying.
4. Implications for portfolio choice
I have argued that the price of risk is time varying. It follows
that a rational investor, who lives entirely off financial wealth
without idiosyncratic labor income, must have time-varying risk aversion
in order to buy and hold an aggregate equity index. This leads naturally
to the question, what should a rational investor do if he lives off
financial wealth and has constant risk aversion?
This topic of portfolio choice is the original subject of modem
financial economics. Mean-variance analysis, developed almost fifty
years ago by Markowitz (1952), has provided a basic paradigm for
portfolio choice. This approach usefully emphasizes the ability of
diversification to reduce risk, but it ignores several critically
important factors. Most notably, the analysis is static; it assumes that
investors care only about risks to wealth one period ahead. However many
investors, both individuals and institutions such as charitable
foundations or universities, seek to finance a stream of consumption
over a long lifetime.
Merton (1969, 1971, 1973) showed thirty years ago that the solution
to a long-term portfolio choice problem can be very different from the
solution to a short-term problem. In particular, if investment
opportunities are varying over time, then long-term investors care about
shocks to investment opportunities--the productivity of wealth--as well
as shocks to wealth itself. They may seek to hedge their exposures to
wealth productivity shocks, and this gives rise to intertemporal hedging
demands for financial assets. Brennan, Schwartz, and Lagnado (1997) have
coined the phrase "strategic asset allocation" to describe
this far-sighted response to time-varying investment opportunities.
Unfortunately Merton's intertemporal model is hard to solve.
Until recently solutions to the model were only available in those
trivial cases where it reduces to the static model. Therefore the Merton
model has not become a usable empirical paradigm, has not displaced the
Markowitz model, and has had only limited influence on investment
practice. Recently this situation has begun to change as a result of
advances in both analytical and numerical methods. A new empirical
paradigm is emerging. Interestingly, this paradigm both supports and
qualifies traditional rules of thumb used by financial planners.
Campbell and Viceira (1999, 2001, 2002) present an integrated empirical
approach to the recent portfolio choice literature.
Time-variation of the equity premium has two effects on optimal
portfolio choice for investors with constant risk aversion. First, it
implies that investors should "time the market," increasing
their equity allocations at times when the equity premium is high and
reducing them at times when the equity premium is low. (15)
A second effect on portfolio choice arises from the fact that the
equity premium tends to fall when stock prices rise, because valuation
ratios such as D/P move inversely with prices. This implies
mean-reversion in stock returns, that is, a tendency for the annualized
volatility of returns to fall with the investment horizon. Direct
evidence for reduction in volatility at long horizons is presented by
Siegel (1998). Campbell and Viceira (2002) use a simple time-series
model, related to the evidence presented earlier on stock return
predictability, to generate implied volatilities of returns on stocks,
bonds, and Treasury bills at all horizons. Their results are summarized
graphically in Figures 2 and 3 for quarterly postwar and long-term
annual US data. In both data sets equity volatility is in the range 16%
to 18% over one year, but it falls to 9% in the quarterly data and 13%
in the annual data over longer holding periods. The volatility of
Treasury bill investments, by contrast, increases with the holding
period because real interest rates vary over time in a persistent
fashion.
Mean-reversion in stock returns creates a horizon effect on
portfolio choice: Long-term investors may invest differently from
short-term investors. The reduction in long-term stock market risk is
directly relevant for long-term buy-and-hold investors (Barberis 2000).
These investors will increase their equity holdings, relative to the
holdings of otherwise identical short-term investors, because they
perceive equities as having lower risk.
Long-term investors who can rebalance their portfolios each period
have intertemporal hedging demand (Merton 1973). They may wish to hedge
the risk that future investment opportunities will deteriorate. If their
risk aversion is greater than one, they wish to hold assets that
increase in value when investment opportunities deteriorate. The most
obvious example of such an asset is an inflation-indexed bond, whose
value increases when real interest rates fall. But stocks also have this
property, because an increase in stock prices signals a decrease in
future stock returns and thus a deterioration in investment
opportunities (Campbell and Viceira 1999).
Figure 4 illustrates alternative portfolio rules. The horizontal
axis shows the equity premium, with its long-run average marked by a
vertical dashed line. In the presence of mean-reversion, the equity
premium will fall if stock prices have risen (one will move to the left
in the diagram) and will rise if stock prices have fallen (one will move
to the right). The vertical axis shows the portfolio allocation to
stocks, assuming that the alternative is to hold cash at a constant
riskless interest rate and that there are no constraints on leverage or
short sales.
The three lines in the figure are three alternative portfolio
rules. The horizontal line marked "Myopic Investor" is the
traditional buy-and-hold allocation that would come out of a
single-period mean-variance analysis, ignoring time-variation in the
equity premium. The sloped line marked "Tactical Investor" is
the allocation that would be recommended by single-period mean-variance
analysis that takes account of time-variation in the equity premium, in
the manner of commercial tactical asset allocation strategies. This line
passes through the origin, because an equity premium of zero would imply
zero allocation to stocks. The sloped line marked "Strategic
Investor" is the optimal portfolio rule derived by Campbell and
Viceira (1999) for long-term investors with constant relative risk
aversion greater than one. It has almost the same slope as the tactical
portfolio rule (if anything it is slightly steeper), but it is shifted
upward by positive intertemporal hedging demand. A strategic investor
should hold some equities even if the equity premium temporarily dips to
zero, in order to hedge against further deterioration in investment
opportunities.
It is interesting to relate these results to recent discussions of
stock market risk. Equities have traditionally been regarded as risky
assets. They may be attractive because of their high average returns,
but these returns represent compensation for risk; thus equities should
be treated with caution by all but the most aggressive investors. In
recent years, however, authors such as Siegel (1998) and Glassman and
Hassett (1999) have argued that equities are actually relatively safe
assets for investors who are able to hold for the long term.
The revisionist view that stocks are safe assets is based on the
evidence that excess stock returns are less volatile when they are
measured over long holding periods. Mathematically, such a reduction in
stock market risk at long horizons can only be due to mean-reversion in
excess stock returns, which is equivalent to time-variation in the
equity premium. Yet revisionist investment advice typically ignores the
implications of a time-varying equity premium. Siegel (1998) recommends
an aggressive buy-and-hold strategy, like the horizontal line in Figure
4 but shifted upwards to reflect the reduced risk of stocks for
long-term investors. The optimal policy is instead the sloped line
marked "Strategic Investor" in Figure 4.
The difference between the optimal strategy and the strategy
recommended by Siegel is particularly dramatic in the aftermath of a
bull market in equities. At such a time, the optimal equity allocation
may be no higher--it may even be lower--than the allocation implied by a
traditional short-term portfolio analysis. To put it another way,
investors who are attracted to the stock market by the prospect of high
returns combined with low long-term risk are trying to have their cake
and eat it too. If expected stock returns are constant over time, then
one can hope to earn high stock returns in the future similar to the
high returns of the past; but in this case stocks are much riskier than
bonds in the long term, just as they are in the short term. If instead
stocks mean-revert, then they are relatively safe assets for long-term
investors; but in this case future returns are likely to be meagre as
mean-reversion unwinds the spectacular stock market runup of the last
two decades of the 20th Century.
It is important to keep in mind two limitations of this portfolio
analysis. First, it ignores constraints that might prevent investors
from short-selling or from borrowing to invest in risky assets. The
Siegel strategy of buying and holding stocks might be much closer to
optimal for an aggressive investor who cannot borrow to leverage a stock
market position, and who therefore normally holds the maximum 100%
weight in equities.
Second, I have solved the microeconomic portfolio choice problem of
a rational investor with constant relative risk aversion and no human
wealth, but such an investor cannot be the representative investor. As
we discussed earlier on, the representative investor must have different
preferences, constraints, or beliefs in order to be content to hold the
aggregate wealth portfolio. Thus the portfolio advice of Figure 4 can
only be used by atypical investors.
5. Conclusion
In this paper I have described two puzzles of asset pricing, the
equity premium puzzle and the equity volatility puzzle. The equity
premium puzzle may in the end be explained by a combination of factors,
including both high risk aversion of investors and unexpectedly high
returns in the late 20th Century (possibly caused by a one-time
correction of historical equity mispricing). The equity volatility
puzzle is more fundamental. The data suggest that historical variations
in stock prices have been driven primarily by changes in expected stock
returns, rather than changes in expected future dividends.
In the second part of this paper I have argued that the stock
market moves as if risk aversion is volatile and countercyclical. This
behavior could be caused by habit formation, countercyclical
idiosyncratic labor income risk, heterogeneity in risk aversion, or
irrational expectations.
These findings have interesting implications for the optimal
portfolio choice of investors with constant risk aversion and no labor
income risk. Such investors should invest more aggressively when
consumption and stock prices are low than when they are high. Also,
long-term investors with constant risk aversion greater than one should
invest more aggressively on average than short-term investors with the
same risk aversion. This last result supports the view, sometimes
expressed by financial planners, that investors can afford to take
greater stock market risk if they have a long investment horizon.
[FIGURE 1 OMITTED]
[FIGURE 2 OMITTED]
[FIGURE 3 OMITTED]
TABLE 1
The Equity Premium Puzzle
Country Sample Period [aer.sub.c] [sigma]([er.sub.c]) [sigma](m)
USA 1947.2-1998.3 8.071 15.271 52.853
AUL 1970.1-1998.4 3.885 22.403 17.342
CAN 1970.1-1999.1 3.968 17.266 22.979
FR 1973.2-1998.3 8.308 23.175 35.848
GER 1978.4-1997.3 8.669 20.196 42.922
ITA 1971.2-1998.1 4.687 27.068 17.314
JAP 1970.2-1998.4 5.098 21.498 23.715
NTH 1977.2-1998.3 11.421 16.901 67.576
SWD 1970.1-1999.2 11.539 23.518 49.066
SWT 1982.2-1998.4 14.898 21.878 68.098
UK 1970.1-1999.1 9.169 21.198 43.253
USA 1970.1-1998.3 6.353 16.976 37.425
SWD 1920-1997 6.540 18.763 34.855
UK 1919-1997 8.674 21.277 40.767
USA 1891-1997 6.723 18.496 36.345
Country [sigma]([DELTA]c) [rho]([er.sub.c], [DELTA]c)
USA 1.071 0.205
AUL 2.059 0.144
CAN 1.920 0.202
FR 2.922 -0.093
GER 2.447 0.029
ITA 1.665 -0.006
JAP 2.561 0.112
NTH 2.510 0.032
SWD 1.851 0.015
SWT 2.123 -0.112
UK 2.511 0.093
USA 0.909 0.274
SWD 5.622 0.167
UK 5.630 0.351
USA 6.437 0.495
Country cov([er.sub.c], [DELTA]C) RRA(1) RRA(2)
USA 3.354 240.647 49.326
AUL 6.640 58.511 8.421
CAN 6.694 59.266 11.966
FR -6.315 < 0 12.270
GER 1.446 599.468 17.542
ITA -0.252 < 0 10.400
JAP 6.171 82.620 9.260
NTH 1.344 849.991 9.260
SWD 0.674 1713.197 26.501
SWT -5.181 < 0 32.076
UK 4.930 185.977 17.222
USA 4.233 150.100 41.178
SWD 8.830 74.062 12.400
UK 21.042 41.223 14.483
USA 29.450 22.827 11.293
TABLE 2
The Riskfree Rate Puzzle
Country Sample Period [[gamma].sub.f] [DELTA]c [sigma]([DELTA]C)
USA 1947.2-1998.3 0.896 1.951 1.071
AUL 1970.1-1998.4 2.054 2.071 2.059
CAN 1970.1-1999.1 2.713 2.170 1.920
FR 1973.2-1998.3 2.715 1.212 2.922
GER 1978.4-1997.3 3.219 1.673 2.447
ITA 1971.2-1998.1 2.371 2.273 1.665
JAP 1970.2-1998.4 1.388 3.233 2.561
NTH 1977.2-1998.3 3.377 1.671 2.510
SWD 1970.1-1999.2 1.995 1.001 1.851
SWT 1982.2-1998.4 1.393 0.559 2.123
UK 1970.1-1999.1 1.301 2.235 2.511
USA 1970.1-1998.3 1.494 1.802 0.909
SWD 1920-1997 2.209 1.730 2.811
UK 1919-1997 1.255 1.472 2.815
USA 1891-1997 2.020 1.760 3.218
Country RRA (1) TPR (1) RRA (2) TPR (2)
USA 240.647 -136.270 49.326 -81.393
AUL 58.511 -46.512 8.421 -13.880
CAN 59.266 -61.154 11.966 -20.618
FR < 0 N/A 12.270 -5.735
GER 599.468 9757.265 17.542 -16.910
ITA < 0 N/A 10.400 -19.765
JAP 82.620 -41.841 9.260 -25.735
NTH 849.991 21349.249 26.918 -18.769
SWD 1713.197 48590.956 26.501 -12.506
SWT < 0 N/A 32.076 6.636
UK 185.977 676.439 17.222 -27.838
USA 150.100 -175.916 41.178 -65.701
SWD 74.062 90.793 12.400 -13.165
UK 41.223 7.913 14.483 -11.749
USA 22.827 -11.162 11.293 -11.247
Notes
(1.) The gap between average stock and bill returns is even higher
if one computes an average of simple returns (an arithmetic return
average) rather than an average of log returns (a geometric return
average). In this paper I work with log returns throughout, but I adjust
average log returns as required by the theoretical models I explore. In
practice this means adding one-half the variance to the difference of
average log returns, in effect converting from geometric to arithmetic
average returns.
(2.) Jorion and Goetzmann (1999) consider international stock-price
data from earlier in the 20th Century and argue that the long-term
average real growth rate of stock prices has been higher in the US than
elsewhere. However they do not have data on dividend yields, which are
an important component of total return and were particularly important
in Europe during the troubled interwar period. Dimson, Marsh, and
Staunton (2002) do measure dividend yields and find that total returns
in the US did not exceed returns in all other countries in the early
20th Century.
(3.) A few other utility functions also have this property. Epstein
and Zin (1991) and Weil (1989) have proposed a recursive utility
specification that preserves the scale-invariance of power utility but
relaxes the restriction of power utility that the coefficient of
relative risk aversion is the reciprocal of the elasticity of
intertemporal substitution. Models of habit formation make relative risk
aversion constant in the long run but variable in the short run.
(4.) The calculation is done correctly, in natural units, even
though the table reports average excess returns and covariances in
percentage point units. Equivalently, the ratio of the quantities given
in the table is multiplied by 100.
(5.) Glassman and Hassett (1999) take this argument to an extreme.
They argue that the equity premium should be zero, and that US stock
prices will rise three-fold from 1999 levels as the transition
continues. Events since 1999 have not been kind to this view, but it is
certainly possible that the equity premium remains lower today than it
was for most of the 20th Century.
(6.) Experimental evidence is well described by the prospect theory
of Kahneman and Tversky (1979), but it is not at all clear that this
theory can be used to describe people's responses to the
significant lifetime risks involved in financial markets.
(7.) As Abel (1996) and Kocherlakota (1996) point out, negative
time preference is consistent with finite utility in a time-separable
model provided that consumption is growing, and marginal utility
shrinking, sufficiently rapidly. The question is whether negative time
preference is plausible.
(8.) There are however several reasons to rule out such bubbles.
The theoretical circumstances under which bubbles can exist are quite
restrictive; Tirole (1985), for example, uses an overlapping generations framework and finds that bubbles can only exist if the economy is
dynamically inefficient, a condition which seems unlikely on prior
grounds and which is hard to reconcile with the empirical evidence of
Abel, Mankiw, Summers, and Zeckhauser (1989). Santos and Woodford (1997)
also conclude that the conditions under which bubbles can exist are
fragile. Empirically, bubbles imply explosive behavior of prices in
relation to dividends and other measures of fundamentals; there is no
evidence of this, although nonlinear bubble models are hard to reject
using standard linear econometric methods.
(9.) Campbell (1999) analyzes the more general Epstein-Zin-Weil
model, where relative risk aversion need not equal the reciprocal of the
elasticity of intertemporal substitution [phi]. In that model the
coefficient on expected consumption growth is actually the reciprocal
1/[phi].
(10.) A similar point can be understood by looking at equation
(15). If the dividend-price ratio varies, then either the expected rate
of dividend growth or the expected rate of return must vary. Note
however that this is a slightly different point. The total rate of
return includes both the dividend yield and the rate of price
appreciation. This is why the argument based on equatiOn (15) does not
rely on stationarity of the dividend-price ratio. Earlier work on the
ability of stock prices to predict dividends includes Shiller (1981) and
Campbell and Shiller (1988).
(11.) In Table I, we saw that consumption is far less volatile than
stock returns. While equities are not the only component of wealth,
other components are not smooth enough to compensate for the volatility
of stock returns (Campbell 1996, Lettau and Ludvigson 2001).
(12.) Even on a one-time basis, it is hard to get large effects of
expanding participation because new participants tend to be much poorer
than old participants, so the wealth-weighted expansion in participation
is relatively small.
(13.) This is similar to the peso problem story of Rietz (1988)
except that investor fears are no longer required to be rational.
(14.) Shiller (2000) discusses psychological factors that
contribute to the formation of extrapolative expectations, with special
reference to the runup in stock prices during the 1990's. Barberis,
Shleifer, and Vishny (1998) present a related model.
(15.) Note that these adjustments take place gradually, since the
variables that predict the equity premium move relatively slowly. Thus
they are nothing like the rapid moves that are sometimes recommended by
commercial market timing or tactical asset allocation models.
References
Abel, Andrew B., 1990, "Asset Prices under Habit Formation and
Catching Up with the Joneses," American Economic Review Papers and
Proceedings 80, 38-2.
Abel, Andrew B., 1999, "Risk Premia and Term Premia in General
Equilibrium," Journal of Monetary Economics 43, 3-33.
Abel, Andrew B., M. Gregory Mankiw, Lawrence H. Summers, and
Richard J. Zeckhauser, 1989, "Assessing Dynamic Efficiency: Theory
and Evidence," Review of Economic Studies 56, 1-20.
Barberis, Nicholas, 2000, "Investing for the Long Run when
Returns are Predictable," Journal of Finance 55, 225-264.
Barberis, Nicholas, Ming Huang, and Tano Santos, 2001,
"Prospect Theory and Asset Prices," Quarterly Journal of
Economics 116, 1-53.
Barberis, Nicholas, Andrei Shleifer, and Robert Vishny, 1998,
"A Model of Investor Sentiment," Journal of Financial
Economics 49, 307-343.
Barsky, Robert B. and J. Bradford De Long, 1993, "Why Does the
Stock Market Fluctuate?," Quarterly Journal of Economics 107, 29
1-311.
Bernstein, Peter, 1996, Against the Gods: The Remarkable Story of
Risk, Wiley.
Blanchard, Olivier J. and Mark Watson, 1982, "Bubbles,
Rational Expectations, and Financial Markets," in P. Wachtel ed.,
Crises in the Economic and Financial Structure: Bubbles, Bursts, and
Shocks, Lexington: Lexington, MA.
Breeden, Douglas T., 1979, "An Intertemporal Asset Pricing
Model with Stochastic Consumption and Investment Opportunities,"
Journal of Financial Economics 7, 265-296.
Brennan, Michael J., Eduardo S. Schwartz, and Ronald Lagnado, 1997,
"Strategic Asset Allocation," Journal of Economic Dynamics and
Control 21, 1377-1403.
Campbell, John Y, 1987, "Stock Returns and the Term
Structure," Journal of Financial Economics 18, 373-399.
Campbell, John Y, 1991, "A Variance Decomposition for Stock
Returns," Economic Journal 101, 157-179.
Campbell, John Y, 1996, "Understanding Risk and Return,"
Journal of Political Economy 104, 298-345.
Campbell, John Y, 1999, "Asset Prices, Consumption, and the
Business Cycle," Chapter 19 in John Taylor and Michael Woodford eds. Handbook of Macroeconomics Vol. 1, 1231-1303, North-Holland,
Amsterdam.
Campbell, John Y, 2000, "Asset Pricing at the
Millennium," Journal of Finance 55, 1515-1567.
Campbell, John Y, 2003, "Consumption-Based Asset
Pricing," forthcoming in George Constantinides, Milton Harris, and
Rene Stulz eds. Handbook of the Economics of Finance, NorthHolland,
Amsterdam.
Campbell, John Y and John H. Cochrane, 1999, "By Force of
Habit: A Consumption-Based Explanation of Aggregate Stock Market
Behavior," Journal of Political Economy 107, 205-251.
Campbell, John Y and N. Gregory Mankiw, 1989, "Consumption,
Income, and Interest Rates: Reinterpreting the Time Series
Evidence," in Olivier J. Blanchard and Stanley Fischer eds.,
National Bureau of Economic Research Macroeconomics Annual 4, 185-216.
Campbell, John Y and Robert J. Shiller, 1988, "The
Dividend-Price Ratio and Expectations of Future Dividends and Discount
Factors," Review of Financial Studies 1, 195-228.
Campbell, John Y. and Robert J. Shiller, 2003, "Valuation
Ratios and the Long-Run Stock Market
Outlook: An Update," forthcoming in Richard Thaler ed.
Advances in Behavioral Finance, Vol. II, Russell Sage Foundation.
Campbell, John Y. and Luis M. Viceira, 1999, "Consumption and
Portfolio Decisions When Expected Returns Are Time Varying, Quarterly
Journal of Economics 114, 433-495.
Campbell, John Y. and Luis M. Viceira, 2001, "Who Should Buy
Long-Term Bonds?," American Economic Review 91, 99-127.
Campbell, John Y. and Luis M. Viceira, 2002, Strategic Asset
Allocation. Portfolio Choice for Long-Term Investors, Oxford University
Press, Oxford.
Campbell, John Y. and Motohiro Yogo, 2002, "Efficient Tests of
Stock Return Predictability," unpublished paper, Harvard
University.
Cecchetti, Stephen G., Pok-sang Lam, and Nelson C. Mark, 1993,
"The Equity Premium and the Risk-Free Rate: Matching the
Moments," Journal of Monetary Economics 31, 21-45.
Cecchetti, Stephen G, Pok-sang Lam, and Nelson C. Mark, 2000,
"Asset Pricing with Distorted Beliefs: Are Equity Returns Too Good
to Be True?" American Economic Review 90, 787-805.
Chan, Y. Lewis and Leonid Kogan, 2002, "Catching Up with the
Joneses: Heterogeneous Preferences and the Dynamics of Asset
Prices," Journal of Political Economy 110, 1255-1285.
Cochrane, John H., 2001, Asset Pricing, Princeton University Press,
Princeton, NJ.
Cogley, Timothy, 1998, "Idiosyncratic Risk and the Equity
Premium: Evidence from the Consumer Expenditure Survey," working
paper, Federal Reserve Bank of San Francisco.
Constantinides, George M., 1990, "Habit Formation: A
Resolution of the Equity Premium Puzzle," Journal of Political
Economy 98, 519-543.
Constantinides, George M., John B. Donaldson, and Rajnish Mehra,
2002, "Junior Can't Borrow: A New Perspective on the Equity
Premium Puzzle," Quarterly Journal of Economics 117, 269-296.
Constantinides, George M. and Darrell Duffie, 1996, "Asset
Pricing with Heterogeneous Consumers," Journal of Political Economy
104, 219-240.
Dimson, Elroy, Paul Marsh, and Mike Staunton, 2002, Triumph of the
Optimists, Princeton University Press, Princeton, NJ.
Dumas, Bernard, 1989, "Two-person Dynamic Equilibrium in the
Capital Market, Review of Financial Studies 2,157-188.
Epstein, Lawrence and Stanley Zin, 1989, "Substitution, Risk
Aversion, and the Temporal Behavior of Consumption and Asset Returns: A
Theoretical Framework," Econometrica 57, 937-969.
Epstein, Lawrence and Stanley Zin, 1991, "Substitution, Risk
Aversion, and the Temporal Behavior of Consumption and Asset Returns: An
Empirical Investigation," Journal of Political Economy 99, 263-286.
Fama, Eugene and Kenneth R. French, 2002, "The Equity
Premium," Journal of Finance 57, 637-659.
Glassman, James K. and Kevin A. Hassett, 1999, Dow 36,000, Times
Books: New York, NY
Gordon, Myron, 1962, The Investment, Financing, and Valuation of
the Corporation, Irwin: Homewood, IL.
Graham, Benjamin and David L. Dodd, 1934, Security Analysis,
McGraw-Hill: New York, NY
Grossman, Sanford J. and Robert J. Shiller, 1981, "The
Determinants of the Variability of Stock Market Prices," American
Economic Review 71, 222-227.
Grossman, Sanford J. and Robert J. Shiller, 1982, "Consumption
Correlatedness and Risk Measurement in Economies With Non-Traded Assets
and Heterogeneous Information," Journal of Financial Economics 10,
195-210.
Grossman, Sanford J. and Z. Zhou, 1996, "Equilibrium Analysis
of Portfolio Insurance," Journal of Finance 51, 1379-1403.
Hall, Robert E., 1988, "Intertemporal Substitution in
Consumption," Journal of Political Economy 96, 221-273.
Hamilton, James D., 1989, "A New Approach to the Analysis of
Nonstationary Returns and the Business Cycle," Econometrica 57,
357-384.
Hansen, Lars P. and Ravi Jagannathan, 1991, "Restrictions on
Intertemporal Marginal Rates of Substitution Implied by Asset
Returns," Journal of Political Economy 99, 225-262.
Hansen, Lars P., Thomas J. Sargent, and Thomas Tallarini, 1999,
"Robust Permanent Income and Pricing," Review of Economic
Studies 66, 873-907.
Harvey, Campbell R., 1989, "Time-Varying Conditional
Covariances in Tests of Asset Pricing
Models," Journal of Financial Economics 24, 289-317.
Heaton, John and Deborah Lucas, 1996, "Evaluating the Effects
of Incomplete Markets on Risk Sharing and Asset Pricing," Journal
of Political Economy 104, 668-712.
Heaton, John and Deborah Lucas, 1999, "Stock Prices and
Fundamentals," NBER Macroeconomics Annual 213-242, MIT Press,
Cambridge, MA.
Jagannathan, Ravi, Ellen R. McGrattan, and Anna Scherbina, 2000,
"The Declining US Equity Premium," Federal Reserve Bank of
Minneapolis Quarterly Review 24(4), 3-19.
Jorion, Philippe and William N. Goetzmann, 1999, "Global Stock
Markets in the Twentieth Century," Journal of Finance 54, 953-980.
Kandel, Shmuel and Robert F. Stambaugh, 1991, "Asset Returns
and Intertemporal Preferences," Journal of Monetary Economics 27,
39-71.
Kocherlakota, Narayana, 1996, "The Equity Premium: It's
Still A Puzzle," Journal of Economic Literature 34, 42-71.
Krusell, Per and A.A. Smith, Jr., 1997, "Income and Wealth
Heterogeneity, Portfolio Choice, and Equilibrium Asset Returns,"
Macroeconomic Dynamics 1, 387-422.
Lettau, Martin, 2002, "Idiosyncratic Risk and Volatility
Bounds, or Can Models with Idiosyncratic Risk Solve the Equity Premium
Puzzle?," Review of Economics and Statistics 84, 376-380.
Lettau, Martin and Sydney Ludvigson, 2001, "Consumption,
Aggregate Wealth, and Expected Stock Returns," Journal of Finance
56, 815-849.
Lewellen, Jonathan, 2003, "Predicting Returns with Financial
Ratios," forthcoming, Journal of Financial Economics.
Liang, Nellie and Steven A. Sharpe, 1999, "Share Repurchases
and Employee Stock Options and their Implications for S&P500 Share
Retirements and Expected Returns," unpublished paper, Board of
Governors of the Federal Reserve System.
Lucas, Robert E. Jr., 1978, "Asset Prices in an Exchange
Economy," Econometrica 46, 1429-1446.
Mankiw, N. Gregory, 1986, "The Equity Premium and the
Concentration of Aggregate Shocks," Journal of Financial Economics
17, 211-219.
Markowitz, Harry, 1952, "Portfolio Selection," Journal of
Finance 7, 77-91.
McGrattan, Ellen R., and Edward C. Prescott, 2000, "Is the
Stock Market Overvalued?," Federal Reserve Bank of Minneapolis
Quarterly Review 24(4), 20-40.
Mehra, Rajnish and Edward C. Prescott, 1985, "The Equity
Premium: A Puzzle," Journal of Monetary Economics 15, 145-161.
Merton, Robert C., 1969, "Lifetime Portfolio Selection Under
Uncertainty: The Continuous Time Case," Review of Economics and
Statistics 51, 247-257.
Merton, Robert C., 1971, "Optimum Consumption and Portfolio
Rules in a Continuous-Time Model," Journal of Economic Theory 3,
373-413.
Merton, Robert C., 1973, "An Intertemporal Capital Asset
Pricing Model," Econometrica 41, 867-887.
Modigliani, Franco and Richard A. Cohn, 1979, "Inflation and
the Stock Market," Financial Analysts Journal 35, 24-44.
Rietz, Thomas, 1988, "The Equity Premium Puzzle: A
Solution?," Journal of Monetary Economics 21, 117-132.
Ritter, Jay R. and Richard S. Warr, 2002, "The Decline of
Inflation and the Bull Market of 1982 to 1997," Journal of
Financial and Quantitative Analysis 37, 29-61.
Rubinstein, Mark, 1976, "The Valuation of Uncertain Income
Streams and the Pricing of Options," Bell Journal of Economics 7,
407-25.
Santos, Manuel S. and Michael Woodford, 1997, "Rational Asset
Pricing Bubbles," Econometrica 65, 19-57.
Sharpe, Steven A., 1999, "Reexamining Stock Valuation and
Inflation: The Implications of Analysts' Earnings Forecasts,"
Review of Economics and Statistics 84, 632-648.
Shiller, Robert J., 1981, "Do Stock Prices Move Too Much to Be
Justified by Subsequent Changes in Dividends?," American Economic
Review 71, 421-436.
Shiller, Robert J., 1982, "Consumption, Asset Markets, and
Macroeconomic Fluctuations," Carnegie-Rochester Conference Series
on Public Policy 17, 203-238.
Shiller, Robert J., 2000, Irrational Exuberance, Princeton
University Press, Princeton, NJ.
Siegel, Jeremy, 1998, Stocks for the Long Run, 2nd ed.,
McGraw-Hill: New York, NY
Sundaresan, Suresh M., 1989, "Intertemporally Dependent
Preferences and the Volatility of Consumption and Wealth," Review
of Financial Studies 2, 73-88.
Thaler, Richard H. and Eric J. Johnson, 1990, "Gambling With
the House Money and Trying to Break Even: The Effects of Prior Outcomes
on Risky Choice," Management Science 36, 643-660.
Tirole, Jean, 1985, "Asset Bubbles and Overlapping
Generations," Econometrica 53, 1499-1527.
Vissing-Jorgensen, Annette, 2002, "Limited Stock Market
Participation and the Elasticity of Intertemporat Substitution,"
Journal of Political Economy 110, 825-853.
Wang, Jiang, 1996, "The Term Structure of Interest Rates in a
Pure Exchange Economy with Heterogeneous Investors," Journal of
Financial Economics 41, 75-110.
John Y. Campbell *
Well, Philippe, 1989, "The Equity Premium Puzzle and the
Risk-Free Rate Puzzle," Journal of Monetary Economics 24, 401-421.
Williams, John Burr, 1938, The Theory of Investment Value, Harvard
University Press: Cambridge, MA.
John Y. Campbell *
* Otto Eckstein Professor of Applied Economics, Harvard University.
This paper was developed from my 2001 Marshall Lectures, University of
Cambridge.