Explaining the Increased Variability in Long-Term Interest Rates.
Watson, Mark W.
Monetary policy affects the macroeconomy only indirectly. In the
standard mechanism, changes in the federal funds rate, the Federal
Reserve's main policy instrument, lead to changes in longer-term
interest rates, which in turn lead to changes in aggregate demand. But
the links between the funds rate, long rates, and demand may be far from
tight, and this potential slippage is a fundamental problem for monetary
policymakers. In particular, long-term interest rates sometimes move for
reasons unrelated to short-term rates, confounding the Federal
Reserve's ability to control these long-term rates and effect
desired changes in aggregate demand. Has the link between long rates and
short rates weakened over time, therefore making it more difficult for
the Federal Reserve to achieve its macroeconomic policy objectives
through changes in the federal funds rate?
Such questions naturally arise when one observes the behavior of
long-term interest rates. For example, Figure 1 plots year-to-year
changes in ten-year Treasury bond yields from 1965 through 1998. (The
volatile period of the late 1970s and early 1980s has been masked to
highlight differences between the early and later periods.) The most
striking feature of the plot is the increase in the variability of
long-term rates in the recent period relative to the earlier period.
Indeed, the standard deviation of long rates essentially doubled across
the two time periods. What caused this increase in variability? Did a
change in the behavior of short-term interest rates (caused, for
example, by a change in Federal Reserve policy) lead to this dramatic
increase in long-rate variability? Or, rather, is this change in
variability caused by changes in factors unrelated to short-term rates,
often described under the rubric of "term" or "risk"
premia?
In what follows, we study the behavior of short-term interest rates
over the two sample periods, 1965-1978 and 1985-1998, highlighted in
Figure 1. It focuses on two key questions. First, has the short-term
interest rate process changed? Second, can these changes in the behavior
of short-term interest rates explain the increased volatility in
long-term interest rates? The answer to both of these questions is yes;
our findings suggest no weakening of the link between short rates and
long rates and thus no weakening of the link between the Federal
Reserve's policy instrument and its ultimate objectives.
The variability in long-term interest rates is tied to two distinct
features of the short-rate process: (1) the variability of
"shocks" or "innovations" to short-term interest
rates, and (2) the persistence (or half-life) of these shocks. In the
standard model of the term structure, changes in the variability of
short-rate innovations lead to proportional changes in the variability
of the long rate. Thus, holding everything else constant, doubling the
standard deviation of the innovation in short-term interest rates would
lead to doubling the standard deviation of long rates evident in Figure
1.
The relationship between short-rate persistence and long-rate
variability is more complicated. To explain this relationship it is
useful to consider an example in which the short-term interest rate
process can be described by an autoregressive model with one lag (an
AR(1)). Let [rho] denote the autoregressive coefficient associated with
the process. When [rho] = 0, short rates are serially uncorrelated, and
shocks have only a one-period effect on the short-term interest rate. In
contrast, when [rho] = 1, short rates follow a random walk so that
shocks to the current value of short rates lead to a one-for-one change
in all future short rates. When long-term interest rates are viewed as
discounted sums of expected future short-term rates, these different
values of [rho] imply very different behavior for long-term rates. For
example, when [rho] = 0, a change in the current short rate has no
implications for future values of short rates, so long rates move very
little. In contrast, when [rho] = 1, any change in the current short
rate is expected to be permanent and all future short rates are expected
to change. This change in expected future short rates leads to a large
change in the long-term rate. Values of [rho] between 0 and 1 are
intermediate between these two extremes, but in a subtle way that will
turn out to be important for explaining the increased variability in
long-term interest rates evident in Figure 1. In particular, for
long-lived bonds, a short-rate process with [rho] = 0.9 generates long
rates that behave much more like those associated with [rho] = 0 than
with [rho] = 1. Put another way, changes in the autoregressive parameter
[rho] have large effects on the behavior of long-term rates only when
[rho] is very close to 1. Such a result is familiar from studies of
consumption behavior using the present-value model, where the
variability of changes in consumption increase dramatically as income
approaches a "unit-root" process (Deaton 1987, Christiano and
Eichenbaum 1990, Goodfriend 1992, and Quah 19 92).
As a preview of the empirical results in later sections, we find
that the variability of short-term interest rate shocks was smaller in
the later sample period than in the earlier period. If there were no
other changes in the short-rate process, this decline in short-rate
variability should have led to a fall in the standard deviation of
long-term interest rates of approximately 50 percent, as opposed to the
100 percent increase shown in Figure 1. However, we also find evidence
of an increase in persistence: for example, the estimate of the largest
autoregressive root in the short-rate process (the analogue of [rho]
from the AR(1) model) increased from 0.96 in the early period to nearly
1.0 in the later period. By itself, the increase in persistence should
have led to a three-fold increase in the standard deviation of long
rates. Taken together, the decrease in short-rate variability and
increase in persistence explain remarkably well the increase in the
variability of long rates evident in the data.
The estimated change in the persistence of the federal funds process has important implications for the Federal Reserve's
leverage on long-term rates. For example, the estimated autoregressive
process for the early sample period implies that a 25 basis point
increase in the federal funds rate will lead to only a 3 basis point
increase in ten-year rates. The autoregressive process for the later
period implies that the same increase in the federal funds rate will
lead to a 15 basis point increase in ten-year rates. Alternatively, the
increase in persistence makes it possible to achieve a given change in
the long rate with a much smaller change in the federal funds rate. The
"cost" of increased leverage is the implicit commitment not to
reverse changes in the federal funds rate, that is, to maintain the
persistence in the short-rate process. The benefit of increased leverage
is the reduced variability in the short-term rate. These costs and
benefits are discussed in detail by Woodford (1999), who argues that it
may be beneficial for the monetary authority to commit to making only
persistent changes in its policy instrument.
The article is organized as follows. Section 1 documents changes in
the variability of both long-term and short-term interest rates from the
1960s to the present. Here we document the decrease in variability of
short-term interest rates (the federal funds rate and three-month
Treasury bill rates) but an increased variability in longer-term rates
(one-, five-, and ten-year Treasury bond rates). The relative increase
in variability is shown to depend on the horizon of the interest
rate--it is much higher for ten-year bonds than for one-year bonds, for
example.
Section 2 studies changes in the persistence of short-term interest
rates over the two sample periods. It begins by using a hypothetical AR(1) model for short-term interest rates to quantify the potential
effects of short-rate persistence on the variability of long-term
interest rates. The calculations are carried out using a standard model
linking long rates to short rates--the expectations model with a
constant term/risk premium. In this model, changes in long-term interest
rates reflect changes in current and future values of short-term
interest rates. The persistence of short-term interest rates is
important because it affects the forecastability of short-term rates and
thus the effect of changes in the short rate on long rates. The results
indicate that, when [rho] is very near 1, a relatively small change in
[rho] can lead to a large change in the variability of long-term
interest rates.
Also in Section 2 we present empirical estimates of the short-term
interest rate processes for the early and later sample periods using
monthly values of the federal funds rate. These estimated processes show
a fall in the variance of the short rate but an increase in persistence.
Statistical inference about persistence is complicated by the near
unit-root behavior of the short rate. This behavior leads to bias in the
ordinary least squares (OLS) estimates and a nonstandard sampling
distribution for test statistics for shifts in the process across the
two sample periods. The article corrects the OLS estimates for bias
using a procedure developed in Stock (1991) and develops a new
statistical test for a change in an autoregression that can be applied
when data are highly persistent.
In Section 3 the variance of long-term interest rates is calculated
using the expectations model together with the estimated processes for
the short rate. These calculations show that the changes in the
estimated short-rate process lead to increases in long-rate variability
quite similar to the change found in the long-rate data.
Finally, Section 4 discusses the robustness of the empirical
conclusions to specifics of the econometric specification, and Section 5
concludes. Econometric details concerning tests for changes in the
persistence of the short-rate process are given in the Appendix.
1. CHANGES IN THE VARIABILITY OF U.S. INTEREST RATES
The first task is to examine shifts in the volatility of market
interest rates. Figure 2 plots year-over-year changes in six different
interest rates over 1965 to 1998. As in Figure 1, the data from 1979 to
1984 are masked to highlight differences between the early sample period
(1965:1-1978:9) and the more recent period (1985:1-1998:9). The interest
rates range from very short maturity (the federal funds rate) to long
maturity (ten-year Treasury bonds and AAA corporate bonds). [1] Each
series is a monthly average of daily observations of the interest rates
measured in percentage points at annual rates. Table 1 presents standard
deviations for changes in interest rates over different sample periods.
Panel a reports results for the year-over-year changes plotted in Figure
2, panel b reports results for monthly changes ([R.sub.t] -
[R.sub.t-1]), and panel c reports standard deviations of residuals from
estimated univariate autoregressions.
As seen in the figures and table, the volatility of long-term rates
is much higher in the recent period than in the 1965-1978 sample period,
but that is not the case for short-term rates. For example, from panel a
of Table 1, the standard deviation of year-over-year changes in ten-year
Treasury bond rates increased from 0.69 (69 basis points) in the
1965-1978 period to 1.29 (129 basis points) in the 1985-1998 period. A
similar large increase is evident for AAA Corporate bond rates and for
five-year Treasury bonds. At the shorter end of term structure,
volatility did not increase. Indeed there is a substantial fall in the
variability of the federal funds rate from 2.44 (244 basis points) to
1.50 (150 basis points).
The remainder of the table investigates the robustness of this
conclusion about volatility both with respect to sample period and data
transformation. As shown on the table, this conclusion does not depend
on the precise dates used to define the "early" and
"recent" periods. These 1965-1978 and 1985-1998 dates were
chosen somewhat arbitrarily, and the same volatility results hold for a
wide range of cutoff dates used to define the sample periods.
Consequently, defining the early period as 1955-1978 and the recent
period as 1992-1998 leads to the same conclusions. However, results do
change if the volatile period of the late 1970s and early 1980s is
included: from Table 1 interest rates were much more volatile in this
period than they were either before 1979 or after 1984. Finally, the
results from different panels show that the same qualitative conclusion
follows when year-over-year differences are replaced with monthly
differences or with residuals from univariate autoregressions. For
example, the standard deviation of the residuals in a univariate
autoregression for ten-year Treasury bond rates increased from 19 basis
points in 1965-1978 to 25 basis point s during 1985-1998 (see panel c).
The corresponding standard deviation for the federal funds rate fell
from 38 to 21 basis points.
Since the variability of short-term rates was smaller in the later
sample period than in the early period, it is clear that changes in the
variability of short rates cannot explain the increased variability of
long rates. We will have to look elsewhere, and with that in mind, the
next section investigates changes in persistence in the short-rate
process.
2. CHANGES IN THE PERSISTENCE OF U.S. INTEREST RATES
Before examining the empirical results on the persistence of
short-term interest rates, it is useful to review the mechanism that
links changes in short-rate persistence with changes in long-rate
variability. This mechanism can be described using a simple expectations
model of the term structure. Thus, let [[R.sup.h].sub.t] denote the
yield to maturity on an h-period pure discount bond, and assume that
these yields are related to short-term rates by
[[R.sup.h].sub.t] = [frac{1}{h}] [[[sum].sup.h-1].sub.i=0]
[E.sub.t][[R.sup.1].sub.t+i],
where [[R.sup.1].sub.t] is the corresponding rate on a one-period
bond. This relation can be interpreted as a risk-neutral arbitrage relation. Now, suppose that short-term rates follow an AR(1) process
[[R.sup.1].sub.t] = [rho][[R.sup.1].sub.t-1] + [[varepsilon].sub.t]
so that [E.sub.t][[R.sup.1].sub.t+i] =
[[rho].sup.i][[R.sup.1].sub.t+i] for i [geq] 1. Then
[[R.sup.h].sub.t] = [[R.sup.1].sub.t] [[frac{1}{h}]
[[[sum].sup.h-1].sub.i=0] [[rho].sup.i]]
so that long rates are proportional to short rates, with a factor
of proportionality that is an increasing function of the persistence
parameter [rho]. Complications to the model (incorporation of term/risk
premia, allowance for coupon payments, etc.) change details of the link
between long rates and short rates. They do not, however, change the key
feature of the model--namely, that long rates depend on a sequence of
expected future short rates and that the variance of this sequence
depends critically on the persistence of shock to short-term rates.
Of crucial importance is the quantitative impact of short-rate
persistence on long-rate variability. Figure 3 gives a sense of this
impact. Using the expectations relation given above, it plots the
standard deviation of year-over-year changes in [[R.sup.h].sub.t] (that
is, [[R.sup.h].sub.t] - [[R.sup.h].sub.t-12]) as a function of [rho].
Results are shown for different maturities h, and the scale of the plot
is fixed by setting the innovation variance of short rates
([[[sigma].sup.2].sub.[varepsilon]]) equal to 1. The plot shows the
functions for values of p between 0.95 and 1.00, which is the relevant
range for the monthly data studied in this article. For short maturities
(small values of h) [rho] does not have much of an effect on the
standard deviation interest rate. For example, as [rho] increases from
0.95 to 1.00, the standard deviation of one-period rates increases by a
factor of 1.1 (from 3.1 to 3.5). However, [rho] has a large effect on
the variability of long-term interest rates. When h = 120 (a t en-year
bond when the period is a month), then as [rho] increases from 0.95 to
1.00, the resulting standard deviation of long rates increases by a
factor of 7 (from 0.5 to 3.5). Moreover, the rate of increase in the
standard deviation increases with the value of [rho]. Thus, the implied
changes in the volatility of long rates across sample periods will
depend both on the level of [rho] and on its change.
Having considered the analytical importance of persistence, we now
examine the empirical evidence on it. Table 2 contains estimates of the
persistence in short-term rates for the two sample periods. Results are
presented for both the federal funds and the three-month Treasury bill
rate. Univariate autoregressions are fit to the series, and persistence
is measured by the largest root of the implied autoregressive process.
This largest autoregressive root determines the effect of shocks on long
horizon forecasts of short rates and therefore summarizes most of the
information about the link between short rates and long-term interest
rate variability. We denote the parameter by [rho] as in the AR(1) model
discussed above.
The first entry for each sample period is the OLS estimate of [rho]
(denoted [[rho].sub.ols]) computed from an AR(6) model. (The next
section will discuss the robustness of results to the lag length in the
autoregression.) The values of [[rho].sub.ols] are very large both for
the two interest rates and the two sample periods. The implication is
that short rates were apparently highly persistent in both sample
periods. There is some evidence of a small increase in [rho] in the
latter sample period: the value of [[rho].sub.ols] increases from 0.97
to 0.98 for federal funds and from 0.96 to 0.98 for three-month Treasury
bills. However, interpreting these changes is difficult because of
statistical sampling problems associated with highly persistent
autoregressions. These problems are well known in autoregressions with
unit roots, but similar problems also arise when roots are close to
unity. To aid the reader, we digress with a short statistical primer before discussing the other entries in Table 2.
When values of [rho] are close to 1 and the sample size is moderate
(as it is here), then the sampling distributions of OLS estimators and
test statistics differ markedly from the distributions that arise in the
classical linear regression model. In particular, [[rho].sub.ols] is
biased, and the usual t-statistics have non-normal distributions. One
cannot construct confidence intervals for [rho] in the usual way. Of
course, as long as [rho] is strictly less than 1, the usual asymptotic
statistical arguments imply that these difficulties disappear for a
"suitably" large sample size. Unfortunately, the sample size
used in this article (like that commonly used in empirical macroeconomic
research) is not large enough for the conventional asymptotic normal
distributions (based on stationarity assumptions) to provide an accurate
approximation to the sampling distribution of the usual OLS statistics.
We must use alternative and more accurate approximations.
In empirical problems when [rho] is close to 1 (say in the range
0.90-1.01) and the sample size is moderate (say less than 200
observations), econometricians have found that
"local-to-unity" approximations provide close approximations
to the sampling distribution of OLS statistics. [2] In the present
context, these approximations will be used to construct unbiased
estimators of [rho], confidence intervals for [rho], and Prob-values in
tests for changes in [rho] over the two sample periods. Specifically,
"median-unbiased" estimates and confidence intervals for [rho]
are constructed from the Dickey-Fuller [[tau].sup.[mu]] statistic using
the procedures developed in Stock (1991). [3] Tests for changes in [rho]
across the two sample periods are carried out using the usual Chow-F
statistic. This statistic is computed as the Wald statistic from changes
in the values of [[rho].sub.ols] over the two sample periods. The
regressions are estimated separately in each sample period, so that all
of the coefficients are al lowed to change, but the Wald statistic tests
for a change in the largest root only. (Changes in the other
autoregressive parameters will have little effect on the variance of
long rates, so we focus the test on the largest root.) The statistical
significance of the Chow statistic can be determined using Prob-values
computed from the local-to-unity probability distributions. These
alternative Prob-values are described in detail in the Appendix. As the
Appendix shows, the Prob-value depends on the true, and unknown, value
of [rho]. Thus, rather than reporting a single Prob-value, we report an
upper and lower bound.
With this background, the reader can now understand other entries
in Table 2. The unbiased estimates are reported in the column labeled
[[rho].sub.mub] (the mub subscript stands for "Median
UnBiased"), and these are followed by the 90 percent confidence
interval for [rho]. The point estimates suggest that persistence was
higher in the second period; for example, using the federal funds rate,
the value of [[rho].sub.mub] increased from 0.96 to 1.00. However, the
confidence intervals show that there is a rather wide range of values of
[rho] that are consistent with the data--the confidence interval, which
for federal funds in the first period is 0.91-1.01, shifts up to
0.94-1.02 in the second period. The overlap in these confidence
intervals suggests that the apparent shift in [rho] is not highly
statistically significant, and this conjecture is verified by the
Chow-statistic, which has a Prob-value that falls between 0.30 and 0.64.
Thus, there is some evidence of a shift in the largest root, in a
direction co nsistent with the behavior of long-term rates, but the
shift is small and the exact magnitude is difficult to determine because
of sampling error. However, when [rho] is near 1, small changes in its
value can cause large changes in the variability of long-term interest
rates.
3. IMPLICATIONS OF THE CHANGES IN THE SHORT-RATE PROCESS ON
LONG-RATE VARIABILITY
The changes in long-rate volatility associated with the changes in
the short-rate process depend on the specifics of the model linking
short rates to long rates. Before we compute the variability of long
rates associated with the estimated short-rate processes from the last
section, three issues need to be addressed in the present context.
First, the data used here, while standard, are not ideal. The data
are not point sampled but rather are monthly observations of daily
averages. The bonds contain coupon payments, which were missing in the
simple theory presented above. The calculations presented below are
based on two approximations. First, the process for one-month rates is
estimated using the federal funds data. This is a rough approximation
that uses a monthly average of daily rates as a monthly rate. As it
turns out, similar results obtain if the federal funds process was
replaced with the estimated process for the three-month Treasury bills,
so the precise choice of short rate does not seem to matter much. The
second approximation adjusts the present-value expectations model for
coupon payments using the approximation in Shiller, Campbell, and
Schoenholitz (1983). Specifically, the expectational equation for long
rates becomes
[[R.sup.h].sub.t] = [frac{1 - [beta]}{1 - [[beta].sup.h]}]
[[[sum].sup.h-1].sub.i=0] [[beta].sup.i][E.sub.t][[R.sup.1].sub.t+i],
where [beta] = 0.997.
The second issue involves the expectations theory described above.
That model used an AR(l) driving process for short rates, and
constructed expectations using this process. The univariate process for
short rates is more complicated than an AR(l) process; moreover, one can
form short-rate expectations using a richer information set than one
containing only lags of short rates. Extending the calculations to
account for a higher-order univariate AR process is straightforward, as
the exercise merely involves computing the terms
[E.sub.t][[R.sup.1].sub.t+i] from a higher-order AR model. However; to
account for a wider information set is more problematic. A standard and
powerful approach to this problem is to construct bounds on the implied
variance of long rates from the short process, using, for example, the
approach in Shiller (1981). Unfortunately, this approach requires
stationarity of the underlying data, so the bounds are likely to be
inaccurate for the highly persistent data studied here. West (1988) prop
oses bounds for the expectational present-value model based on the
innovations in the univariate processes and shows that these bounds hold
for integrated as well as stationary processes. But as it turns out,
West's results hold only for the infinite horizon model, and the
model here is finite horizon. [4] Another approach is simply to specify
a more general information set and carry out the analysis using, say, a
vector autoregression (VAR) instead of a univariate autoregression.
However, the statistical analysis becomes increasingly complicated in a
VAR with highly persistent variables. For all of these reasons, the
analysis here will be carried out using a univariate AR.
Finally, the calculations reported here ignore all term/risk premia
and other deviations from the simple expectations theory. As mentioned
above, even in more complicated versions of the models, the first-order
impact of short-rate persistence on long-rate variability occurs through
the expected present-value expression from the version of the model used
here.
With these limitations in mind, consider now the implied
variability in long-term rates. The results are summarized in Figure 4
and in Table 3, which shows the implied variability of interest rates
computed from the expectations model, using the estimated short-rate
process over the different sample periods and for different values of
[rho]. Results are shown for four maturities. Each panel of Figure 4
shows the variability of year-over-year changes in the interest rate
implied by the estimated AR(6) model for the federal funds rate, where
the estimates are derived by imposing the value of [rho] shown on the x
axis. Results are shown for both sample periods. Highlighted on the
graphs are the results that impose the OLS and the median-unbiased
estimates of [rho] from Table 2. (A circle denotes the value of
[[rho].sub.ols]; a square denotes [[rho].sub.mub].) In each panel, the
variance function for the second period lies below the function for the
first period. This shift is caused by the decrease in variance of the AR
errors estimated for the second period. The vertical distance between
the curves shows the change in variance for a given value of [rho]. To
compute the variance across periods, the value of [rho] in each sample
period must be specified. In terms of the figures, the vertical
displacement of the plotted circles gives the change in variability
across the two periods using the OLS estimates of [rho]
([[rho].sub.ols]). The displacement of the squares gives the change
using the median-unbiased estimator ([[rho].sub.mub]). The implied
standard deviation for the four maturities in both sample periods and
for [[rho].sub.ols] and [[rho].sub.mub] are given in Table 3. For
comparison, the table also gives the period-specific sample standard
deviations for the federal funds rate and the rates on one-, five-, and
ten-year Treasury bonds.
There are substantial differences in the results across the four
panels in Figure 4. For one-month rates (panel a), variability is
essentially independent of [rho] and thus the model predicts a
substantial decrease in variability during the second period. Since the
federal funds rate data were used to estimate the short-rate process,
this decrease in variability is essentially equal to the sample
values--see the first row of Table 3. For one-year rates (panel b of
Figure 4 and the second row of Table 3), variability is also predicted
to decrease in the second period, but the decrease is far less than for
one-month rates and depends on which estimator is used for [rho]. (The
implied decrease in the standard deviation is 49 basis points using
[[rho].sub.ols] and 30 basis points using [[rho].sub.mub].) In the
sample, there was a small increase (14 basis points) in the standard
deviation of one-year interest rates. At longer maturities (panels c and
d of Figure 4 and the last two rows of Table 3), variability is
predicted to increase in the second period, and again, the amount of the
increase depends on the estimator of [rho] that is used. The increase is
not particularly large using [[rho].sub.ols] (less than 20 basis
points); however, it is much larger using [[rho].sub.mub] (70 basis
points). The small bias correction incorporated in [[rho].sub.mub]
results in this large difference because it pushes the second-period
estimate of [rho] very close to 1 and because the variance function is
rapidly increasing in this region.
While the estimated difference in persistence, as measured by
[[rho].sub.mub], explains much of the increase in variability in
long-term interest rates, much of that variability is still unexplained.
For example, in the first sample period the model's implied
standard deviation for five-year rates is 52 basis points, while the
sample standard deviation of actual five-year rates is 89 basis points.
This leaves a "residual" component, orthogonal to short-term
rates, with a standard deviation of 72 basis points (72
=[sqrt{[89.sup.2] - [52.sup.2]}] representing the difference between the
actual five-year rates and the value implied by the expectations model.
Interestingly, a residual component of similar size (69 basis points) is
necessary in the second sample period. (A somewhat larger residual is
required for ten-year rates.) Thus, although the simple expectations
model with constant term/risk premia and simple information structure
leaves much of the variability in long rate unexplained in both sample
periods , it does explain the lion's share of the increase in
variability across the two sample periods.
The results derived here, based on a simple version of the
expectations theory of the term structure, are consistent with results
derived by other researchers using reduced-form time-series methods. For
example, the expectations theory, together with a process for the
short-term interest, can be used to calculate the change in the long
rate associated with a given change in the short rate. Using the
first-period estimates (and the values of [[rho].sub.mub] shown in Table
2) the model predicts that a 25 basis point change in the federal funds
rate would lead to a 3 basis point change in the ten-year bond rate. The
second-period estimates imply that the same 25 basis point change in the
federal funds rate would lead to a 15 basis point change in the long
rate. Mehra (1996) estimates a reduced-form time-series model (a vector
error correction model) of long rates and inflation over the 1957-1978
and 1979-1995 sample periods. His estimated models predict that a 25
basis point change in the federal funds rate led to a 3 to 7 basis point
change in long rates in the early period and a 7 to 12 basis point
change in the later period.
4. ROBUSTNESS OF RESULTS
This section discusses the robustness of the article's main
findings to specification of lag length in the autoregression and to
choice of sample period. The empirical results are summarized in Table
3. The first row in each panel of the table shows the results from the
specification used in the last section, so these results are the same as
reported in Table 2. Each of the following rows summarizes results from
a different specification of either lag-length or sample period. Panel a
of Table 4 shows results for the federal funds rate and panel b shows
results for the three-month Treasury bill rate.
The AR lag length of 6 used in the baseline specification was
suggested by the Akaike Information Criteria (AIC) and by t-tests on the
autoregressive coefficients. Much shorter lag lengths were suggested by
the Schwartz criteria (BIC). Table 2 shows results from specifications
using 2, 4, and 8 lags. Each of these alternative specifications yield
first-period estimates of [rho] that are lower than the estimates from
the AR(6) model; second-period estimates are essentially unchanged. The
first-period differences in [[rho].sub.ols] are small, but the
differences are more substantial for the [[rho].sub.mub]. Ignore for the
moment the large amount of sampling error associated with these
estimates. Even so, the new point estimates have little effect on the
variance of long-term rates. From Figure 3, the long-rate variance
function is relatively flat over the range of first-period [rho]
estimates given in Table 3. Thus, from Figure 3, the implied
first-period standard deviation of long-term interest rate changes i s
0.11 when [rho] = 0.93 and increases to only 0.18 as [rho] increases to
0.96. (The first [rho] figure is the value of [[rho].sub.mub] from the
AR(4) first-period model; the second figure is the corresponding value
of [[rho].sub.mub] in the AR(6) model.) Both of these specifications
imply a much larger second-period standard deviation (1.48 and 0.975 for
the AR(4) and AR(6) models, respectively) since the second-period values
of [[rho].sub.mub] are very close to 1.0 in both specifications. Thus
lag-length choice appears to have little effect on the qualitative
conclusions.
The choice of sample period has a more important effect. The
baseline sample periods 1965:1-1978:9 and 1985:1-1998:9 were chosen to
eliminate the large variability in interest rates during the late 1970s
and early 1980s. With this volatile period eliminated, two samples of
equal size were chosen (with 1998:9 being the last sample period
available when this research was started). There is no compelling
reason, other than equating statistical power in each sample, why the
early and recent samples should be of equal size. The last two rows of
the table show results from increasing the early sample period (by
changing the beginning date to 1955:1) and decreasing the recent sample
period (by changing the beginning date to 1992:1). Since the 1992-1998
sample period is very short, an AR(2) model was used for this
specification. Evidently, the choice of the second period has little
effect on the estimates of [rho], but the choice of first sample period
does. Estimates of [rho] are larger for both interest rates in t he
extended sample period 1955-1978 than in the 1965-1978 period. This
increase should not be surprising given the behavior of interest rates
over the 1955-1978 period, where the dominant feature of the data is an
increase in the "trend" level of interest rates. However,
since this article's analysis focuses on the behavior of long rates
as they are affected by expected future short rates, the question is
whether investors in the late 1950s anticipated this trend rise in
interest rates, as would be suggested by ex post fitted values from the
univariate autoregression. Such prescience seems unlikely.
5. SUMMARY AND DISCUSSION
We have documented the increase in the variability of long-term
interest rate changes during the 1985-1998 period relative to the
1965-1978 period. In contrast, the variability of short-term interest
rates decreased in the later period. A possible explanation for this
differential behavior is a change in the persistence of changes in
short-term rates: expectations theories of the term structure imply that
such shifts in persistence will have a large effect on the variability
of changes in long-term rates but have little effect on the variability
of changes in short rates. Point estimates of the largest autoregressive
root for short rates show an increase in persistence that is large
enough to explain the increased variability in long rates. However, the
short-rate persistence parameter is imprecisely estimated, so that it is
impossible to reach definitive conclusions based on this analysis. The
lack of precision raises two issues: one related to statistical
technique and one related to learning about changes in central bank
policy.
The first issue concerns using the behavior of long rates to infer
the persistence of the short-rate process. This is appropriate if long
rates and short rates are connected by the present-value model. This
procedure is used in Valkanov (1998), where the model's implied
cointegration between long and short rates yields improved estimators
for [rho]. Valkanov then uses the improved estimator to overcome
inference problems identified by Elliott (1998) in his critique of
cointegration methods. Indeed, in a comment on a preliminary draft of
this article, Valkanov (1999) uses his method to construct estimates of
[rho] together with 90 percent confidence intervals for the time periods
1962:1-1978:8 and 1983:1-1991:2 using data on the federal funds rate and
ten-year Treasury bonds. He finds an estimate of [rho] of 0.96 (with a
90 percent confidence interval of 0.93-0.98) in the early period and an
estimate of 0.99 (with a 90 percent confidence interval of 0.99-1.00) in
the later period (Valkanov 1999, Table 2c). Hi s point estimates are
essentially identical to the values of [[rho].sub.mub] reported in our
Table 2, but as expected from the use of a more efficient procedure, his
confidence intervals are considerably narrower than the results
presented in Table 2.
The large sampling uncertainty associated with estimates of the
short-rate persistence suggests that the market will learn about changes
in persistence very slowly from observing short-term interest rates. A
central bank interested in increasing the persistence of short-term
interest rates (for the reason suggested in Woodford [1999], for
example) would have to follow this policy for a considerable time to
convince a market participant who relied only on econometric evidence
that such a change had indeed taken place. For example, suppose that the
federal funds process changed from one with a largest root of 0.96 to
one with a largest root of 0.99, and after ten years in the new regime
an econometrician tested the null hypothesis that [rho] = 0.96 versus
the alternative that [rho] [greater than] 0.96 using a standard t-test
at the 5 percent significance level. The econometrician would
(correctly) reject this null only about 50 percent of the time. (That
is, the power of the test using ten years of data is rou ghly 0.50.)
Thus, it is likely the econometrician would have to observe the new
federal funds process for quite some time before he concluded that the
process had changed. This failure immediately to recognize policy shifts
highlights the importance of other devices (institutional constraints,
public statements, etc.) to more quickly convince a wary public that
such shifts have occurred.
This article has presented econometric evidence suggesting that
changes in the federal funds rate are more persistent now than they were
in the 1960s and 1970s. Why did this change occur? We can offer but a
few remarks on this important question. Here is one possible
explanation. Suppose we decompose the funds rate into a real rate and an
inflation component. If movements in the real rate are transitory, then
the persistence in the funds rate will be driven by the inflation
component. Therefore, an increase in the persistence of inflation
possibly explains the increased persistence in the funds rate. This
explanation, however, does not seem promising. For example, the values
of [[rho].sub.mub] computed using CPI inflation fell from 0.98 in the
earlier sample period to 0.92 in the later period. As a result,
inflation seems to have become less persistent, and this implies that
some of the explanation must lie in the persistence of the real
component of the funds rate. There is growing econometric evidence that
the Federal Reserve's "reaction function" linking the
federal funds rate to expected future inflation and real activity has
been quite different under Chairmen Volker and Greenspan than under the
previous three chairmen. For example, Clarida, Gal[acute{i}], and
Gertler (1999) present evidence suggesting that the Federal Reserve
responded more aggressively to expected future inflation after 1979 than
in the previous two decades. Their evidence also suggests that the
Federal Reserve more aggressively smoothed the funds rate in this latter
period, consistent with the increased persistence found here. Changes in
this reaction function undoubtedly contain the key to explaining the
increased persistence in the federal funds rate process.
I thank Michael Dotsey, Robert Hetzel, Thomas Humphrey, Yash Mehra,
Pierre Sarte, and Ross Valkanov for comments on a previous draft of this
article. This research was supported by National Science Foundation
grant SBR-9730489. The views herein are the author's and not
necessarily those of the Federal Reserve Bank of Richmond or the Federal
Reserve System.
(1.) All of the data are from the DRI database. The series are
FYFF, FYGM3, FYGT1, FYGT5, FYGT10, and FYAAAC.
(2.) Important early references in econometrics include Cavanagh
(1985), Phillips (1987), and Stock (1991).
(3.) The median-unbiased estimator, which will be denoted
[[rho].sub.mub], has the property that Prob([[rho].sub.mub] [leq] [rho])
= Prob([[rho].sub.mub] [geq] [rho]) = 0.5.
(4.) In West's present-value model [y.sub.t] = [E.sub.t]
[[[sum].sup.h].sub.i=0], [[beta].sup.i][x.sub.t+i] the key restriction
is that [E.sub.t][[beta].sup.h][x.sub.t+h] converges to zero in mean
square as h [right arrow] [infty]. This suggests that West's bounds
will provide a good approximation in the finite horizon model so long as
[E.sub.t][[beta].sup.h][x.sub.t+h] is small. Thus, the quality of the
approximation will depend on the size of [([beta][rho]).sup.h] where
[rho] is the largest autoregressive root. In the term structure model
[beta] = 0.997, and the [x.sub.t] process is highly persistent, with a
largest autoregessive root of, say, [rho] = 0.99. Thus, for h = 120
[([beta][rho]).sup.h] = 0.16, which implies that
[E.sub.t][[beta].sup.h][x.sub.t+h] will often be substantially different
from 0.
(5.) To see this, contrast the discontinuous results
[lim.sub.T[rightarrow][infty]][[rho].sup.T] = {0 when \[rho]\ [less
than] 1
1 when [rho] = 1
[infty] when [rho] [greater than] 1}
with the continuous result
[lim.sub.T[rightarrow][infty] [([rho]T).sup.T] = [e.sup.c] when
[rho]T = 1 + c/T.
REFERENCES
Cavanagh, C. L. "Roots Local to Unity." Manuscript.
Department of Economics, Harvard University, 1985.
Christiano, Lawrence J., and Martin Eichenbaum. "Unit Roots in
Real GNP--Do We Know, and Do We Care?" Carnegie-Rochester
Conference Series on Public Policy, vol. 32 (1990), PP. 7-32.
Clarida, Richard, Jordi Gal[acute{i}], and Mark Gertler.
"Monetary Policy Rules and Macroeconomic Stability: Evidence and
Some Theory," Quarterly Journal of Economics, vol. 115 (February
2000), pp. 147-80.
Deaton, Angus S. "Life-Cycle Models of Consumption: Is the
Evidence Consistent with the Theory?" in Truman F. Bewley, ed.,
Advances in Econometrics, Vol II. Amsterdam: North Holland, 1987, pp.
121-48.
Elliott, Graham. "On the Robustness of Cointegration Methods
when Regressors Almost Have Unit Roots," Econometrica, vol. 66
(January 1998), pp. 149-58.
Goodfriend, Marvin. "Information-Aggregation Bias,"
American Economic Review, vol. 82 (June 1992), pp. 508-19.
Mehra, Yash P. "Monetary Policy and Long-Term Interest
Rates," Federal Reserve Bank of Richmond Economic Quarterly, vol.
82 (Summer 1996), pp. 27-29.
Phillips, P. C. B. "Toward a Unified Asymptotic Theory for
Autoregression," Biometrika, vol. 74 (1987), pp. 535-47.
Quah, Danny. "The Relative Importance of Permanent and
Transitory Components: Identification and Some Theoretical Bounds,"
Econometrica, vol. 60 (January 1992), pp. 107-18.
Shiller, Robert J. "Do Stock Prices Move Too Much to be
Justified by Subsequent Changes in Dividends?" American Economic
Review, vol. 71 (June 1981), pp. 421-36.
_____, John Y. Campbell, and Kermit L. Schoenholitz. "Forward
Rates and Future Policy: Interpreting the Term Structure of Interest
Rates," Brookings Papers on Economic Activity, 1:1983, pp. 173-223.
Stock, James H. "Confidence Intervals for the Largest
Autoregressive Root in U.S. Macroeconomic Time Series," Journal of
Monetary Economics, vol. 28 (December 1991), pp. 435-59.
Valkanov, Rossen. "Notes on Watson's 'Explaining the
Increased Variability in Long-Term Interest Rates.' "Research
Memorandum. Princeton University, 1999.
_____. "The Term Structure with Highly Persistent Interest
Rates." Manuscript. Princeton University, 1998.
West, Kenneth D. "Dividend Innovations and Stock Price
Variability," Econometrica, vol. 56 (January 1988), pp. 37-61.
Woodford, Michael. "Policy Inertia." Manuscript.
Princeton University, 1999.
APPENDIX
A. Computing Prob-Values for the Chow Test Statistic for the
Largest Autoregressive Root
This Appendix describes the method used to compute the Prob-values
for tests of changes in the largest autoregressive root of a univariate
autoregression. The specification is the AR(p) autoregression
[x.sub.t] = [mu] + [u.sub.t]
with
[u.sub.t] = [[[sum].sup.p].sub.i=1][[phi].sub.i][u.sub.t-1] +
[[varepsilon].sub.t],
where [x.sub.t] denotes the level of the interest rate, [mu] is a
constant denoting the average level of the process in the stationary
model, and [u.sub.t] is a stochastic term. The [u.sub.t] process can be
rewritten as
[u.sub.t] = [rho][u.sub.t-1] +
[[[sum].sup.p-1].sub.i=1][[pi].sub.i]([u.sub.t-i] - [u.sub.t-i-1]) +
[[varepsilon].sub.t],
where [rho] = -[[[sigma].sup.p].sub.i=1][[phi].sub.i] and
[[pi].sub.i] = -[[[sigma].sup.p].sub.j=i+1][[phi].sub.j]. The parameter
[rho] is thus the sum of the AR coefficients. When one root of the AR
polynomial 1- [[[sigma].sup.p].sub.i=1][[phi].sub.i][z.sup.i] is close
to 1 and all of the other roots are larger than 1, then [rho] is also
approximately equal to the inverse of the root closest to unity. In this
case [rho] is usually called the "largest" root because its
inverse is the largest eigenvalue of the companion matrix of the model
VAR(1) representation.
We study the behavior of statistics in a setting where [rho] is
modeled as close to 1.0, written as
[rho]T = 1 + [frac{c}{T}]
The artificial dependence of [rho] on the sample size T facilitates
the analysis of continuous asymptotic limits as T [rightarrow] [infty].
[5] To simplify notation, we will present the AR(1) model, so that
[[pi].sub.i] = 0, for i = 1, ...,p - 1. For the test statistics used in
this article, the inclusion of extra lags has no effect on the limiting
distribution, and in this sense the presentation here is without loss of
generality. Following the discussion of the limiting distribution of the
Chow test statistic, Appendix A2 discusses the numerical procedure used
to compute the Prob-values shown in Tables 2 and 3.
A1 Asymptotic Distribution in the AR(1) Model
Assume
[u.sub.t] = [rho][T.sup.[u].sub.t-1] + [[varepsilon].sub.t],
where [u.sub.0] is a finite fixed constant, t = 1, ..., T, and
[[varepsilon].sub.t] is a martingale difference sequence with
E([[[varepsilon].sup.2].sub.t] [[varepsilon].sub.t-1],[[varepsilon].sub.t-2],...) = 1, and with
[sup.sub.t] E[[[varepsilon].sup.4].sub.t] [greater than] [infty], where
[rho]T = 1 + [frac{c}{T}].
Let [[hat{[rho]}].sub.1] denote the OLS estimator of [rho]
constructed from the regression of [x.sub.t] onto (1,[x.sub.t-1]) using
the early sample period t = 1,..., [T.sub.1], and let
[[hat{[rho]}].sub.2] denote the corresponding estimator constructed
using the later sample period t = [T.sub.2],..., T. Assume
[lim.sub.T[rightarrow][infty]] [frac{[T.sub.1]}{T}] = [[tau].sub.1]
and
[lim.sub.T[rightarrow][infty]] [frac{[T.sub.2]}{T}] = [[tau].sub.2]
with 0 [less than] [[tau].sub.1] [less than] [[tau].sub.2] [less
than] 1. Denote the sample means by
[bar{[x.sub.1,T]}] = [frac{1}{[T.sub.1]}]
[[[sum].sup.[T.sub.1]].sub.t=1][x.sub.t]
[bar{[x.sub.2,T]}] = [frac{1}{T - [T.sub.2] +
1}][[[sum].sup.T].sub.t=[T.sub.2]][x.sub.t]
and the demeaned series by
[[x.sup.[mu]].sub.1,t] = [x.sub.t] - [bar{[x.sub.1,T]}]
[[x.sup.[mu]].sub.2,t] = [x.sub.t] - [bar{[x.sub.2,T]}].
The limiting behavior of these series is related to the behavior of
the diffusion process [J.sub.c](s), generated by
d[J.sub.c](s) = c[J.sub.c](s)ds + dW(s)
for 0 [leq] s [leq] 1, where W(s) is a standard Wiener process. In
particular,
[frac{1}{[sqrt{T}]}][[x.sup.[mu]].sub.1,[sT]] [Rightarrow]
[J.sub.c](s) - [[[tau].sup.-1].sub.1] [[[integral
of].sup.[[tau].sub.1]].sub.0] [J.sub.c](r)dr [equiv]
[[J.sup.[mu]].sub.1,c](s) for 0 [less than] s [leq] [[tau].sub.1]
[frac{1}{[sqrt{T}]}][[x.sup.[mu]].sub.2,[sT]] [Rightarrow] [J.sub.c](s)
- [(1 - [[tau].sub.2]).sup.-1] [[integral of].sup.1].sub.[tau].sub.2]
[J.sub.c](r)dr [equiv] [[J.sup.[mu]].sub.2,c](s) for [[tau].sub.2] [leq]
s [less than] 1.
The Chow F-statistic for testing [H.sub.0] : [[rho].sub.1] =
[[rho].sub.2] is
F = [frac{[([[hat{[rho]}].sub.1] -
[[hat{[rho]}].sub.2]).sup.2]}{[[[[[sum].sup.[T.sub.1]].sub.t=1][([[x.
sup.[mu]].sub.1,t-1]).sup.2]].sup.-1] +
[[[[[sum].sup.T].sub.t=[T.sub.2]][([[x.sup.[mu]].sub.2,t-1]).sup.2]].
sup.-1]}].
The limiting behavior follows from considering the terms
[U.sub.1,T] [equiv] [frac{1}{T}]
[[[sum].sup.[T.sub.1]].sub.t[equiv]1]
[[varepsilon].sub.t][[x.sup.[mu]].sub.1,t-1] [Rightarrow] [[[[integral
of].sup.[tau].sup.1].sub.0] [[J.sup.[mu]].sub.1,c](s)dW(s) [equiv]
[U.sub.1]
[U.sub.2,T] [equiv] [frac{1}{T}]
[[[sum].sup.T].sub.t[equiv][T.sub.2]]
[[varepsilon].sub.t][[x.sup.[mu]].sub.2,t-1] [Rightarrow] [[[integral
of].sup.1].sub.[[tau].sub.2]] [[J.sup.[mu]].sub.2,c](s)dW(s) [equiv]
[U.sub.2]
[V.sub.1,T] [equiv] [frac{1}{[T.sup.2]}]
[[[sum].sup.[T.sub.1]].sub.t[equiv]1] [([[x.sup.[mu]].sub.1,t-1]).sup.2]
[Rightarrow] [[[integral of].sup.[[tau].sub.1]].sub.0]
[([[J.sup.[mu]].sub.1,c](s)).sup.2]ds [equiv] [V.sub.1]
[V.sub.2,T] [equiv] [frac{1}{[T.sup.2]}]
[[[sum].sup.T].sub.t[equiv][T.sub.2]] [([[x.sup.[mu]].sub.2,t-1]).sup.2]
[Rightarrow] [[[integral of].sup.1].sub.[[tau].sub.2]]
[([[J.sup.[mu]].sub.2,c](s)).sup.2]ds [equiv] [V.sub.2].
Defining
[[gamma].sub.1,T] = T([[hat{[rho].sub.1}]] - [rho]) and
[[gamma].sub.2,T] = T([[hat{[rho].sub.2}]] - [rho]),
the F can be written as
F = [frac{[([[gamma].sub.1,T] -
[[gamma].sub.2,T]).sup.2]}{[[V.sup.-1].sub.1,T] +
[[V.sup.-1].sub.2,T]}].
Since
[[gamma].sub.1,T] =
[frac{[frac{1}{T}][[[sum].sup.[T.sub.1]].sub.t=1]
[[varepsilon].sub.t][[x.sup.[mu]].sub.1,t-1]}{[V.sub.1,T]}] =
[frac{[U.sub.1,T]}{[V.sub.1,T]}] [Rightarrow]
[frac{[U.sub.1]}{[V.sub.1]}] = [[gamma].sub.1]
and
[[gamma].sub.2,T] =
[frac{[frac{1}{T}][[[sum].sup.T].sub.t=[T.sub.2]]
[[varepsilon].sub.t][[x.sup.[mu]].sub.2,t-1]}{[V.sub.2,T]}] =
[frac{[U.sub.2,T]}{[V.sub.2,T]}] [Rightarrow]
[frac{[U.sub.2]}{[V.sub.2]}] = [[gamma].sub.2]
by the continuous mapping theorem, then
F [Rightarrow] [frac{[([[gamma].sub.1] -
[[gamma].sub.2]).sup.2]}{[[V.sup.-1].sub.1] + [[V.sup.-1].sub.2]}],
which provides a representation for the limiting distribution of F
in terms of functionals of the diffusions [J.sub.c](s).
A2 Approximating Prob-values
The limiting distribution of F is seen to depend on three
parameters [[tau].sub.1] [[tau].sub.2] (through the limits in the
integrals), and the value of c (through the mean reversion in the
diffusion process [J.sub.c]). Quantiles of the limiting distribution
(and hence Prob-values for the test statistic) can be approximated by
repeated simulations of F using a large sample size and for fixed values
of [[tau].sub.1] [[tau].sub.2] and c, and [[varepsilon].sub.t] chosen as
Niid(0, 1) random variables. The Prob-values reported in the article
resulted from 10,000 replications from a sample size of 500. The
parameters [[tau].sub.1] and [[tau].sub.2] were chosen as [T.sub.1]/T
and [T.sub.2]/T where [T.sub.1] denotes the first break point and
[T.sub.2] denotes the second break point. The distribution also depends
on c, which governs how close [rho] is to unity. Unfortunately, this
parameter cannot be consistently estimated. (Equivalently, in finite
samples the distribution of F depends on [rho], and small changes in
[rho]--like those associated with sampling error--lead to large changes
in the quantiles of this distribution.) Thus, selecting the correct
distribution of F requires knowledge of c (equivalently, p). Since c is
unknown, the distribution is computed for a range of values in -25 [leq]
c [leq] 10 and the resulting minimum and maximum Prob-value over all of
the values of c is reported in the table. Viewing c as unknown,
classical approaches (which must hold for all values of the
"nuisance parameter" c) would use the upper Prob-value. The
lower bound gives the smallest Prob-value that would be obtained if c
were known.