文章基本信息

标题：TESTING STRUCTURAL HYPOTHESES ON COINTEGRATION RELATIONS WITH SMALL SAMPLES.
作者：ZHOU, SU
期刊名称：Economic Inquiry
印刷版ISSN：0095-2583
出版年度：2000
期号：October
语种：English
出版社：Western Economic Association International
摘要：This study examines the finite-sample bias of Johansen's [1991] likelihood ratio tests for structural hypotheses on cointegration relations among economic variables through the Monte Carlo experiments. It is found that the Johansen tests with small samples are biased toward rejecting the null hypotheses more often than what asymptotic theory suggests, even after the test statistics are adjusted by Sims's correction. A bootstrap method for obtaining problem-specific critical values for the tests is proposed It is shown that using the bootstrap procedure may substantially reduce the small-sample bias. An empirical application of the procedure is demonstrated. (JEL C12, C22)
关键词：Economic research;Economics

TESTING STRUCTURAL HYPOTHESES ON COINTEGRATION RELATIONS WITH SMALL SAMPLES.

ZHOU, SU

SU ZHOU [*]

This study examines the finite-sample bias of Johansen's [1991] likelihood ratio tests for structural hypotheses on cointegration relations among economic variables through the Monte Carlo experiments. It is found that the Johansen tests with small samples are biased toward rejecting the null hypotheses more often than what asymptotic theory suggests, even after the test statistics are adjusted by Sims's correction. A bootstrap method for obtaining problem-specific critical values for the tests is proposed It is shown that using the bootstrap procedure may substantially reduce the small-sample bias. An empirical application of the procedure is demonstrated. (JEL C12, C22)

I. INTRODUCTION

Since Nelson and Plosser [1982] discovered that most macroeconomic variables have a nonstationary time series structure, cointegration analysis has been substantially developed and widely applied in empirical studies in various areas of economics. Cointegration analysis offers a natural way to test for the existence of long-run relationships among nonstationary economic variables. A set of variables, [X.sub.t], is said to be cointegrated if these variables are individually nonstationary, but at least one linear combination of them, [z.sub.t] = [beta]'[X.sub.t], exists that is stationary. The vector [beta] is referred to as the cointegration vector. If [X.sub.t] is a cointegrated system, we may say that the variables in [X.sub.t] do not drift "too far" apart and there exists one or more long-run equilibrium relationships among these variables.

Cointegration analysis mainly consists of three segments: Tests for the rank of cointegration, estimation of cointegration vectors, and tests for structural hypotheses suggested by economic theory. One of the most popular methods of cointegration analysis is the multivariate maximum-likelihood approach of Johansen [1988, 1991]. The Johansen approach yields trace statistics and maximal eigenvalue statistics for identifying the number of cointegration vectors and provides consistent maximum-likelihood estimates of the coefficients of cointegration vectors. In addition, Johansen [1991] and Johansen and Juselius [1990, 1992] develop the likelihood ratio tests for structural hypotheses, that is, the hypotheses of linear restrictions on the cointegration relations, with a multivariate error correction model.

Johansen and Juselius [1990, 1992] have tabulated the asymptotic critical values for both the maximal eigenvalue and the trace tests. Also, Johansen [1991] has derived the asymptotic distributions of the test statistics for structural hypotheses. Their critical values from the asymptotic distributions have been extensively employed in the cointegration studies. Unfortunately, empirical studies of long-run (cointegration) relationships are limited by finite sample sizes. It could be misleading to draw conclusions based on a small sample study while using the asymptotic critical values, which are appropriate only for large samples.

Cheung and Lai [1993] investigate the accuracy of Johansen and Juselius's asymptotic approximations by computing the finite-sample distributions of the maximal eigenvalue and the trace statistics; they find that the asymptotic critical values are more liberal than the finite-sample critical values. That is, using the asymptotic critical values, one would reject the null of no cointegration too often when the sample size of the data used is rather small. There have been a number of other studies examining the small sample performance of several popular estimation and testing procedures designed for cointegrated systems, including the Johansen approach. [1] However, as far as I know, none of the existing studies has systematically investigated the small sample performance of any test for structural hypotheses. The present study intends to fill this gap and thus expand this body of literature.

There is an obvious reason that researchers have overlooked the small sample problem of the tests for structural hypotheses on cointegration relations. Unlike Johansen's likelihood ratio (LR) tests for cointegration rank, which have nonstandard asymptotic distributions and therefore previous results about small-sample bias of standard LR tests do not apply to those tests, the asymptotic distributions of the LR tests for structural hypotheses are shown by Johansen [1991] to be the usual [[chi].sup.2] distributions. The finite-sample bias of such LR tests has been widely acknowledged. A well- known finite-sample correction of standard LR tests was recommended by Sims [1980] and has been extensively utilized in the empirical literature and introduced in almost every textbook in multivariate time series. Yet it is demonstrated in this paper that Sims's adjustment is hardly capable of correcting the small-sample bias of the Johansen LR tests for structural hypotheses.

In this study, I examine the finite-sample bias of Johansen's [1991] LR tests for structural hypotheses on cointegration relations with and without Sims's correction. A bootstrap procedure for obtaining problem-specific critical values for the tests is proposed.

The rest of the paper is organized as follows. Section II briefly describes Johansen's tests for the rank of cointegration and estimation of cointegration vectors and then introduces the Johansen tests for structural hypotheses in a multivariate cointegrated system. Tests for both long-run exclusion and some other linear restrictions on the cointegration space are considered. Section III gives the design of the Monte Carlo experiments applied in the study for examining the small-sample bias of the LR tests for structural hypotheses. The simulation results are reported and summarized in the same section. In section IV, I analyze the effects of the presence of a constant term in the model and/or of a linear time trend in the data generating process on the finite-sample behavior of the test statistics. Section V suggests the use of the bootstrap procedure to obtain problem-specific critical values for the LR tests and investigates the performance of the bootstrap method. An example of using the bootstrap critic al values is illustrated in section VI. Section VII presents the conclusions of this study.

II. COINTEGRATION TESTS AND TESTS FOR STRUCTURAL HYPOTHESES

The Johansen tests are conducted through a vector error-correction (VEC) mechanism. For a vector of n variables, [X.sub.t], a VEC model can be written as

(1) [delta][X.sub.t] = [[[sigma].sup.k-1].sub.j=1] [[gamma].sub.j][delta][X.sub.t-j] + [pi][X.sub.t-1] + [micro] + [[epsilon].sub.t],

where [[epsilon].sub.t], is a vector of independent Gaussian variables with mean zero and variance matrix [sigma]. [micro] is a constant term. The hypotheses of interest involve [pi]; if the rank of [pi] is q, where q [less than or equal to] n - 1, then [pi] can be decomposed into two n x q matrices [alpha] and [beta] such that [pi] = [alpha][beta]'. The matrix [beta] consists of q linear cointegration vectors, while [alpha] can be interpreted as a matrix of vector error-correction parameters. The maximum likelihood estimates of [alpha] and [beta] are obtained by regressing [delta][X.sub.t] and [X.sub.t-1] on [delta][X.sub.t-1],..., [delta][X.sub.t-k+1] and 1. This gives residuals [R.sub.0t] and [R.sub.1t] and residual product matrices

[S.sub.ij] = [T.sup.-1] [[[sigma].sup.t].sub.t=1] [R.sub.it][R'.sub.jt], i,j = 0, 1.

One may solve the eigenvalue system

\[lambda][S.sub.11] - [S.sub.10][[S.sup.-1].sub.00][S.sub.01]\ = 0

for eigenvalues [[lambda].sub.1] [greater than] ... [greater than] [[lambda].sub.n], and eigenvectors V = ([v.sub.1],..., [v.sub.n]). The estimates of [alpha] and [beta] are given by [alpha] = [S.sub.01][beta] and [beta] = ([v.sub.1],..., [v.sub.q]), where [v.sub.1],..., [v.sub.q] are the eigenvectors associated with the q largest eigenvalues. The number of cointegration vectors could be determined using Johansen's maximal eigenvalue statistics and/or the trace statistics. The maximal eigenvalue ([[lambda].sub.max]) statistic for the null hypothesis of q cointegration vectors against the alternative of q + 1 cointegration vectors is

[[lambda].sub.max] = -T ln(1 - [[lambda].sub.q+1]),

and the trace statistic for the null hypothesis of at most q cointegration vectors is

Trace = -T [[[sigma].sup.n].sub.j=q+1] ln(1 - [[lambda].sub.j]).

If the results are consistent with the hypothesis of at least one cointegration vector, structural hypotheses, that is, the hypotheses regarding the restrictions on [beta]'s, could be tested using the maximum likelihood methodology.

Testing structural hypotheses is an important segment of cointegration analysis. Important long-run equilibrium relationships based on economic theory can be tested in a cointegration framework. Consider the following two examples:

1. Long-Run Exclusion. It occurs frequently that only a subset of the variables in [X.sub.t] can be assumed to be relevant for the long-run relations. In this case there is a need to test the hypothesis of exclusion of a subset of variables, [[beta].sub.i] = [[beta].sub.j] = ... = 0, from the long-run relations.

2. Linear Restrictions on the Cointegration Space. Economic theory often suggests certain long-run equilibrium relationships existing among a set of economic variables. These equilibrium relationships have been utilized as important building blocks in economic modeling. For instance, two long-run relationships are frequently used in monetary and international economics. One is purchasing power parity (PPP), which plays an important role in international economic modeling. The PPP condition implies a cointegration relation between the exchange rate (ER) of two currencies and the price levels ([P.sub.A] and [P.sub.B]) of the two relevant countries: [z.sub.t] = [[beta].sub.pa] ln [P.sub.At] + [[beta].sub.er] ln [ER.sub.t] + [[beta].sub.pb] ln [P.sub.Bt] with a cointegration vector [[[beta].sub.pa], [[beta].sub.er], [[beta].sub.pb]] = [-1, 1, 1]. Another is a long-run money relation linking nominal money (M), price level (P), real income (Y), and interest rate (R): [z.sub.t] = [[beta].sub.m] ln [M.sub.t]+[[beta].sub.p] ln [P.sub.t]+[[beta].sub.y] ln [Y.sub.t]+[[beta].sub.R][R.sub.t]. Monetary theory suggests long-run price homogeneity: [[[beta].sub.m], [[beta].sub.p], [[beta].sub.y], [[beta].sub.R]] = [-1, 1, [[beta].sub.y], [[beta].sub.R]], or long-run income homogeneity: [[[beta].sub.m], [[beta].sub.p], [[beta].sub.y], [[beta].sub.R]] = [-1, [[beta].sub.p], 1, [[beta].sub.R]], or both: [[[beta].sub.m], [[beta].sub.p], [[beta].sub.y], [[beta].sub.R]] = [-1, 1, 1, [[beta].sub.R]]. These relations could be examined through an analysis of the likelihood function by imposing some linear restrictions on the cointegration relations.

The likelihood ratio (LR) test procedure for these structural hypotheses is prescribed by Johansen [1991]. The test is formulated by Johansen and Juselius [1990, 1992] as

[H.sub.2] : [beta] = H[phi]

where the matrix H of dimension n x m imposes (n - m) restrictions on i = 1, ..., q cointegration vectors. The test statistic is calculated as

Q([H.sub.2]\[H.sub.1]) = -T [[[sigma].sup.q].sub.i=1]ln[(1 - [[lambda].sub.1,i])/(1 - [[lambda].sub.2,i])],

where [[lambda].sub.1,i], and [[lambda].sub.2,i] are the ith estimated eigenvalues in the unrestricted model [H.sub.1] and the restricted model [H.sub.2], respectively. Johansen [1991] shows that the asymptotic distribution of the test statistics when there exists only one cointegration vector is [[chi].sup.2] with degrees of freedom equal to the number of restrictions (n - m).

However, empirical analyses necessarily deal with finite samples. It is widely recognized that finite-sample studies with the LR test statistics using the asymptotic critical values tend to bias the tests toward rejecting the structural hypotheses too often. Therefore, using (T - c)--where c is the number of parameters in the unrestricted equation (per equation)--instead of T in forming the LR test statistics has been recommended by almost every textbook in multivariate time series, a correction usually attributed to Sims [1980, 17]. Both the degree of the finite sample bias of the LR tests as well as the ability of Sims's adjustment in correcting the finite-sample bias are investigated by the following experiments.

III. THE MONTE CARLO SIMULATION EXPERIMENTS

I examine the finite-sample bias of the LR tests for structural hypotheses through the Monte Carlo simulations, where the simulated test statistics, with or without Sims's correction, are compared with the asymptotic critical values. The study is limited to the test statistics for one and two linear restrictions, allowing only one cointegration vector (q = 1) in the system. In the experiments, multivariate models with the dimensions n = 2, 3, 4, 5 are generated. The numbers of effective observations are T = 100 and 500 and the lag length of equation (1) is set by k = 1, 2, 4, 6. [X.sub.t] is generated by setting the initial values of [X.sub.0] equal to zero and creating T + k + 100 observations, of which the first 100 observations are discarded to minimize the effect of the initial condition. The GAUSS matrix programming language is used to write the computer programs for this study, and GAUSS's RNDN functions are used to generate pseudo-random normal innovations. Each simulation experiment consists of 20,000 replications for a sample size T, a number of variables n, and a lag length k. The data-generating processes (DGPs) for the experiments are similar to the ones utilized by Engle and Granger [1987], Gonzalo [1994], and Phillips [1994]. The true lag of the DGPs is set to be k = 1.

To test for the long-run exclusion of a variable, the following DGP(1) is used: For n [greater than] 2, [X.sub.t] is set by [X.sub.t] = [[y.sub.t], [W.sub.t]] and [W.sub.t] = [[W.sub.1t], [w.sub.2t]] with [W.sub.t] being an (n - 1)-dimensional random walk process, that is, [W.sub.t] = [W.sub.t-1] + [e.sub.wt] or [delta][W.sub.t] = [e.sub.wt], [w.sub.2t] being a random-walk variable (within [W.sub.t]) to be excluded from the cointegration relation. The cointegration equation is expressed as [beta]'[X.sub.t] = [[beta].sub.1][y.sub.t] + [[beta].sup.*][W.sub.1t] = [z.sub.t] with [beta]' = [[[beta].sub.1], [[beta].sub.2], ..., [[beta].sub.n]] = [[[beta].sub.1], [[beta].sup.*], 0] being the cointegration vector where [[beta].sup.*] = [[[beta].sub.2], ..., [[beta].sub.n-1]] and [[beta].sub.n] being restricted to be zero, [z.sub.t] = [rho][z.sub.t-1] + [e.sub.zt], [absolute val. of [rho]] [less than] 1. [u.sub.t] = [[e.sub.zt], [e.sub.wt]]' is an n x 1 vector of independently distributed normal errors of mean zero and covariance matrix [I.sub.n]. The null hypothesis of the test is [H.sub.0]: [[beta].sub.i] = 0 or

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

where [omega] is a 1 x (n - 2) vector of zeros.

For n = 2, [X.sub.t] is set by [X.sub.t] = [[y.sub.t], [w.sub.t]]' with [[beta].sub.1][y.sub.t] = [z.sub.t], [z.sub.t] = [rho][z.sub.t-1] + [e.sub.zt], [absolute val. of [rho]] [less than] 1, [delta][w.sub.t] = [e.sub.wt] and [u.sub.t] = [[e.sub.zt], [e.sub.wt]]'. That is, the null hypothesis is [H.sub.0]: [[beta].sub.2] = 0 or

[beta] = H[phi] = [1 0] [[beta].sub.1] = [[[beta].sub.1], 0]'

To test other linear restrictions on the cointegration space, we use the following DGP(2): [X.sub.t] = [[y.sub.t], [W.sub.t]]' with [W.sub.t] being an (n - 1)-dimensional random walk process, that is, [delta][W.sub.t] = [e.sub.wt]. The cointegration equation is [beta]'[X.sub.t] = [[beta].sub.1][y.sub.t] + [gamma][W.sub.t] = [z.sub.t] with [beta]' = [[[beta].sub.1], [gamma]] being the cointegration vector where [gamma] = [[[beta].sub.2], ..., [[beta].sub.n]], [z.sub.t] = [rho][z.sub.t-1] + [e.sub.zt], [absolute val. of [rho]] [less than] 1. Again, [u.sub.t] = [[e.sub.zt], [e.sub.wt]]' is an n x 1 vector of independently distributed normal errors of means zero and covariance matrix [I.sub.n].

The first linear restriction taken into consideration was the equality of two coefficients of the cointegration vector, [[beta].sub.i] = [+ or -][[beta].sub.j]. We set [H.sub.0]: [[[beta].sub.1], [[beta].sub.2], [[beta].sub.3], ..., [[beta].sub.n]] = [[[beta].sub.1], -[[beta].sub.1], [[gamma].sup.*]] where [[gamma].sup.*] = [[[beta].sub.3], ..., [[beta].sub.n]]. This is, [H.sub.0]: [[beta].sub.1] = -[[beta].sub.2] or

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

This restriction matches the hypothesis of proportional co-movements of two variables (like the long- and short-term interest rates, consumption and income, money demand and price level, or two countries' price levels, etc.) in a cointegrated system. It is not surprising to find that the finite- sample distributions of the test statistics for such a restriction, corresponding to different model dimensions, are almost the same as those for testing the long-run exclusion of a single variable because either of the two is testing for one linear restriction. Therefore, only the results for testing the long-run exclusion are reported.

I then study the finite-sample bias problem associated with the test statistics for two linear restrictions for the models of n = 3, 4, 5. The null hypothesis is set as [H.sub.0]: [[[beta].sub.1], [[beta].sub.2], [[beta].sub.3], [[beta].sub.4],...,[[beta].sub.n]] = [[[beta].sub.1], -[[beta].sub.1], -[[beta].sub.1], [[gamma].sup.**]] where [[gamma].sup.**] = [[[beta].sub.4],..., [[beta].sub.n]]. That is, [H.sub.0]: [[beta].sub.1] = -[[beta].sub.2] = -[[beta].sub.3] or

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

= [[[beta].sub.1], -[[beta].sub.1], -[[beta].sub.1], [[gamma].sup.**]]'

here [omega] is a 1 x (n - 3) vector of zeros. Note that the distributions of these test statistics are not affected by the values of [beta]'s to be set under the null. The tests of two linear restrictions are applicable for testing the PPP condition as well as the long-run monetary relation, as mentioned in section II.

Table I reports the results of the Monte Carlo experiments with and without Sims's correction. Consistent with the findings in the existing literature, size distortions of the LR tests are greater when the sample size, T, is smaller. The degree of size distortion is positively related to [rho], to the model dimensions, n, and to the number of lags, k. These suggest that finite-sample studies based on the asymptotic critical values would bias the tests toward rejecting the structural hypotheses too often for high dimensional models, when the sample size is small, and/or selected lag length is long. This bias would become more severe when [rho] is closer to 1, that is, when the cointegrating residual [z.sub.t] is highly serially correlated. [2] As can be seen in Table I, Sims's correction appears to be incompetent for reducing size distortions.

IV. THE EFFECTS OF DIFFERENT TREND SPECIFICATIONS ON THE TESTS

The practice of section III is consistent with the analysis conducted by Johansen and Juselius [1990] in their study for the Finnish data, where the DGP has no linear trend but a constant term, [micro], is included in the VEC model of equation (1) to capture the impact of a possible time trend in the sample variables. I call it Case [1.sup.*]. Johansen has realized that the critical values for his popular test for cointegration rank depend on the structure of the deterministic components that prevail in the system. Three alternative deterministic-trend specifications are considered in Johansen [1988], Johansen and Juselius [1990], and Johansen [1992]. Although these alternative specifications would not affect the asymptotic distributions of the test statistics for structural hypotheses (they are still [[chi].sup.2] distributions), they might have different effects on the finite-sample distributions. Correspondingly, we investigate the three cases for the effects of different trend specifications on the finit e-sample tests for structural hypotheses.

The DGP with alternative trend specifications can be expressed by [beta]'[X.sub.t] + [[beta].sub.0] = [z.sub.t], where [X.sub.t] = [[y.sub.t], [W.sub.t]]' with [W.sub.t] being an (n - 1) - dimensional random walk process with a drift [a.sub.w] (i.e., [delta][W.sub.t] = [a.sub.w] + [e.sub.wt] or [W.sub.t] = [a.sub.w]t + [[W.sup.*].sub.t] with [[W.sup.*].sub.t] = [[W.sup.*].sub.t-1] + [e.sub.wt]). [[beta]' [[beta].sub.0]]' is the cointegration vector with [[beta].sub.0] being a constant term and [z.sub.t] = [rho][z.sub.t-1] + [e.sub.zt] [absolute val. of [rho]] [less than] 1. The estimated VEC model is

(2) [delta][X.sub.t] = [[[sigma].sup.k-1].sub.j=1] [[gamma].sub.j] [delta][X.sub.t-j] + [pi][[X.sup.*].sub.t-1] + [micro] + [[epsilon].sub.t].

For Case [1.sup.*], [[beta].sub.0] and [a.sub.w] are zero. [micro] is unrestricted, and [[X.sup.*].sub.t-1] = [X.sub.t-1] in the VEC model. The three alternative cases are:

* Case 1 (Johansen [1988]): [[beta].sub.0] = 0 and [a.sub.w] = 0. [micro] = 0 and [[X.sup.*].sub.t-1] [X.sub.t-1] in the VEC model.

* Case 2 (Johansen and Juselius [1990]; Johansen [1992]): [a.sub.w] = 0, [micro] = 0, but [[beta].sub.0] [not equal to] 0 and [[X.sup.*].sub.t-1] = ([X'.sub.t-1], 1), in the VEC model. That is, [X.sub.t] has no time trend, but there is a constant term [[beta].sub.0] confined to the cointegration equation of the DGP.

* Case 3 (Johansen and Juselius [1990]; Johansen [1992]): [[beta].sub.0] = 0 [a.sub.w] [not equal to] 0, that is, there is a linear time trend present in the DGP. [micro]. is unrestricted, and [[X.sup.*].sub.t-1] = [X.sub.t-1] in the VEC model.

Table II shows the results of Case 1 with T = 100. It seems that Case 1 has less size distortions than Case [1.sup.*] corresponding to various n and k. In other words, when there is no linear time trend in the DGPs (for either Case 1 or Case [1.sup.*]), the absence of the constant term from the VEC models leads to less finite-sample distortion of test size. Nevertheless, both Case 1 and Case [1.sup.*] suffer large size distortions associated with small samples. [3] Again, Sims's correction is unable to reduce size distortions effectively.

The results of the Monte Carlo experiments for Cases 2 and 3 are not presented because they are very similar to those of Case [1.sup.*]. [4] This suggests that (a) having a constant term in the cointegration relation of [X.sub.t] (i.e., Case 2) yields very similar finite-sample distributions of the test statistics to those of the case allowing a constant term in the VEC model but not in the cointegration equation of the DGP (i.e., Case [1.sup.*]); and (b) having a time trend in the DGP (Case 3) or not (Case [1.sup.*]) does not have much influence on the finite-sample distributions of the LR test statistics for structural hypothesis as long as the constant term [micro] is included in the VEC model.

V. THE BOOTSTRAP SIMULATION

Because Sims's modification of the test statistics fails to correct the finite-sample bias of the LR tests for structural hypotheses, there is a need for the ways of dealing with the finite-sample bias problem. Since the finite-sample distributions of the LR test statistics are dependent on nuisance parameters (as revealed in the last two sections), tabulating critical values based on some simple DGPs would be of little use. For practical purposes, it seems appropriate to obtain problem-specific critical values for the LR tests by employing the bootstrap technique.

The bootstrap can be utilized to simulate the finite-sample distributions of the LR test statistics for structural hypotheses under the null. The procedure for the variables that may have a linear trend in the data is summarized by the following steps.

1. We apply the Johansen LR tests for structural hypotheses to actual data to obtain the test statistics, Q([H.sub.2]\[H.sub.1]), as described in section II, with the VEC model of equation (1).

2. Under the null hypothesis [H.sub.2]: [beta] = H[phi], we estimate the model

(3) [delta][X.sub.t] = [[[sigma].sup.k-1].sub.j=1] [[gamma].sub.i][delta][X.sub.t-j] + [alpha][beta]'[X.sub.t-1] + [micro] + [[epsilon].sub.t] by solving

\[lambda]H'[S.sub.11]H - H'[S.sub.10][[S.sup.-1].sub.00][S.sub.01]H\ = 0

for eigenvalues [[lambda].sub.1] [greater than] ... [greater than] [[lambda].sub.n] and corresponding eigenvectors V = ([v.sub.1],...,[v.sub.n]). The maximum likelihood estimator of [beta] is obtained by [beta]=H[phi] where [phi] = ([v.sub.1],...,[v.sub.q]). [[gamma].sub.i], [alpha], and [micro] are then calculated following Johansen and Juselius [1992, 174 and 176]. The residuals [[epsilon].sub.t], generated in equation (3), are multiplied by a scale factor of [[T/(T - c)].sup.1/2], denoted as [[epsilon].sub.t], where c represents the loss of degrees of freedom in estimating equation (3). [5]

3. We form the bootstrap disturbances, denoted as [[[epsilon].sup.*].sub.t], by randomly sampling with replacement from [[epsilon].sub.t] and define

(4) [delta][[X.sup.*].sub.t]=[[[sigma].sup.k-1].sub.j=1][[gamma].sub.j][d elta][[X.sup.*].sub.t-j]+[alpha][beta]'[[X.sup.*].sub.t-1]+[micro]+[[ [epsilon].sup.*].sub.t]

using the actual data for the first k -1 observations as initial conditions.

4. We then apply the Johansen LR tests to [[X.sup.*].sub.t] and obtain the test statistics, [Q.sup.*]([H.sub.2]\[H.sub.1]).

5. Repeating steps 3 and 4 N number of times and ordering the test statistics to be [[Q.sup.*].sub.1] [less than or equal to] [[Q.sup.*].sub.2] [less than or equal to] ... [less than or equal to] [[Q.sup.*].sub.N], we obtain the [delta]% critical value, which is [[Q.sup.*].sub.[delta]N]/100.

6. The null hypothesis is rejected at the [delta]% significance level if the Q([H.sub.2]\[H.sub.1]) statistic of step 1 is greater than the [delta]% bootstrap critical value.

For the variables with no linear trend in the data, one may simply remove the constant term [micro] from equations (1) and (3) and [micro] from equation (4) when conducting the bootstrap simulation following the above procedure. Besides, the residuals [[epsilon].sub.t], generated by step 2, have to be scaled and centered, that is, [[epsilon].sub.t] = [[T/(T-c)].sup.1/2]([[epsilon].sub.t]-[T.sup.-1] [[[sigma].sup.T].sub.t=1][[epsilon].sub.t]), where [T.sup.-1] [[[sigma].sup.T].sub.t=1] [[epsilon].sub.t] may be nonzero as [micro] is removed from equation (3).

To check the performance of the bootstrap procedure, we apply the procedure to the DGP utilized earlier in producing Table I and Table II for T = 100 and n = 3 to see if this procedure may help lessen the size distortion problem for Case [1.sup.*] and Case 1. The results listed in Table III are generated with 1,000 replications. For each replication, the bootstrap critical values are estimated with 400 bootstrap samples (N = 400). Compared to the results of Table I and Table II, Panel A of Table III reflects that the small-sample size distortions of the LR tests are substantially reduced as the tests are conducted with the bootstrap critical values. There exist only minor size distortions when the cointegrating residuals are moderately serially correlated (i.e., when [rho] is less than or around 0.7).

In Panel A of Table III, the estimated empirical sizes of the tests slightly vary with the lag lengths of the VEC models. I would like to point out that the results for k [greater than] 1 are attained by overparameterization by including more lags in the VEC models than are in the designed DGPs (k = 1). The results suggest that the LR tests for structural hypotheses with the bootstrap procedure are not very sensitive to overparameterization.

To further investigate the effect of lag specifications, similar to the method of Cheung and Lai [1993], I conducted the experiments based on the following DGPs for n = 3:

[[[beta].sub.1], [[beta].sub.2]][[y.sub.t], [w.sub.1t]]'

= [z.sub.t] for testing the null of [[beta].sub.3] = 0,

[[[beta].sub.1], [[beta].sub.2], [[beta].sub.3]][[y.sub.t], [w.sub.1t], [w.sub.2t]]'

= [z.sub.t] for testing the null of [[beta].sub.1]

(5) = -[[beta].sub.2] = -[[beta].sub.3],

[z.sub.t] = [rho][z.sub.t-1]+[e.sub.zt], with

[w.sub.1t] = [w.sub.1t-1] = [[phi].sub.1]([w.sub.1t-1] - [w.sub.1t-2])+[e.sub.w1t],

and

[w.sub.2t] = [w.sub.2t-1] + [[phi].sub.2]([w.sub.2t-1] - [w.sub.2t-2]) + [e.sub.w2t],

where [e.sub.w1t], [e.sub.w2t], and [e.sub.zt] are independent Gaussian zero-mean white noise innovations. Arbitrarily, the parameters [[phi].sub.1] and [[phi].sub.2] are set equal to 0.7 and 0.2, respectively. This is a model with autoregressive dependence, which implies that the appropriate lage length of the VEC model is k = 2. The estimated test sizes for T = 100 and [rho] = 0.7 are presented in Panel B of Table III. They indicate that when using the bootstrap critical values, the biases in the empirical sizes of the LR tests are reasonably small for the models specified with the right lag length (i.e., k = 2) as the cointegrating residual is no more than moderately serially correlated. The results confirm that the test sizes are not very sensitive to overparameterization (as k [greater than] 2). However, the distortion of the test size is severe in the case of underparameterization (for k = 1).

It is also found that (a) if we allow a constant term [micro] in the VEC models of equations (1) and (3), and an intercept [micro] in equation (4), the estimated test sizes appear to be nearly unaffected by whether there is a linear trend in the DGP or not; (b) when there is no linear trend in the DGP, if we allow [micro] in the VEC models of equations (1) and (3), the resampling scheme of equation (4) containing an intercept [micro] or not has little effect on the simulation results. [6]

VI. AN EXAMPLE OF STRUCTURAL HYPOTHESIS TESTING WITH THE BOOTSTRAP

In this section, the bootstrap method is applied to a data set for testing the long-run validity of PPP for France and Germany. The study uses 100 observations of quarterly data from 1974:1 through 1998:4, with observations from 1972:1 through 1973:4 available for presample lags as needed. The price variables ([P.sub.F] and [P.sub.G]) are the consumer price indexes of France and Germany, and the exchange rate (French franc price of a German mark) is the average market exchange rate, taken from the International Financial Statistics of the International Monetary Fund.

Note that the proposed bootstrap procedure has to be performed under the conditions that the rank of cointegration q has been determined, and the number of the lagged first differences in the VEC model k - 1 is properly specified. I chose the lag length of the VEC model based on the criteria that (a) additional lags are not statistically significant and (b) the residuals from equation (1) are white noise. The former is examined by the step-down testing procedure of Campbell and Perron [1991], beginning with a lag length [k.sub.max] = 8, while the latter is checked by the well-known Q test of Box and Pierce [1970] modified by Ljung and Box [1978] for small samples. The determination of the cointegration rank is on the basis of the Johansen cointegration rank tests, as described in section II. The finite-sample critical values of the cointegration rank tests are calculated following Cheung and Lai [1993, 318].

Since the price variables have an apparent time trend in the data, we allow a constant term in the VEC model and perform the tests and the bootstrap accordingly. The test statistics and the corresponding critical values are given in Table IV. The asymptotic 5% critical values of the LR tests for structural hypotheses are adjusted by Sims's correction of T/T-c). The lag length chosen by step-down testing is k = 7. The Q-statistics of Ljung and Box [1978] fail to reject the null that the residuals from the VEC models of equation (1) with k = 7 are white noise. [7] The results show that the hypothesis of no cointegration is rejected at the 5% significance level, whereas the hypothesis of one cointegration relation is not rejected. The long-run exclusion hypotheses of [[beta].sub.pf] = 0, [[beta].sub.pg] = 0 and [[beta].sub.er] = 0 are clearly rejected. We thus conclude that there exists one cointegration vector among In [P.sub.F], ln [P.sub.G], and In ER. For the hypothesis of PPP relation [[[beta].sub.pf], [[beta].sub.er], [[beta].sub.pg]] = [-1, 1, 1], using the asymptotic critical values adjusted by Sims's correction would be subject to over-rejection. When comparing the LR test statistic with our 5% bootstrap critical value, we are unable to reject the null of [[[beta].sub.pf], [[beta].sub.er], [[beta].sub.pg]] = [-1, 1, 1]. The study provides significant evidence that PPP holds well for France and Germany.

VII. CONCLUSIONS

In this study, the finite-sample bias of Johansen's [1991] LR tests for structural hypotheses (the hypotheses of linear restrictions on the cointegration relations) has been examined. The Monte Carlo experiments have been conducted to show the roles of the sample size, the lag order, and the dimension of the variable system in determining the finite-sample bias of the Johansen tests. The effects of the presence of deterministic terms in the DGPs or in the VEC model on the finite-sample behavior of the test statistics are also studied.

It is found that Johansen's tests are biased toward rejecting the null hypotheses more often than what asymptotic theory suggests, even after the test statistics are adjusted by Sims's correction. The finite-sample bias magnifies as the lag length or the dimension of the estimated system increases. Moreover, the bias problem would become more serious when the cointegrating residual is highly serially correlated.

The simulation results also indicate that the degrees of the finite-sample bias of the LR tests are very similar for the cases where there is a constant term in the VEC model regardless of the presence of a linear trend in the DGP or not (Case [1.sup.*] and Case 3) as well as for the case of a constant term confined to the cointegration equation of the DGP (Case 2). However, when the constant term is absent from both the DGP and the VEC model (Case 1), the finite-sample bias of the test statistic is somewhat smaller than those of the other three cases.

I recommend the use of the bootstrap method to obtain problem-specific critical values for the LR tests. An empirical application of the proposed bootstrap procedure is demonstrated. It is shown that using the bootstrap procedure may substantially reduce the small-sample bias. It is also illustrated that the LR tests for structural hypotheses with the bootstrap method are robust to overparameterization in the lag length of the VEC models, but not so to underparameterization when using low order VEC models.

(*.) I would like to thank Dennis Hoffman, Michael Melvin, Asatoshi Maeshiro, Mary McGarvey, and session participants at the 1998 WEA International Conference for their helpful suggestions and comments on earlier drafts of this paper. I also wish to thank an anonymous referee and the editor of Economic Inquiry for their valuable suggestions and guidance. This research is supported by a summer research grant of the College of Business of the University of Texas at San Antonio.

Zhou: Associate Professor, Division of Economics and Finance, University of Texas at San Antonio, 6900 North Loop 1604 West, San Antonio, Tex. 78249-0633. Phone 1-210-458-4315, Fax 1-210-458-5837, E-mail szhou@lonestar.jpl.utsa.edu

(1.) See Gonzalo [1994], Phillips [1994], and Haug [1996].

(2.) For a better understanding of [rho], it is worth pointing out that the value of [rho] is inversely related to that of [alpha], the error-correction parameter. Consider a two-variable system, [X.sub.t] = [[y.sub.t], [w.sub.t]]', where (a) [w.sub.t] = [w.sub.t-1] + [e.sub.wt], a random walk process, (b) [y.sub.t] = [beta][w.sub.t] + [z.sub.t], the cointegration relation, and (c) [z.sub.t] = [rho][z.sub.t-1] + [rho][e.sub.zt]. One may rewrite (b) as [y.sub.t] - [y.sub.t-1] = [beta]([w.sub.t] - [w.sub.t-1]) + [z.sub.t] - ([y.sub.t-1] - [beta][w.sub.t-1]), that is [delta][y.sub.t] = [beta][e.sub.wt] + [z.sub.t] - [z.sub.t-1] = [beta][e.sub.wt] + [rho][z.sub.t-1] + [e.sub.zt] - [z.sub.t-1] = -(1 - [rho])[z.sub.t-1] + [beta][e.sub.wt] + [e.sub.zt], where [alpha] = 1 - [rho] reflects the speed of adjustment of [y.sub.t] to past deviations ([z.sub.t-1]) from the long-run equilibrium.

(3.) Note that all results except those for k = 1, reported in Table I and Table II, correspond to over-parameterized models. Cheung and Lai [1993] show that the Johansen cointegration tests are rather sensitive to underparameterization in the lag length but not so to overparameterization. Following their procedure, I investigate the effect of lag specification on the tests for structural hypotheses for both Case 1 and Case [1.sup.*] and obtain results that are consistent with their conclusion. The results are not reported but are available on request.

(4.) The results of Case 2 are obtained by setting [[beta].sub.0] = 0.5. It is found that the finite-sample distribution of the test statistic for structural hypothesis is invariant to the value of [[beta].sub.0]. For Case 3, the results vary with different values of [a.sub.w], but the effects of different [a.sub.w] values are rather small.

(5.) This scale factor adjusts the variance of [[epsilon].sub.t] to account for the loss of c degrees of freedom in estimating equation (3).

(6.) The details of these results are not reported. They are available from the author on request.

(7.) These test statistics are not reported but are available from the author on request.

REFERENCES

Box, George E. P., and David A. Pierce. "Distribution of Residual Autocorrelations in Autoregressive-Integrated Moving Average Time Series Models." Journal of the American Statistical Association, December 1990, 1509-26.

Campbell, John Y., and Pierre Perron. "Pitfalls and Opportunities: What Macroeconomists Should Know about Unit Roots." NBER Macroeconomics Annual, 1991, 141-201.

Cheung, Yin-Wong, and Kon S. Lai. "Finite-Sample Sizes of Johansen's Likelihood Ratio Tests for Cointegration." Oxford Bulletin of Economics and Statistics, August 1993, 313-28.

Engle, Robert F., and Clive W. J. Granger. "Cointegration and Error Correction: Representation, Estimation and Testing." Econometrica, March 1987, 251-76.

Gonzalo, Jesus. "Five Alternative Methods of Estimating Long-run Equilibrium Relationships." Journal of Econometrics, January/February 1994, 203-33.

Haug, Alfred A. "Tests for Cointegration: A Monte Carlo Comparison." Journal of Econometrics, 71, March/April 1996, 89-116.

Johansen, Soren. "Statistical Analysis of Cointegration Vectors." Journal of Economic Dynamics and Control, June/September 1988, 231-54.

-----. "Estimation and Hypothesis Testing of Cointegration Vectors in Gaussian Vector Autoregressive Models." Econometrica, November 1991, 1551-80.

-----. "Determination of Cointegration Rank in the Presence of a Linear Trend." Oxford Bulletin of Economics and Statistics, August 1992, 383-97.

Johansen, Soren, and Katarina Juselius. "Maximum Likelihood Estimation and Inference on Cointegration--with Applications to the Demand for Money." Oxford Bulletin of Economics and Statistics, May 1990, 169-210.

-----. "Testing Structural Hypotheses in a Multivariate Cointegration Analysis of PPP and the UIP for UK." Journal of Econometrics, July/September 1992, 211-44.

Ljung, G. M., and George E. P. Box. "On a Measure of Lack of Fit in Time Series Models." Biometrika, August 1978, 297-303.

Nelson, Charles R., and Charles I. Plosser. "Trends and Random Walks in Macroeconomic Time Series." Journal of Monetary Economics, September 1982, 139-62.

Phillips, Peter C. B. "Some Exact Distribution Theory for Maximum Likelihood Estimators of Cointegrating Coefficients in Error Correction Models." Econometrica, January 1994, 73-93.

Sims, Christopher A. "Macroeconomics and Reality." Econometrica, January 1980, 1-48.