文章基本信息

标题：The determinants of U.S. state economic growth: a less extreme bounds analysis.
作者：Reed, W. Robert
期刊名称：Economic Inquiry
印刷版ISSN：0095-2583
出版年度：2009
期号：October
语种：English
出版社：Western Economic Association International
关键词：Akaike information criterion;Bayesian information criterion;Economic growth;Economic policy;Employment;Expenditures, Public;Growth curve modeling;Public expenditures;United States economic conditions

The determinants of U.S. state economic growth: a less extreme bounds analysis.

Reed, W. Robert

[Correction added on June 27, 2008, after online publication: Equation (7) was incorrectly stated as AICc = T x ln([SSE/T]) + T x (T+k - 1/T - k -1). The correct statement is AICc = T x ln([SSE/T]) + T x ([T + k/T - k - 2).]

I. INTRODUCTION

It is now well established that economic growth studies reach different conclusions depending on model specification. This has been documented repeatedly in the literature on cross-country growth regressions (1) and in studies of growth in U.S. states. (2) In response, attempts have been made to identify "robust" variables, the "best" model specification, or ways of combining alternative model specifications (e.g., Crain and Lee 1999; Fernandez et al. 2001; Granger and Uhlig 1990; Hendry and Krolzig 2004; Hoover and Perez 2004; Levine and Renelt 1992; Sala-i-Martin 1997; Sala-i-Martin, Doppelhofer, and Miller 2004). While not intended as a substitute for economic theory, these approaches can be useful when the theory is sufficiently broad such that a large number of variables are potential regressors.

This study follows in this line of research by attempting to identify robust determinants of U.S. economic growth from 1970 to 1999. I innovate on previous studies by developing a new approach for addressing "model uncertainty" issues associated with estimating growth equations. My approach borrows from the "extreme bounds analysis" (EBA) approach of Learner (1985) while also addressing concerns raised by Granger and Uhlig (1990), Sala-i-Martin (1997), and others that not all specifications are equally likely to be true. I then apply this approach by sifting through a very large number of explanatory variables in order to find robust determinants of state economic growth. My analysis confirms the importance of productivity characteristics of the labor force and industrial composition of a state's economy. I also find that policy variables such as (1) the size and structure of government and (2) taxation are robust determinants of state economic growth.

The paper proceeds as follows: Section II develops a framework for specification of the empirical growth models. Section III describes the full set of variables used in this study. Section IV presents my approach for identifying robust determinants of economic growth. Section V describes my data and discusses details about the estimation procedure. Section VI presents the empirical results. Section VII concludes.

II. A FRAMEWORK FOR SPECIFICATION OF THE EMPIRICAL GROWTH MODELS

I assume that state income ([Y.sub.t]) is determined by the following generalized Cobb-Douglas production function:

(1) [Y.sub.t] = [A.sub.t][K.sup.[alpha].sub.t][([L.sub.t][Q.sub.t]).sup.[beta]] = [A.sub.t][Q.sup.[beta].sub.t][K.sup.[alpha].sub.t][L.sub.[beta].sub.t],

where [L.sub.t] and [K.sub.t] are labor and capital, [Q.sub.t] is the efficiency of labor, and [A.sub.t] is a time-varying parameter that represents other variables that can influence state income (e.g., human capital variables). The textbook Solow model and the augmented human capital model of Mankiw, Romer, and Weil (1992) are both special cases of Equation (1). (3)

Dividing both sides by population, [N.sub.t], produces the following per capita expression:

(2) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII].

This can be expressed in log form as:

(3) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII],

where [y.sub.t] = [Y.sub.t]/[N.sub.t], [k.sub.t] = [K.sub.t]/[N.sub.t], and [l.sub.t] = [L.sub.t][N.sub.t].

Differentiating Equation (3) with respect to time yields:

(4) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII].

It follows that:

(5) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII],

where [C.sub.t] = [ln([A.sub.t]) - ln([A.sub.t-L])] + [beta][ln([Q.sub.t]) -ln([Q.sub.t-L])] and L = the length of the time period minus 1 (i.e., for a 5-yr period, with t measuring calendar years, L = 4). (4,5)

The preceding analysis identifies changes in capital, employment, and population as important determinants of economic growth. However, the last term, [C.sub.t], is sufficiently general that it allows for a large number of possible explanatory variables. It encompasses many of the models that have been used to estimate U.S. state economic growth (e.g., Garofalo and Yamarik 2002; Holtz-Eakin 1993; Lee and Gordon 2005).

III. POTENTIAL DETERMINANTS OF STATE ECONOMIC GROWTH

Table 1 lists a number of variables that have been suggested in previous studies of economic growth, primarily U.S. state economic growth. (6) The empirical task of this paper consists of identifying which of these should be included in a growth equation along with capital, employment, and population variables.

I group the variables into four major categories: (1) Population/Labor Force characteristics, (2) Economy characteristics, (3) Public Sector characteristics, and (4) Political Control characteristics. Variables included in the Population/Labor Force category include educational attainment, percentage of the population that is working aged (ages between 20 and 64), percentage of the population that is nonwhite or female, and total population. Economy characteristics include population density, degree of urbanization, the relative importance of various industries within the state, percentage of the workforce that is unionized, and a measure of industrial diversity.

Public Sector characteristics are divided into three subcategories: (1) Size and Structure variables, (2) Tax variables, and (3) Expenditure variables. Each of these can be thought of representing a particular component of public policy. Size and Structure variables include the size of the (1) federal and (2) state and local government sectors of the economy, measured by both share of total earnings and employment. Also included are the amount of federal government revenue received by state and local governments; the degree to which expenditures are made at the local, as opposed to the state, level; and the number of governments.

Tax variables include a measure of the overall importance of state and local taxes in the state's economy ("tax burden"), measured as a share of state personal income. Also included are specific types of taxes, such as property, sales, individual income, and corporate income taxes. These tax variables should be interpreted as measuring the net growth effect of increasing taxes to fund general spending. (7)

Expenditure variables measure the compositional effects of state and local government spending. The specific expenditure categories are primary and secondary education, higher education, public health, and highways. Each of the respective expenditure variables is measured as a share of total state and local (direct general) spending.

Finally, Political Control variables measure the influence of political parties. These include how often the Democratic and Republican parties control the state legislature and how often the governor is a Democrat.

These preceding variables attempt to capture the economic influences represented by [C.sub.t] = [ln([A.sub.t]) - ln([A.sub.t]- L)] + [[beta]ln([Q.sub.t]) - ln([Q.sub.t] - L)] in Equation (5). One immediate issue is whether the 32 variables in Table 1 should be entered in (1) level or (2) differenced form. Because economic theory is not sufficiently specific to answer this question, this becomes an empirical issue. (8) Restricting the Political Control variables to be entered in level form, (9) and recognizing that the change in population is already included in the core specification of Equation (5) (i.e., [ln([N.sub.t] - ln([N.sub.t-L])], leads to a total of 60 possible explanatory variables.

There are approximately 1.15 x [10.sup.18] ways to combine 60 variables. Each of these permutations, appending a core set of "free" variables, can be thought of as a single model. Thus, the empirical problem consists of choosing the best model, or set of models, from these 1.15 x 1018 possibilities. One might think that it was computationally unfeasible to estimate so many models. While this is true, there exist algorithms that allow me to circumvent this problem.

IV. A PROCEDURE FOR DETERMINING ROBUST VARIABLES

A. Schwarz Information Criterion and the Corrected Version of the Akaike Information Criterion

The first step in my approach consists of identifying a best specification: I employ two model selection criteria for this purpose: the Schwarz

Information Criterion (SIC) and the corrected version of the Akaike Information Criterion (AICc). While I give a brief description of these criteria, more detailed discussions can be found in McQuarrie and Tsai (1998), Burnham and Anderson (2002), and the references therein.

The SIC and the AICc, respectively, represent two competing schools of thought regarding how to conceptualize the task of selecting the best model. If the researcher believes that the true model is included within the set of candidate models, then a desirable property of a model selection procedure is that it be "consistent." That is, that it selects the true model with probability converging to 1 as the sample size becomes infinitely large. The SIC is by far the most commonly used of the several model selection criteria that possess this property (other consistent criteria include the Hannan and Quinn criterion and the Geweke and Meese criterion).

Alternatively, if the researcher believes that the true model is not included within the set of candidate models, then a desirable property of a model selection procedure is that it be "efficient." That is, that it selects the model that is "closest" to the true model, where closest is defined by some distance or information criterion. A selection procedure is said to be "asymptotically efficient" if it selects the model closest to the true model with probability converging to 1 as the sample size becomes infinitely large.

A number of model selection procedures have been developed that have the property of asymptotic efficiency, including Akaike's Final Prediction Error, Mallow's Cp criterion, and the Akaike Information Criterion (AIC). Of these, the AIC is by far the most widely employed. However, many researchers have noted that the AIC suffers from overfitting in finite samples, incorporating too many variables in its best models. As a result, a number of finite sample corrections have been developed for the AIC. Of these, the most preferred is a version known as AICc (Hurvich and Tsai 1989; Sugiura 1978).

Monte Carlo studies of finite sample performance have demonstrated that both the SIC and the AICc perform well relative to alternative procedures (cf. McQuarrie and Tsai 1998). While there are a number of equivalent formulations, this study uses the following formulae:

(6) SIC = T In (SSE/T) + k ln(T).

(7) AICc = T x ln (SSE/T) + T x (T + k / T - k-2),

where T is the number of observations; k is the number of coefficients in the model, including the intercept; and SSE is the sum of squared residuals from the estimated model. Note that SSE and k are the only parameters that vary across models since sample size and the dependent variable do not change. The SIC and AICc make different tradeoffs between these parameters. Generally, the SIC penalizes additional explanatory variables more severely than the AICc, producing best models with fewer variables.

Conceptually, I need a program that will sort through all 1.15 x [10.sup.18] possible linear combinations of the 60 variables (level plus differenced forms) identified in Table 1 in order to select the best model specification according to each selection criterion. For this task, I use the SELECTION = RSQUARE option within the REG procedure available through SAS. This procedure does not actually estimate all possible regression specifications. Instead, it relies on the "leaps and bounds" algorithm developed by Furnival and Wilson (1974) to identify the specifications with the highest [R.sup.2] values among all possible specifications having the same number of regressors. It is straightforward to use the output generated by this SAS program to calculate a ranked ordering of the M best specifications across all possible variable combinations--for any predetermined value of M--according to either the SIC or the AICc. (10) The corresponding SAS program is easy to implement and remarkably efficient in computational requirements. It required about an hour to run using a standard desktop computer. (11)

B. EBA and Bayesian Model Averaging

My approach uses insights from both EBA (Learner 1985) and "Bayesian model averaging" (BMA; Hoeting et al. 1999). Therefore, it is useful to consider these before proceeding.

EBA is designed to study the sensitivity of coefficient estimates across different regression specifications. For example, suppose a researcher wants to measure the effect of variable [X.sub.1] on variable Y. EBA proceeds by estimating a large number of specifications that include [X.sub.1], calculating the confidence interval for each [[beta].sub.1] estimate. The highest and lowest values over all these confidence intervals define the "extreme" upper and lower bounds. If these bounds do not overlap zero (i.e., if they are same signed), then the variable [X.sub.1] is said to be robust (cf. Crain and Lee 1999; Levine and Renelt 1992).

The main criticism of EBA is that it weights all model specifications equally, so that a divergent coefficient estimate from a poorly specified equation can be sufficient to disqualify a variable as robust. (12) In recognition of this shortcoming, Granger and Uhlig (1990) propose "reasonable extreme bounds analysis," where the range of coefficient values is restricted to the set of specifications that produce [R.sup.2] values within a given [delta] value of the maximum achieved [R.sup.2] across all specifications. However, they do not provide guidance for the choice of [delta] and acknowledge that the use of [R.sup.2] has problems.

BMA directly addresses the "all specifications weighted equally" criticism by developing a system for weighting model specifications based on information criteria. BMA starts by positing a prior distribution for the population value for some parameter of the model specification (usually a regression coefficient). This prior distribution is updated with the results from regression estimates across--theoretically--all possible model specifications to form a posterior distribution of parameter values. The updating procedure weights the corresponding specifications by model probabilities that can be thought of as the conditional probability that a given specification is the "true model." (13)

While the BMA approach is useful for weighting specifications for forecasting purposes, it is problematic when used to weight coefficient estimates. Consider the following example: suppose a researcher is interested in the relationship between dependent variable y and an explanatory variable, [X.sub.1]. Let the true model be given by [Y.sub.t] = [[beta].sub.0] + [summation].sup.K]k = 1 [[beta].sub.k][X.sub.k,t] + [[epsilon].sub.t], t = 1, ..., T, where some [[beta].sub.k] may equal 0 (but not [[beta].sub.1]; and Cov ([X.sub.j], [X.sub.k]) = 0 for all j [not equal to] k. There are [2.sup.K] (possible linear combinations of these variables, and we suppose the researcher considers each combination a potentially true model. Define P([M.sub.j]) as the prior probability that Model j is the true model and let P([M.sub.j]) > 0 for all j.

The BMA approach calculates the posterior probability of each model as:

(8) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII].

where [k.sub.j] and [SSE.sub.j] are the number of included regressors and the sum of squared residuals in Model j. The corresponding (posterior) expected value of [[beta].sub.1] is given by:

(9) E([[beta].sub.1]|y) = [[2.sup.K].summation over (j=1)] P ([M.sub.j]|y)[[??.sub.1,j],

where [[??.sub.1,j] is the estimate of [[beta].sub.1] in Model j.

In each specification in which [X.sub.1] appears, the preceding assumptions ensure that the least squares estimate is unbiased, so that E [[??.sub.1,j] = [beta].sub.1]. However, [X.sub.1] appears in only half of all possible specifications. In the other [2.sup.K-1] models, [X.sub.1] is excluded, and the BMA approach sets [[??].sub.1,j] =0. (14) It follows that E([[beta].sub.1]|y) < [[beta].sub.1] even if [[??].sub.1,j] = [[beta].sub.1] in every specification in which it appears. In other words, the BMA-based expectation is biased toward zero. This follows directly from the fact that BMA "estimates" the value of [[beta].sub.1] to be 0 in all specifications in which [X.sub.1] is not included. (15)

C. A Less Extreme Bounds Analysis

My approach borrows elements from both EBA and BMA. Like EBA, I estimate a set of specifications and report the corresponding ranges of coefficient estimates and t ratios for those specifications including the respective variables. However, like BMA, I use information criteria to restrict the set of model specifications. I follow a procedure developed by Poskitt and Tremayne (PT; 1987) to identify two categories of models: (1) "reasonable" models and (2) others. Only reasonable models are considered for EBA.

PK take as their point of departure that informational criteria such as the SIC and the AICc are themselves sample statistics, so that the model with the lowest SIC or AICc value may not be the best model. They argue that all "close competitors" be included in a portfolio of reasonable models.

Let [I.sup.*] be the value of the information criterion for the best model and [I.sup.A] be the corresponding value for an alternative model. The posterior odds ratio is defined as:

(10) R = exp [- 1/2([I.sup.*] - [I.sup.A])].

Following Jeffreys (1961, p. 143) and Zellner (1977), PK characterize any model with R < [square root of 10] as a "close competitor" to the best model:

"... any ... specification satisfying 0t R < [square root of 10] may be thought of as a close competitor. This intimates that it may be advantageous to extend the usual model building process. It suggests not only that the model minimizing the criterion should be selected, but also that any additional specifications closely competing ... should not be discarded, thereby advancing the general notion of a portfolio of models" (Poskitt and Tremayne 1987, p. 127).

PK go on to present Monte Carlo evidence that model portfolios constructed in this manner behave well in finite samples.

To summarize, my approach constructs separate model portfolios using SIC and AICc selection methods. For each portfolio, I identify robust variables in a manner similar to conventional EBA. In this respect, my approach is similar to reasonable EBA by Granger and Uhlig (1990), except that I use information criteria, not [R.sup.2], to evaluate models, and the set of evaluated models is determined by PK's R < [square root of 10] standard, rather than an arbitary [delta] value.

V. DATA AND FURTHER ESTIMATION ISSUES

My data consist of observations on 46 U.S. states from 1970 to 1999. (16) I decided on this particular time period because a longer time frame would have required me to omit many variables of interest. The respective 30 yr of data were grouped into six 5-yr periods (1970 1974, 1975-1979, ..., 1995-1999). Data for most of these variables were collected from original data sources. (17)

Using over 5-yr rather than annual data offers several advantages: it (1) reduces the impact of "business cycle effects" (Grier and Tullock 1989), (ii) minimizes errors from mis-specifying lag effects, and (iii) reduces time-specification issues. Time-specification issues arise because data can have different start and end periods within a given calendar year. For example, state income data are defined over calendar years; state fiscal data are defined over fiscal years (which are different for different states); and other variables (e.g. employment, population data) may be measured at different points within the year (beginning/ middle/end). In addition, a number of variables (e.g., variables based on decennial Census data) require interpolation in order to get a balanced panel. For all these reasons, the use of 5-yr interval data should entail fewer estimation problems. Following Equation (5), the general specification for the empirical models is: 18.19

(11) DLNY, = [[[beta].sub.0] + [[beta].sub.1] DLNK, + [[beta].sub.2] [DLNL.sub.t],

+ [[[beta].sub.3][DLNN.sub.t]

+ State Fixed Effects

+ Time Fixed Effects]

+ [summation over (1)] [[lambda].sub.l[X.sub.l,t-5]

+ [summation over (d)] [[delta].sub.d([X.sub.d,t] - [X.sub.d,t-1]

+ [summation over (p)] [[pi].sub.p([[??].sub.p,t] + [[epsilon.sub.t]

where t = 1974, 1979, 1984, 1989, 1994, 1999; [DLNY.sub.t], [DLNK.sub.t], [DLNL.sub.t], and [DLNN.sub.t] are the respective difference quantities from Equation (5) multiplied by 100 (to give per cent); [X.sub.l,t_4] is the value of the explanatory variable at the beginning of the 5-yr period ("level" form); ([X.sub.d,t] - [X.sub.d,t-4]) is the change in the explanatory variable over the 5-yr period ("difference" form); and

[[bar.X].sub.p,t] = [[bar.X].sub.p,t-1] + [[bar.X].sub.p,t-2] + [[bar.X].sub.p,t-3] + [[bar.X].sub.p,t-4] + [[bar.X].sub.p,t-5] / 5

is the 5-yr average over the period (t-5 to t-1) for the Political Control variables Democratic Legislature, Republican Legislature, and Democratic Governor. (20)

The [2.sup.60] possible model specifications each include the variables listed in brackets in Equation (11) but allow for alternative configurations of the last three sets of variables ([X.sub.l,t-4], ([X.xub.dt] - [X.sub.d,t-4]), and [[bar.X].sub.p,t]), since the theory is nonspecific about which variables belong in Ct (cf. Equation (5)).

VI. EMPIRICAL RESULTS

A. Robust Determinants of State Economic Growth

Following EBA convention, I identify as robust any variable whose coefficient estimates are all same signed and lie more than two standard deviations away from zero. However, two features of my approach differ from standard EBA analysis: (1) I analyze two "portfolios of models" (one for SIC and one for AICc) and (2) not every variable appears in every specification within a given portfolio. Accordingly, I also require robust variables to appear in at least 50% of the specifications in either portfolio.

The SIC portfolio consists of 27 different models, the Best SIC specification, and 26 "close competitors" as defined by the R < [square root of 10] criterion. (21) The results from analyzing this portfolio of models are reported in Table 2, Panel A. Variables are ranked in descending order of number of appearances within the portfolio. Robust variables are identified with an "R." A total of 18 different variables are analyzed, as shown in Table 2, Panel A. Some, like Education, appear in all 27 models. Others, like State & Local Employees and State & Local Government, appear in only a very few models (though both have high t ratios when they do appear). Not surprisingly, there is a high overlap between (1) the set of variables that appears in at least 50% of the models in the SIC portfolio and (2) the set of variables having a range of t ratios all same signed and larger than 2.0. (22) Table 2, Panel B, reports that 57 models are included in the AICc portfolio. (23) A total of 23 different variables appear in at least one of these models. However, many of these appear in only a few models and some, like Individual Income Tax and Higher Education Spending, appear only once.

Table 3 collects the robust variables from these EBAs and reports them, along with a "mean estimated effect" calculated as the simple average of the respective means from Table 2, Panels A and B. To interpret the respective sizes of these effects, recall that the dependent variable is the 5-yr growth rate in state per capita personal income. For my sample of 30 yr (six 5-yr time periods) and 46 states (yielding 276 observations), the mean growth rate is 27.01%. Thus, a 1 percentage point increase in the 5-yr growth rate equates approximately to a 3.7% increase in growth.

Given the underlying theoretical model of Equation (5), the variables of Table 3 should be related to the term, [C.sub.t] = [ln([A.sub.t]) - ln([A.sub.t - L])] + [[beta][ln([Q.sub.t]) - ln([Q.sub.t-L])]. Since [A.sub.t] and [Q.sub.t] represent production function parameters, theory suggests that these variables affect the rate of invention and adoption of new technologies that transform the production function over time. This includes effects on resource allocation.

Differenced variables are indicated by "D" and represent changes in that variable during the 5-yr period. Level variables are indicated by "L" and represent the value of that variable at the beginning of the 5-yr period. A variable that appears in both differenced and level form has both an immediate and a lagged effect. The differenced form indicates the immediate effect since changes during the 5-yr period impact economic growth during that same period. The level form indicates a lagged effect since changes that get reflected at the beginning of the period show up later, in the subsequent 5-yr growth period.

Table 3 identifies three Population/Labor Force variables as robust determinants of state economic growth: Education, Working Population, and Female. All have the expected signs. Education appears in level form. The mean estimated effect indicates that a 1 percentage point increase in the percentage of the population that is college educated at the beginning of a 5-yr period is associated with a 0.97 percentage point increase in that state's subsequent 5-yr growth rate. This effect is relatively small in economic terms, given that the average 5-yr growth rate is 27.01% .

The differenced form of Working Population is also identified as a robust variable. The corresponding estimated positive effect indicates that a 1 percentage point increase in the share of the population that is aged 20-64 yr during a given 5-yr period is associated with an approximate 0.90 percentage point, contemporaneous increase in economic growth during that period. Of course, one of the variables being held constant in the estimation is employment (specifically, DLNL). Thus, this variable likely reflects higher worker quality within the labor force. Increases in the female share of a state's population (Female) are also estimated to have a contemporaneous, albeit negative impact on economic growth. Again, since employment is being held constant, this may reflect productivity differences between men and women in the labor force.

Table 3 identifies two economy characteristic variables: Agriculture and Mining. The coefficient for Agriculture is positive in both level and differenced forms, indicating that states with larger and growing agricultural sectors (as measured by earnings share) grew faster than other states. The sources of increased agricultural productivity are debated, but lower input prices, public and private research, increased specialization, and changes in farm size have all been identified as contributing factors (cf. Evenson and Huffman 1997). In contrast, the coefficients for Mining, which also appear in both differenced and level forms, are each negative. This is consistent with research that finds that the mining industry contributes negligibly, or even negatively, to aggregate total factor productivity growth (cf. Jorgenson and Stiroh 2000).

Table 3 includes seven Public Sector variables: Federal Government, Federal Employees, Federal Revenue, Decentralization, Tax Burden, Sales Tax, and Corporate Income Tax. The first two variables measure the size of the federal government's presence in a state, measured by earnings share and employment per capita, respectively. The corresponding coefficients for both variables indicate that a larger federal government sector is associated with lower economic growth, ceteris paribus. This may be due to the fact that, relative to the private sector, resources in the public sector are less likely to be allocated to where they will produce income growth (cf. Barro 1990).

The mean estimated effect for the difference form of Federal Government indicates that a 1 percentage point increase in this variable-corresponding to roughly a 15% increase in the size of the federal government sector over a 5-yr period--is associated with a contemporaneous 0.83 percentage point decline in state economic growth. The corresponding estimate for the level form of Federal Employees implies that doubling the number of federal employees per capita would lower the subsequent 5-yr growth rate of that state by 4.48%. While not robust, it is interesting to note that I estimate similar-sized effects for both State & Local Government and State & Local Employees (cf. Table 2, Panels A and B).

The variable Federal Revenue measures the size of federal aid to states. The sample mean of Federal Revenue is 3.90. A 1 percentage point increase in this variable would represent approximately a 25% increase in federal aid. The mean estimated effect reported in Table 3 for this variable indicates that an increase of this size would raise a state's subsequent 5-yr growth rate by 1.16 percentage points.

The variable Decentralization measures the share of total state and local public spending made at the local level. I estimate that a 1 percentage point increase in the share of local control is associated with a contemporaneous decrease of 0.11 percentage points in a state's 5-yr growth rate. Given that the sample mean of Decentralization is 55.0 percentage points, this constitutes a very small effect. It is consistent with the fact that other studies have had difficulty finding significant effects for this variable (cf. Xie, Zou, and Davoodi 1999).

The remaining three variables are tax variables. The negative coefficients for Tax Burden indicate that an increase in state tax revenues as a share of state personal income (i.e., average tax rate) results in lower economic growth. The fact that both level and differenced forms of the variable are identified as robust determinants indicates that the effect of taxes is both immediate and persistent. A 1 percentage point increase in Tax Burden over a 5-yr period is associated with a contemporaneous decrease in state economic growth of 0.63 percentage points. In addition, it is estimated to lower growth by 0.73 percentage points over subsequent 5-yr periods. As a gauge of size, a 1 percentage point increase in Tax Burden equates approximately to a 10% increase in overall taxes.

While not huge, these effects are larger than estimated by previous studies (cf. Wasylenko 1997). First, they imply both an immediate and a long-lived effect of taxes. Second, the estimated effects represent the net effect of taxes and spending. Previous studies, following Helms (1985), commonly estimated "government budget constraint" specifications, so that most categories of public expenditures were held constant. (24) The associated tax estimates did not incorporate the corresponding positive effects related to stimulative spending. In contrast, my specifications do not hold constant the level of public expenditures and thus imply significantly larger negative tax effects.

In contrast, the estimated coefficients for both Sales Tax and Corporate Income Tax are each positive. Note that a 1 percentage point increase in these variables represents approximate increases of 30% and 200%, respectively. The positive effects for these two variables indicate that sales and corporate income taxes are less distortionary than other taxes, such as individual income and property taxes. A further factor may be in play when it comes to business taxes in general and corporate income taxes in particular: Corporate profits may be more likely than other sources of income to be exported outside the state. Taxing corporate profits may serve to channel economic activity within the state, thus contributing to economic growth.

It should be noted that choosing a different interval length can produce different robust variables. When I repeated the analysis using 10-yr intervals, some of the robust variables from Table 3 continued to be chosen, but others were not. (25) This is not particularly surprising, given that previous research has demonstrated that estimates of economic growth equations for U.S. states can differ substantially when the interval length is changed (Reed 2008). Which interval length is most appropriate remains an unsettled research question.

B. Comparison with Crain and Lee (1999)

The only other study that searches for robust determinants of state economic growth is Crain and Lee (1999). (26) Crain and Lee (CL) implement the EBA approach of Levine and Renelt (1992) using annual data on U.S. states from 1977 to 1992. (27) A comparison of their robust variables and my robust variables reveals several differences. (28) Unlike CL, I find that Agriculture and Mining are robust determinants of economic growth and that Diversity and Service are not. Further, their "core variable" for Education is always insignificant, and sometimes negative, whereas I find that Education is robust and positively associated with economic growth.

There are, however, a number of similarities: while we measure it differently, we both find size of government variables to be robust and negatively associated with economic growth. (29) We both find that Decentralization is negatively associated with economic growth. Additionally, I find that Tax Burden is negatively associated with state economic growth, while CL obtain a similar finding using a measure that includes all state and local revenues, not just taxes. (30)

VII. CONCLUSIONS

This study examines the determinants of U.S. state economic growth from 1970 to 1999. It considers a large number of potential explanatory variables, including Population/ Labor Force characteristics, Economy characteristics, Public Sector (Policy) variables, and Political Control variables. Counting both difference and level forms, a total of 60 possible explanatory variables are considered, in addition to the capital, employment, and population variables specified by the theory. This yields a total of [2.sup.60] [congruent to] 1.15 x [10.sup.18] possible linear combinations of variables, each representing a potentially true model.

I devise an approach for sorting through these different model specifications in order to identify robust determinants of state economic growth. My approach is related to the reasonable EBA of Granger and Uhlig (1990). Unlike their study, however, I use information criteria (the SIC and AICc) to choose "portfolios of reasonable models," as suggested by Poskitt and Tremayne (1987). I then perform conventional EBA within these portfolios. An advantage of my approach is that (1) it is a straightforward extension of a standard SAS program, (2) it requires relatively little computational time, and (3) its simplicity assures that different researchers using the same procedure will obtain identical results.

My analysis identifies 12 robust determinants of U.S. economic growth over the 30-yr period from 1970 to 1999. Among these are (1) college attainment within the population, (2) share of the population that is "working age," and (3) population gender share, and the size of the (4) agricultural and (5) mining sectors of the economy. I also find that a relatively large number of public sector variables are significantly correlated with growth. Among these are (6 and 7) the size of the federal sector within a state, (8) federal aid, (9) decentralization, and (10 through 12) various categories of taxes. This latter finding highlights the importance of public policy as a determinant of economic growth. While one must be careful to draw causative inferences from these results, they provide further motivation to identify channels by which public policy directly impacts economic activity.

ABBREVIATIONS

AIC: Akaike Information Criterion

AICc: Corrected Version of Akaike Information Criterion

BMA: Bayesian Model Averaging

EBA: Extreme Bounds Analysis

SIC: Schwarz Information Criterion

doi: 10.1111/j.1465-7295.2008.00127.x

APPENDIX

STATISTICAL SUMMARY OF DATA

Number Name (a) D/L Mean SD

Dependent
 variable DLNY 27.01 10.30
-- DLNK 7.40 7.76
-- DLNL 4.61 4.01
-- DLNN 4.72 4.55
 1 Education D 1.76 0.55
 L 16.39 4.95
 2 Working Population D 0.97 0.92
 L 55.89 3.18
 3 Nonwhite D 0.55 0.52
 L 12.08 8.79
 4 Female D -0.02 0.15
 L 51.24 0.79
 5 Population L 14.93 1.01
 6 Population Density D 5.09 6.77
 L 167.72 234.21
 7 Urban D 0.75 1.15
 L 67.23 14.73
 8 Agriculture D -0.04 2.42
 L 3.12 3.88
 9 Manufacturing D -0.84 1.69
 L 21.03 8.54
10 Service D 1.47 1.27
 L 19.56 5.71
11 Mining D -0.19 0.77
 L 2.21 3.60
12 Union D -1.47 2.39
 L 18.42 8.18
13 Diversity D -0.06 0.78
 L 17.44 2.05
l4 Federal Government D -0.57 0.82
 L 7.02 3.63
 State & Local D -0.10 0.88
 Government L 11.98 1.66
16 Federal Employees D -0.04 0.09
 L 4.70 0.38
17 State & Local D 0.03 0.06
 Employees L 6.20 0.13
18 Federal Revenue D 0.10 0.77
 L 3.90 1.22
19 Decentralization D -0.13 2.54
 L 55.00 7.88
20 Number of Governments D -0.02 0.07
 L 5.90 0.88
21 Tax Burden D 0.12 0.88
 L 10.84 1.37
22 Property Tax D -0.09 0.56
 L 3.51 1.34
23 Sales Tax D -0.03 1.02
 L 3.31 1.18
24 Individual Income Tax D 0.19 0.29
 L 1.65 1.09
25 Corporate Income Tax D 0.01 0.14
 L 0.46 0.25
26 Local Education D -0.67 1.93
 Spending L 25.66 3.06
27 Higher Education D -0.22 1.02
 Spending L 10.34 2.69
28 Health & Hospital D 0.25 1.13
 Spending L 8.14 2.88
29 Highway Spending D -1.21 1.74
 L 10.96 4.04
30 Democratic Legislature -- 55.0 46.2
31 Republican Legislature -- 24.9 39.0
32 Democratic Governor -- 55.94 40.91

Number Name (a) D/L Minimum Maximum

Dependent
 variable DLNY 6.49 64.40
-- DLNK -26.92 55.43
-- DLNL -7.22 14.98
-- DLNN -8.63 21.45
 1 Education D 0.34 3.21
 L 6.66 30.21
 2 Working Population D -1.22 2.93
 L 47.54 62.26
 3 Nonwhite D -0.98 2.42
 L 0.36 37.35
 4 Female D -0.57 0.75
 L 48.77 52.76
 5 Population L 12.72 17.27
 6 Population Density D -8.44 37.26
 L 3.44 1089.83
 7 Urban D -1.97 3.96
 L 32.16 93.54
 8 Agriculture D -16.72 18.85
 L -8.92 29.06
 9 Manufacturing D -6.09 3.37
 L 3.73 40.49
10 Service D -3.22 6.40
 L 10.93 41.55
11 Mining D -3.29 4.27
 L 0.02 24.98
12 Union D -10.60 5.00
 L 3.30 41.70
13 Diversity D -5.42 4.66
 L 13.84 23.56
l4 Federal Government D -5.98 1.25
 L 2.05 23.45
 State & Local D -3.98 5.02
 Government L 8.47 18.40
16 Federal Employees D -0.67 0.37
 L 3.99 5.93
17 State & Local D -0.13 0.19
 Employees L 5.86 6.66
18 Federal Revenue D -1.74 2.50
 L 1.67 8.31
19 Decentralization D -10.37 6.35
 L 34.81 76.80
20 Number of Governments D -0.36 0.40
 L 4.26 8.40
21 Tax Burden D -5.52 5.91
 L 7.92 19.27
22 Property Tax D -2.97 3.21
 L 1.09 8.23
23 Sales Tax D -3.55 2.92
 L 0.69 6.92
24 Individual Income Tax D -0.73 1.82
 L 0 4.23
25 Corporate Income Tax D -0.50 0.81
 L 0 1.18
26 Local Education D -7.51 4.38
 Spending L 18.34 35.37
27 Higher Education D -4.65 2.89
 Spending L 4.22 18.45
28 Health & Hospital D -3.44 4.00
 Spending L 2.46 18.37
29 Highway Spending D -9.24 3.11
 L 4.27 25.59
30 Democratic Legislature -- 0 100
31 Republican Legislature -- 0 100
32 Democratic Governor -- 0 100

(a) The numbered variables are described in Table 1. DLNY is
derived from BEA Personal Income Data. DLNK measures the change
in net private capital stock and comes from Steve Yamarik
(Garofalo and Yamarik 2002). DLNL is computed from BEA data on
the total number of full- and part-time jobs. DLNN comes from
midyear population estimates provided by the Census.

REFERENCES

Barro, R. J. "Government Spending in a Simple Model of Endogenous Growth." Journal of Political Economy, 98, 1990, S103-25.

Bartik, T. J. Who Benefits from State and Local Economic Development Policies? Kalamazoo, MI: W.E. Upjohn Institute for Employment Research, 1991.

Burnham, K. P., and D. R. Anderson. Model Selection and Multimodel Inference. 2nd ed. New York: Springer-Verlag, Inc., 2002.

Crain, W. M., and K. J. Lee. "Economic Growth Regressions for the American States: A Sensitivity Analysis." Economic Inquiry, 37, 1999, 242 57.

Evenson, R. E., and W. B. Huffman. "Long-Run Structural and Productivity Change in U.S. Agriculture:

Effects of Prices and Policies." Mimeo, Yale University, Economic Growth Center, Center Discussion Paper No. 773, 1997.

Fernandez, C., E. Ley, and M. F. J. Steel. "Model Uncertainty in Cross-Country Growth Regressions." Journal of Applied Econometrics, 16, 2001, 563-76.

Furnival, G. M., and R. W. J. Wilson. "Regression by Leaps and Bounds." Technometrics, 16, 1974, 499-511.

Garofalo, G. A., and S. Yamarik. "Regional Convergence: Evidence from a New State-by-State Capital Stock Series." The Review of Economics and Statistics, 84, 2002, 315 23.

Gilligan, T. W., and J. G. Matsusaka. "Deviations from Constituent Interests: The Role of Legislative Structure and Political Parties in the States." Economic Inquiry, 33, 1995, 383401.

Granger, C. W. J., and H. F. Uhlig. "Reasonable Extreme-Bounds Analysis." Journal of Econometrics, 44, 1990, 159-70.

Grier, K. B., and G. Tullock. "An Empirical Analysis of Cross-national Economic Growth, 1951-1980." Journal of Monetary Economics, 24, 1989, 169-328.

Helms, J. L. "The Effect of State and Local Taxes on Economic Growth: A Time Series-Cross Section Approach." The Review of Economics and Statistics, 67, 1985, 574-82.

Hendry, D. F., and H.-M. Krolzig. +'We Ran One Regression." Oxford Bulletin of Economics and Statistics, 66, 2004, 799 810.

Higgins, M. J., A. T. Young, and D. Levy. Forthcoming. "Robust Correlates of County-Level Growth in the U.S." Applied Economics Letters, 2008.

Hirsch, B. T., D. A. MacPherson, and W. G. Vroman. "Estimates of Union Density by State." Monthly Labor Review, 124, 2001, 51-55.

Hoeting, J. A., D. Madigan, A. E. Raftery, and C. T. Volinsky. "Bayesian Model Averaging: A Tutorial." Statistical Science, 14, 1999, 382-17.

Holtz-Eakin, D. "Solow and the States: Capital Accumulation, Productivity, and Economic Growth." National Tax Journal, 46, 1993, 425-40.

Hoover, K. D., and S. J. Perez. "Truth and Robustness in Cross-Country Growth Regressions." Oxford Bulletin of Economics and Statistics, 66, 2004, 765 98.

Hurvich, C. M., and C. L. Tsai. "Regression and Time Series Model Selection in Small Samples." Biometrika, 76, 1989, 297-307.

Jeffreys, H. The Theory of Probability. 3rd ed. Oxford: Clarendon Press, 1961.

Jorgenson, D. W., and K. J. Stiroh. "Raising the Speed Limit: U.S. Economic Growth in the Information Age." Brookings Papers on Economic Activity, 1, 2000, 125-211.

Kocherlakota, N. R., and K.-M. Yi. "Is There Endogenous Long-Run Growth? Evidence from the United States and the United Kingdom." Journal of Money. Credit and Banking, 29, 1997, 235 62.

Leamer, E. E. "Sensitivity Analyses Would Help." American Economic Review, 75, 1985, 308-13.

Lee, Y., and R. H. Gordon. "Tax Structure and Economic Growth." Journal of Public Economics, 89, 2005, 1027-43.

Levine, R., and D. Renelt. "A Sensitivity Analysis of Cross-Country Growth Regressions." American Economic Review, 82, 1992, 942 63.

Mankiw, N. G., D. Romer, and D. N. Weil. The Quarterly Journal of Economics, 107, 1992, 407-37.

McGuire, T. "Review of 'Who Benefits From State and Local Economic Development Policies." National Tax Journal, 45, 1992, 457-9.

McQuarrie, A. D. R., and C.-L. Tsai. Regression and Time Series Model Selection. Singapore: World Scientific Publishing Co. Pte. Ltd., 1998.

Mendoza, E. G., G. M. Milesi-Ferretti, and P. Asea. "On the Ineffectiveness of Tax Policy in Altering Long-Run Growth: Harberger's Superneutrality Conjecture." Journal of Public Economics, 66, 1997, 99-126.

Miller, S. M., and F. S. Russek. "Fiscal Structures and Economic Growth: International Evidence." Economic Inquiry, 35, 1997, 603 13.

Phillips, J., and E. Goss. "The Effect of State and Local Taxes on Economic Development: A Meta-Analysis." Southern Economic Journal 62, 1995, 320-33.

Poskitt, D. S., and A. R. Tremayne. "Determining a Portfolio of Time Series Models." Biometrika, 74, 1987, 125-37.

Poterba, J. M. "State Responses to Fiscal Crises: The Effects of Budgetary Institutions and Politics." Journal o]" Political Economy, 102, 1994, 799-821.

Reed, W. R. "Democrats, Republicans, and Taxes: Evidence that Political Parties Matter." Journal o[Public Economics, 9, 2006, 725-50.

--. Forthcoming. "The Robust Relationship between Taxes and U.S. State Income Growth." National Tax Journal, 61, 2008, 57-80.

Sala-i-Martin, X. X. "I Just Ran Two Million Regressions." American Economic Review, 87, 1997, 178-83.

Sala-i-Martin, X., G. Doppelhofer, and R. I. Miller. "Determinants of Long-Term Growth: A Bayesian Averaging of Classical Estimates (BACE) Approach." American Economic Review, 94, 2004, 813-35.

Sugiura, N. "Further Analysis of the Data By Akaike's Information Criterion and the Finite Corrections." Communications in Statistic--Theory and Methods, 7, 1978, 13 26.

Wasylenko, M. "Taxation and Economic Development: The State of the Economic Literature." New England Economic Review, March/April 1997, 37 52.

Xie, D., H.-fu Zou, and H. Davoodi. "Fiscal Decentralization and Economic Growth in the United States." Journal o[ Urban Economics, 45, 1999, 228-39.

Zellner, A. "Comments on Time Series Analysis and Causal Concepts in Business Cycle Research," in New Methods in Business Cycle Research, edited by C. A. Sims. Minneapolis: Federal Reserve Bank of Minneapolis, 1977, 167-73.

(1.) Studies that have examined the robustness of coefficient estimates in the context of cross-country growth regressions include Levine and Renelt (1992); Sala-i-Martin (1997); Fernandez et al. (2001): Hendry and Krolzig (2004); Sala-i-Martin, Doppelhofer, and Miller (2004); and Hoover and Perez (2004).

(2.) The following studies have highlighted the phenomenon of wide-ranging coefficient estimates across empirical specifications: Bartik 199l, McGuire 1992, Phillips and Goss 1995, Wasylenko 1997, and Crain and Lee 1999.

(3.) The textbook Solow model is [Y.sub.t] = [K.sup.[alpha].sub.t][([L.sub.t],[Q.sub.t]).sup.1-[alpha]] = [Q.sup.1-[alpha].sub.t][K.sup.[alpha].sub.t][L.sup.1- [alpha].sub.t]. Mankiw, Romer, and Weil's augmented version of the Solow model is [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII].

(4.) In the subsequent empirical work, the difference in log values is multiplied by 100.

(5.) An alternative specification solves for the steady-state value of y as a function of state parameters and then introduces convergence through the inclusion of a lagged value of the dependent variable. This both (1) imposes additional restrictions on the model and (2) raises econometric issues of inconsistency from using both fixed effects and the lagged dependent variable as explanatory variables. Nevertheless, the approach of this paper is readily applied to selecting control variables for this, and other, specifications.

(6.) For a detailed listing of studies that use these variables, see Table 1 in the expanded version of this paper available at http://ideas.repec.org/p/cbt/econwp/06-05.html.

(7.) Ideally, I would have liked to measure these latter tax variables as shares of total tax revenues. This would have been most appropriate for investigating the compositional effects of the tax burden. Unfortunately, sales and income tax data are not separately reported for local governments, so that the shares of the respective tax subcategories do not sum to 1.

(8.) The inclusion of level variables is consistent with an endogenous growth model with scale effects. Kocherlakota and Yi (1997) find support for such a model. Examples of other studies that have included variables in both differenced and level form are Mendoza, Milesi-Ferretti, and Asea (1997); Lee and Gordon (2005); and Miller and Russek (1997).

(9.) Unlike the other variables in Table 1, the Political Control variables represent the average number of years in which a political party is in control during the respective 5-yr period. There is no analog to 5-yr differences that would correspond to the 5-yr differences for the other variables in Table 1.

(10.) The general principle of the "leaps and bounds algorithm" can be illustrated in the context of a "regression tree": Consider the case of five "doubtful" variables, [X.sub.1] through [X.sub.5]. At the top of the regression tree are models with only one regressor. At the bottom of the tree are models with more variables. Suppose the [R.sup.2] from the model having only one regressor, [X.sub.1], is larger than the [R.sup.2] from a model with the four regressors [X.sub.2] through [X.sub.5]. In this case, the model with the highest [R.sup.2] must lie on the "node" below [X.sub.1]. This eliminates the necessity of estimating large portions of the regression tree, which greatly reduces the computational burden. Further details are given in Furnival and Wilson (1974). SAS uses this algorithm and sorts the best [R.sup.2] models within subsets of specifications having the same number of regressors. I calculate SIC and AICc values within these subsets--noting that highest [R.sup.2] equates with lowest SIC/AICc values when the number of regressors is held constant--and then globally rank the best specifications across all subsets.

(11.) A copy of the SAS program used in this analysis is available from the author.

(12.) For example, suppose the true model is [Y.sub.t] = [[alpha].sub.0] + [[alpha].sub.1] [Z.sub.t] + [[beta].sub.1] [X.sub.1t] + [[beta].sub.2] [X.sub.2t] + [[epsilon].sub.t]. Suppose further that [X.sub.1t], and [X.sub.2t] are both positively correlated and that [[beta].sub.1] > 0 and [[beta].sub.0] < 0. Last, suppose we now estimate the following three equations: (i) Y, = [Y.sub.t] = [[alpha].sub.0] + [[alpha].sub.1][Z.sub.t] + [[beta].sub.1] [X.sub.1t] + [[beta].sub.2] [X.sub.2t] + [[epsilon].sub.t], (ii) [Y.sub.t] = [[alpha].sub.0] + [[alpha].sub.1] [Z.sub.t] + [[beta].sub.1] [X.sub.1t] + [[epsilon].sub.t], and (iii) [Y.sub.t] = [[alpha].sub.0] + [[alpha].sub.1] [Z.sub.t] + [[beta].sub.2] [X.sub.2t] + [[epsilon].sub.t]. It is possible that [[beta].sub.1] and [[beta].sub.2] could both be significant in Equation (i) but insignificant in Equations (ii) and (iii). EBA would classify these variables as not being robust, despite the fact that both variables are in the true model. The reason for this anomaly is that the latter two equations are "bad" specifications. EBA gives equal weight to "good" and "bad" specifications.

(13.) It is a conditional probability because the probabilities are calculated over the set of "included" model specifications.

(14.) Compare Equations (8) and (9) with Equations (7) and (8) in Sala-i-Martin, Doppelhofer, and Miller (2004, p. 817) and note that in that context, they write, "... any variable excluded from a particular model has a slope coefficient with a degenerate posterior distribution at zero."

(15.) There are other problems with using the BMA approach. First, the results are sensitive to assumptions about the prior parameter distribution. For example, in order to implement their version of BMA known as Bayesian Averaging of Classical Estimates, Sala-i-Martin, Doppelhofer, and Miller (2004) must first specify an "expected model size." While they claim that their final results are robust across different assumptions about this parameter, they acknowledge that this is not true in all cases: some of the variables that are "significant" under a given assumed expected model size become "insignificant" under a different assumed expected model size and vice versa. Second, there are important computational issues. BMA does not actually estimate all possible specifications. Instead, it uses sampling procedures (e.g., Markov chain Monte Carlo procedures, of which the Gibbs sampler is the best known) to estimate the "probability" that a given specification is the true one. There is no standard sampling algorithm, which raises the possibility that the results will be idiosyncratic to the program used by the individual researcher. Finally, the weighting probabilities are derived from Bayesian statistical foundations and are closely related to the SIC criterion defined above. As we shall see below, alternative criteria, such as the AICc, produce different results.

(16.) Alaska and Hawaii were omitted, as is usual for studies of U.S. state economic growth. Nebraska and Minnesota were also eliminated because the variables Democratic Legislature and Republican Legislature could not be constructed for these two states over the full-time period: in Nebraska, state representatives do not formally affiliate with political parties, whereas Minnesota had a unicameral state legislature through 1970.

(17.) The Appendix presents statistical descriptions of all the variables used in this study.

(18.) Note that because (l) the dependent variable is expressed in logs and (2) the annual price deflator is only available for the nation as a whole, and not for individual states, inflationary effects are captured by the time period dummies. Thus, there is no need to convert the dependent variable to real values.

(19.) In the estimated specification of Equation (6), I do not impose the restriction that [[beta].sub.3] = ([[beta].sub.1] + [[beta].sub.2] - 1) for two reasons. First, population growth could also be a factor included in [C.sub.t], which, if true, would invalidate the restriction. Second, as a practical matter, this restriction is consistently rejected below the 1% significance level in all of the top model specifications.

(20.) This last adjustment is made to account for the fact that it takes at least a year for political representation to get translated into legislation (cf. Gilligan and Matsusaka 1995; Poterba 1994; Reed 2006L and it is the latter that is assumed to matter for economic growth.

(21.) The best variable specification according to the SIC is DLNK, DLNL, DLNN, State + Time Fixed Effects, Education-L, Working Population-D, Female-D, Agriculture-D, Agriculture-L, Service-L, Mining-D, Mining-L, Federal Government-D, Federal Employees-L, Federal Revenue-L, Decentralization-D, Tax Burden-D, Tax Burden-L, Sales Tax-L, Corporate Income Tax-L.

(22.) t Statistics are calculated in the classic fashion.

(23.) The best variable specification according to the AICc is DLNK, DLNL, DLNN, State + Time Fixed Effects, Education-D, Education-L, Working Population-D, Female-D, Population-L, Agriculture-D, Agriculture-L, Mining-D, Mining-L, Federal Government-D, Federal Employees-L, Federal Revenue-L, Decentralization-D, Tax Burden-D, Tax Burden-L, Sales Tax-L. Corporate Income Tax-L, Democratic Legislature.

(24.) Following Helms (1985), most studies use welfare transfers as the omitted expenditure category, so that estimated tax effects measure the impact of tax-financed welfare expenditures.

(25.) The robust variables selected from the 10-yr interval analysis, along with the sign of their mean estimated effects, are Decentralization-L(+), Democratic Governor(+), Diversity-D(-), Education-D(+), Education-L(+), Female-D(), Female-L(+), Federal Employees-L(), Federal Government-L(+), Federal Revenue-D(-), Highway Spending-D(+), Individual Income Tax-D(+), Manufacturing-L(), Nonwhite-L(+), Number of Governments-D(+), Number of Governments-L(+), Population-L(+), State & Local Employees-D(+), Tax Burden-D(-), and Working Population-D(+). I thank an anonymous referee for suggesting that this analysis also be applied to 10-yr interval data.

(26.) Higgins, Young, and Levy (2008) also search for robust determinants of U.S. income growth, but they conduct their analysis at the county level.

(27.) Their sample employs data from 48 states, versus the 46 states used in my study. Further, while there is much overlap, our sets of variables differ both in kind (e.g., I include Political Control variables, they include Pressure Groups variables) and in form (e.g., their government expenditure variable is expressed as a share of state income, my government expenditure variables are broken down by categories and expressed as shares of total government expenditures).

(28.) This discussion is based on a comparison of their table II and my Table 3.

(29.) CL measure size of government by the combined earnings of local, state, and federal government. I have separate variables for the federal government (Federal Government, Federal Employees) and for state government (State & Local Government, State & Local Employees). My federal measures are both robust with estimated negative impact, while the state and local variables are consistently negative but not always statistically significant and hence not robust.

(30.) There are other differences in our studies, but these likely stem from the fact that we use different variable sets. For example, I find that Female is a robust determinant of economic growth, whereas CL do not include this variable. CL include a measure for political Pressure Groups ("Bus Assoc Revenue Share of Income") that I do not. CL include a measure of state and local expenditures as a share of state income ("Expenditure Share of Income"). In contrast, I do not include a separate variable for the size of state and local expenditures since I want Tax Burden to pick up the net effect of tax-financed expenditures. Last, I include separate components of state tax revenues (e.g., sales taxes, corporate income taxes) and a measure of federal aid to the states (Federal Revenue), while CL do not.

W. ROBERT REED, I thank Hua Chen, Elisabeth Reed, and Ying Xiao for able research assistance. I am grateful to Chris Chatfield, David Merriman, Cindy Rogers, Steven Yamarik, and participants at the 2006 NZAE conference for providing helpful comments on an earlier draft of this paper. In addition, special thanks go to Steven Yamarik for generously supplying state-level capital stock data from Garofalo and Yamarik (2002).

Reed: Professor, Department of Economics, University of Canterbury, Private Bag 4800, Christchurch, 8140 New Zealand. Phone +64 3 364 2846, Fax +64 3 364 2635, E-mail bobreednz@yahoo.com

TABLE 1 List of Potential Determinants of U.S. State Economic
Growth (a)

Number Name Description

 1 Education Percentage of population (aged
 25 and above) who have completed
 college (Source: Census)

 2 Working Population Percentage of population between
 20 and 64 yr of age
 (Source: Census)

 3 Nonwhite Percentage of population that is
 nonwhite (Source: Census)

 4 Female Percentage of population that is
 female (Source: Census)

 5 Population Log of total population
 (Source: Census)

 6 Population Density Population density
 (Source: Census)

 7 Urban Percentage of population living
 in urban areas (Source: Census)

 8 Agriculture Share of total earnings earned
 in "Farm" and "Other Agriculture"
 industries (Source: BEA)

 9 Manufacturing Share of total earnings earned
 in "Manufacturing" industries
 (Source: BEA)

10 Service Share of total earnings earned
 in "Service" industries
 (Source: BEA)

11 Mining Share of total earnings earned
 in "Mining" industries
 (Source: BEA)

12 Union Percentage of nonagricultural
 wage and salary employees who
 are union members (Source:
 Hirsch, MacPherson, and Vroman
 2001)

13 Diversity A measure of industrial
 diversity, Diversity = [summation
 over (i)] [(Earnings in
 [industry.sub.I]/Total
 Earnings).sup.2] (Source: BEA)

14 Federal Government Share of total earnings earned
 in "Federal government"
 (Source: BEA)

15 State & Local Government Share of total earnings earned
 in "State and Local government"
 (Source: BEA)

16 Federal Employees Log of federal employees per
 capita (Source: Census)

17 State & Local Employees Log of state and local employees
 per capita (Source: Census)

18 Federal Revenue Intergovernmental revenue
 received by state and local
 governments from the federal
 government as a share of personal
 income (Source: Census)

19 Decentralization Share of total state and local
 direct general expenditures made
 by local governments (Source:
 Census)

20 Number of Governments Number of state and local
 governments (Source: Census)

21 Tax Burden Total state and local tax
 revenues as a share of personal
 income (Source: Census)

22 Property Tax Total state and local property
 tax revenues as a share of
 personal income (Source: Census)

23 Sales Tax Total state sales tax revenues
 as a share of personal income
 (Source: Census)

24 Individual Income Tax Total state individual income
 tax revenues as a share of
 personal income (Source: Census)

25 Corporate Income Tax Total state corporate income tax
 revenues as a share of personal
 income (Source: Census)

26 Local Education Spending Total state and local spending
 on local schools as a share of
 total state and local
 expenditures (Source: Census)

27 Higher Education Spending Total state and local spending
 on higher education as a share
 of total state and local
 expenditures (Source: Census)

28 Health & Hospital Spending Total state and local spending
 on health and hospitals as a
 share of total state and local
 expenditures (Source: Census)

29 Highway Spending Total state and local direct
 spending on highways as a share
 of total state and local
 expenditures (Source: Census)

30 Democratic Legislature Percentage of years that both
 houses of the state legislature
 were controlled by Democrats
 (Source: National Conference
 of State Legislatures)

31 Republican Legislature Percentage of years that both
 houses of the state legislature
 were controlled by Republicans
 (Source: National Conference of
 State Legislatures)

32 Democratic Governor Percentage of years that governor
 was a Democrat (Source: National
 Conference of State Legislatures)

BEA, U.S. Bureau of Economic Analysis.

(a) Descriptive statistics for all variables are
reported in the Appendix.

TABLE 2
EBA for Portfolio of Top SIC and AICc Models

Number
 (%) Robust Variable D/L

A. SIC models
 27 (100) R Education (1) L
 27 (100) R Female (4) D
 27 (100) R Agriculture (8) D
 27 (100) R Agriculture (8) L
 27 (100) R Mining (11) D
 27 (100) R Federal Government (14) D
 27 (100) R Federal Employees (16) L
 27 (100) R Federal Revenue (18) L
 27 (100) R Sales Tax (23) L
 27 (100) R Corporate Income L
 Tax (24)
 24 (89) -- Working Population (2) D
 24 (89) R Mining (11) L
 24 (89) R Tax Burden (21) D
 24 (89) R Tax Burden (21) L
 14 (52) R Decentralization (19) D
 11 (41) -- Population (5) L
 11 (41) -- Service (10) L
 8 (30) -- Education (1) D
 7 (26) -- State & Local D
 Government (15)
 7 (26) -- Decentralization (19) L
 6 (22) -- Democratic --
 Legislature (30)
 3 (11) -- Health & Hospital D
 Spending (28)
 2 (7) -- State & Local L
 Employees (17)
 1 (4) -- State & Local L
 Government (15)

B. AICc model
 57 (100) R Education (1) L
 57 (100) R Working Population (2) D
 57 (100) R Female (4) D
 57 (100) R Agriculture (8) D
 57 (100) R Agriculture (8) L
 57 (100) R Mining (11) D
 57 (100) -- Mining (11) L
 57 (100) R Federal Government (14) D
 57 (100) R Federal Employees (16) L
 57 (100) R Federal Revenue (18) L
 57 (100) R Tax Burden (21) D
 57 (100) R Tax Burden (21) L
 57 (100) R Sales Tax (23) L
 56 (98) -- Population (5) L
 55 (96) -- Decentralization (19) D
 52 (91) R Corporate Income L
 Tax (25)
 37 (65) -- Democratic --
 Legislature (30)
 34 (60) -- Education (1) D
 34 (60) -- Service (10) L
 26 (46) -- State & Local D
 Government (15)
 17 (30) -- Diversity (13) D
 17 (30) -- Health & Hospital D
 Spending (28)
 13 (23) -- Manufacturing (9) D
 13 (23) -- Union (12) L
 8 (14) -- Diversity (13) L
 5 (9) -- Corporate Income D
 Tax (25)
 4 (7) -- Union (12) D
 3 (5) -- Democratic Governor (32) --
 2 (4) -- Decentralization (19) L
 1 (2) -- Female (4) L
 1 (2) -- Service (10) D
 1 (2) -- Individual Income L
 Tax (24)
 1 (2) -- Higher Education D
 Spending (27)
 1 (2) -- Health & Hospital L
 Spending (28)

 Range of Coefficient
 Estimates

Number
(%) Robust Variable Low Mean High

A. SIC models
 27 (100) R Education (1) 0.7100 0.9477 1.0932
 27 (100) R Female (4) -6.9309 -6.1086 -5.4376
 27 (100) R Agriculture (8) 0.6360 0.7127 0.7693
 27 (100) R Agriculture (8) 0.2440 0.3071 0.3582
 27 (100) R Mining (11) -1.2974 -1.1395 -0.9005
 27 (100) R Federal Government (14) -1.0244 -0.8805 -0.7497
 27 (100) R Federal Employees (16) -6.1410 -5.0072 -3.6200
 27 (100) R Federal Revenue (18) 0.9085 1.1573 1.3387
 27 (100) R Sales Tax (23) 0.9990 1.1636 1.2911
 27 (100) R Corporate Income 2.2712 2.5939 3.3821
 Tax (24)
 24 (89) -- Working Population (2) 0.6515 0.9055 1.1352
 24 (89) R Mining (11) -0.7136 -0.4670 -0.3440
 24 (89) R Tax Burden (21) -0.8223 -0.6639 -0.5053
 24 (89) R Tax Burden (21) -0.9129 -0.7467 -0.6449
 14 (52) R Decentralization (19) -0.1321 -0.1112 -0.1012
 11 (41) -- Population (5) 3.2958 3.9428 4.5846
 11 (41) -- Service (10) -0.4862 -0.3454 -0.2757
 8 (30) -- Education (1) 1.1661 1.4657 1.7840
 7 (26) -- State & Local -0.9016 -0.6049 -0.4388
 Government (15)
 7 (26) -- Decentralization (19) 0.1020 0.1285 0.1422
 6 (22) -- Democratic 0.0095 0.0110 0.0128
 Legislature (30)
 3 (11) -- Health & Hospital 0.1894 0.2226 0.2588
 Spending (28)
 2 (7) -- State & Local -7.2722 -7.1917 -7.1113
 Employees (17)
 1 (4) -- State & Local -0.6068 -0.6068 -0.6068
 Government (15)
B. AICc model
 57 (100) R Education (1) 0.8354 0.9912 1.1365
 57 (100) R Working Population (2) 0.6760 0.8864 1.0386
 57 (100) R Female (4) -6.7496 -5.6811 -4.4992
 57 (100) R Agriculture (8) 0.4968 0.6754 0.7602
 57 (100) R Agriculture (8) 0.1783 0.2590 0.3071
 57 (100) R Mining (11) -1.3193 -1.1687 -1.0655
 57 (100) -- Mining (11) -0.5221 -0.4173 -0.2859
 57 (100) R Federal Government (14) -0.9425 -0.7731 -0.6711
 57 (100) R Federal Employees (16) -5.7203 -3.9538 -3.4240
 57 (100) R Federal Revenue (18) 0.9514 1.1672 1.3310
 57 (100) R Tax Burden (21) -0.7492 -0.6030 -0.4535
 57 (100) R Tax Burden (21) -0.8272 -0.7222 -0.6386
 57 (100) R Sales Tax (23) 0.9668 1.0802 1.1669
 56 (98) -- Population (5) 3.1220 4.0412 5.0292
 55 (96) -- Decentralization (19) -0.1284 -0.1093 -0.0959
 52 (91) R Corporate Income 2.0656 2.4037 2.9069
 Tax (25)
 37 (65) -- Democratic 0.0067 0.0103 0.0127
 Legislature (30)
 34 (60) -- Education (1) 0.9103 1.2891 1.4924
 34 (60) -- Service (10) -0.3761 -0.2798 -0.1997
 26 (46) -- State & Local -0.6745 -0.4750 -0.3118
 Government (15)
 17 (30) -- Diversity (13) 0.1820 0.3384 0.4565
 17 (30) -- Health & Hospital 0.1544 0.1830 0.2297
 Spending (28)
 13 (23) -- Manufacturing (9) -0.3544 -0.2642 -0.2211
 13 (23) -- Union (12) -0.1085 -0.0887 -0.0571
 8 (14) -- Diversity (13) -0.3253 -0.2776 -0.2003
 5 (9) -- Corporate Income -2.6637 -2.4655 -2.2580
 Tax (25)
 4 (7) -- Union (12) 0.0667 0.0745 0.0853
 3 (5) -- Democratic Governor (32) 0.0039 0.0046 0.0049
 2 (4) -- Decentralization (19) 0.1065 0.1105 0.1146
 1 (2) -- Female (4) 0.8031 0.8031 0.8031
 1 (2) -- Service (10) -0.2989 -0.2989 -0.2989
 1 (2) -- Individual Income 0.5450 0.5450 0.5450
 Tax (24)
 1 (2) -- Higher Education -0.1359 -0.1359 -0.1359
 Spending (27)
 1 (2) -- Health & Hospital 0.1557 0.1557 0.1557
 Spending (28)

 Range of t Ratios

Number
(%) Robust Variable Low Mean High

A. SIC models
 27 (100) R Education (1) 5.16 6.38 6.85
 27 (100) R Female (4) 4.10 4.61 5.30
 27 (100) R Agriculture (8) 6.21 7.57 8.25
 27 (100) R Agriculture (8) 3.48 4.57 5.37
 27 (100) R Mining (11) 4.07 4.72 5.38
 27 (100) R Federal Government (14) 3.39 4.00 4.74
 27 (100) R Federal Employees (16) 2.31 3.48 4.46
 27 (100) R Federal Revenue (18) 3.24 4.02 4.52
 27 (100) R Sales Tax (23) 3.58 4.11 4.59
 27 (100) R Corporate Income 2.56 2.91 3.79
 Tax (24)
 24 (89) -- Working Population (2) 1.97 2.78 3.37
 24 (89) R Mining (11) 2.10 2.79 4.18
 24 (89) R Tax Burden (21) 2.56 3.51 4.55
 24 (89) R Tax Burden (21) 2.98 3.48 4.48
 14 (52) R Decentralization (19) 2.07 2.27 2.68
 11 (41) -- Population (5) 1.90 2.27 2.64
 11 (41) -- Service (10) 1.96 2.53 3.54
 8 (30) -- Education (1) 1.84 2.31 2.84
 7 (26) -- State & Local 1.75 2.40 3.39
 Government (15)
 7 (26) -- Decentralization (19) 1.90 2.44 2.71
 6 (22) -- Democratic 1.88 2.16 2.53
 Legislature (30)
 3 (11) -- Health & Hospital 1.63 1.91 2.23
 Spending (28)
 2 (7) -- State & Local 2.53 2.54 2.56
 Employees (17)
 1 (4) -- State & Local 2.50 2.50 2.50
 Government (15)
B. AICc model
 57 (100) R Education (1) 5.74 6.42 7.06
 57 (100) R Working Population (2) 2.04 2.70 3.23
 57 (100) R Female (4) 3.26 4.21 5.13
 57 (100) R Agriculture (8) 4.02 7.01 8.21
 57 (100) R Agriculture (8) 2.40 3.80 4.58
 57 (100) R Mining (11) 4.32 4.79 5.27
 57 (100) -- Mining (11) 1.74 2.46 3.03
 57 (100) R Federal Government (14) 3.02 3.51 4.15
 57 (100) R Federal Employees (16) 2.21 2.54 4.21
 57 (100) R Federal Revenue (18) 3.37 4.08 4.61
 57 (100) R Tax Burden (21) 2.32 3.16 3.98
 57 (100) R Tax Burden (21) 2.96 3.37 3.88
 57 (100) R Sales Tax (23) 3.36 3.81 4.13
 56 (98) -- Population (5) 1.81 2.31 2.83
 55 (96) -- Decentralization (19) 1.96 2.25 2.64
 52 (91) R Corporate Income 2.31 2.72 3.26
 Tax (25)
 37 (65) -- Democratic 1.33 2.02 2.52
 Legislature (30)
 34 (60) -- Education (1) 1.43 2.02 2.33
 34 (60) -- Service (10) 1.44 2.03 2.62
 26 (46) -- State & Local 1.23 1.87 2.56
 Government (15)
 17 (30) -- Diversity (13) 1.14 1.93 2.48
 17 (30) -- Health & Hospital 1.34 1.58 1.87
 Spending (28)
 13 (23) -- Manufacturing (9) 1.72 1.96 2.36
 13 (23) -- Union (12) 1.05 1.60 1.94
 8 (14) -- Diversity (13) 1.25 1.74 2.05
 5 (9) -- Corporate Income 2.51 2.72 2.92
 Tax (25)
 4 (7) -- Union (12) 1.17 1.31 1.51
 3 (5) -- Democratic Governor (32) 1.19 1.42 1.54
 2 (4) -- Decentralization (19) 1.99 2.06 2.14
 1 (2) -- Female (4) 1.09 1.09 1.09
 1 (2) -- Service (10) 1.47 1.47 1.47
 1 (2) -- Individual Income 1.39 1.39 1.39
 Tax (24)
 1 (2) -- Higher Education 1.16 1.16 1.16
 Spending (27)
 1 (2) -- Health & Hospital 1.41 1.41 1.41
 Spending (28)

Note: The criteria for determining which variables are robust
are described in Section IV.

TABLE 3
Robust Variables and Mean Estimated Effects

 Variable
 Mean
 Estimated
Category Number Name Effect

Population/Labor 1 Education-L 0.97
 Force 2 Working Population-D 0.90
 characteristics 4 Female-D -5.89
Economy 8 Agriculture-D 0.69
 characteristics 8 Agriculture-L 0.28
 11 Mining-D -1.15
 11 Mining-L -0.44
Public Sector 14 Federal Government-D -0.83
 (Policy) 16 Federal Employees-L -4.48
 variables 18 Federal Revenue-L 1.16
 19 Decentralization-D -0.11
 21 Tax Burden-D -0.63
 21 Tax Burden-L -0.73
 23 Sales Tax-L 1.12
 24 Corporate Income Tax-L 1.39

Note: Mean estimated effect is the simple average of the "mean"
coefficient estimates in Table 2, Panels A and B.