The gravity model: an illustration of structural estimation as calibration.
Balistreri, Edward J. ; Hillberry, Russell H.
Dawkins, Srinivasan, and Whalley ("Calibration," Handbook
of Econometrics, 2001) propose that estimation is calibration. We
illustrate their point by examining a leading econometric application in
the study of international and interregional trade by Anderson and van
Wincoop ("Gravity with Gravitas: A Solution to the Border
Puzzle," American Economic Review, 2003). We replicate the
econometric process and show it to be a calibration of a general
equilibrium model. Our approach offers unique insights into structural
estimation, and we highlight the importance of traditional calibration
considerations when one uses econometric techniques to calibrate a model
for comparative policy analysis. (JEL F10, C13, C60)
I. INTRODUCTION
Contemporary economic analysis includes two broad traditions of
fitting models to data. Many estimate stochastic, theory-based, reduced
forms with few parameters, while others calibrate models by an extensive
collection, and computation, of consistent fitted values. Although the
first technique is called estimation and the second calibration, these
exercises are identical under consistent identifying assumptions. Both
calibration and estimation fit a model to data.
Dawkins, Srinivasan, and Whalley (2001) make this point succinctly:
"Calibration is estimation, estimation is calibration." The
point is widely recognized in the macroeconomic real business cycle
literature (Hoover 1995). Our purpose is to demonstrate that it is
equally relevant to micro-based general equilibrium models. In our view,
there is too little communication between calibrators and estimators of
such models and the lack of communication impedes research.
It is important to clearly delineate the processes of data fitting
and the subsequent model analyses. In analysis of fitted models, the
degree of concentration on counterfactual simulation versus hypothesis
testing around specific parameters often cleaves with our respective
notions of calibration and estimation, but counter examples are easily
found. Sensitivity analysis found in computable general equilibrium (CGE) studies can be specifically designed to generate higher order
moments to facilitate hypothesis testing. (1) Many econometric studies
are specifically focused on model identification for the purpose of
counterfactual simulation. (2)
A more informative distinction might be drawn between testing
models and calibrating models. Hoover (1995) places this issue at the
heart of the macroeconomic debate surrounding real business cycle
models. We see a direct extension of the macroeconomic debate into any
empirical methodology that involves general equilibrium systems. In the
testing paradigm, stochastic measures of fit provide a critically
important benchmark for evaluating alternative structural assumptions or
analytical results derived from a particular set of assumptions. When
the objective is to provide a quantitative context for counterfactual
analysis, traditional measures of fit are little more than indicators of
parsimony. One example of the macroeconomic debate spilling over to the
empirical literature is forwarded by Learner and Levinsohn (1995) who
advise us to "Estimate, don't test." The point being that
falsifiable theories are informative--conceptually and empirically.
In the context of estimating, as opposed to testing, some
researchers find it preferable to exhaust the degrees of freedom
available in the data by expanding the set of structural parameters.
This results in a calibrated model that perfectly fits the benchmark
observation but requires external information on how it will respond to
shocks. Model responses are usually summarized in a set of elasticity
assumptions. Econometric estimation does not exhaust the degrees of
freedom and may focus on internal measurements of response parameters.
There also seems to be convergence in the empirical international
trade literature; some calibrators of traditional general equilibrium
simulation models attempt to identify response parameters (e.g., Liu,
Arndt, and Hertel 2004 or Francois 2001), and some gravity model estimations include numerous dummy variables to isolate cross-sectional
fixed effects (e.g., Hummels 2001 or Feenstra 2004 [Chapter 5]).
Hillberry et al. (2005) contend that idiosyncratic calibration
parameters in traditional CGE applications operate in a way that is
similar to econometric residuals; calibration parameters allow the model
to fit the data exactly. Another way to think about it is that
idiosyncratic calibration parameters in CGE applications are analogous
to fixed effects in an econometric model that includes a fixed effect
for each observation.
The model proposed and applied by Anderson and van Wincoop (2003)
(henceforth AvW) is exceptionally well suited for communicating with
both estimators and calibrators. In their two-country application, AvW
fit a structural gravity model to observed trade patterns among Canadian
provinces and U.S. states. It is our contention that estimating and
calibrating the AvW model are equivalent methods for fitting the model
to data. The model is, at its core, a general equilibrium that can be
fitted with standard calibration techniques. One of these techniques is
the econometric procedure proposed by AvW. Our illustration that the
econometric procedure is a calibration entails a structural
interpretation of the parametric estimates and an itemization of those
remaining structural parameters that are implied by the estimable model's identifying assumptions.
We show that the AvW calibration is consistent with a broad class
of first-order approximations of the general equilibrium system that are
equally efficient statistically (as measured by the sum of squared
residuals on observed trade flows). Within this class of models, there
is ambiguity regarding the size of cross-border trade frictions and thus
ambiguity regarding the welfare effects of trade frictions. Our approach
offers unique insights into structural estimation, and we highlight the
importance of traditional calibration considerations when one uses
econometric techniques to calibrate a model for comparative policy
analysis.
We propose alternative estimations of the core general equilibrium:
one that eliminates a bias in the structurally consistent fitted trade
flows and one that yields an R-squared of one. Like the AvW estimation,
these procedures are incomplete--they cannot identify the trade costs or
price response parameters without ad hoc identifying assumptions (or
additional data). We conclude that the theoretic gravity model in
question is useful as a descriptive tool, but its ability to match trade
flows closely in calibration is not novel. Richer theories of trade
(dependent on traditional comparative advantage or scale and variety
effects) are routinely fit--exactly--to observed trade flows. (3)
It is not our intent to critique the theory behind the gravity
model. We also do not advocate any particular approach to calibration.
Our contribution is a bridge between the methods and the language used
by modelers who calibrate systems regularly and those who use
econometric methods.
In the following section, we outline the general theoretic
framework for the illustrative example and also outline a general
calibration strategy in terms of identifying the necessary parameters.
In Section III, we recast the AvW econometric method as a calibration,
showing that it generates a complete set of structural parameters under
particular identifying assumptions. A comparison of AvW's
restrictive case and the general set of equally efficient, or superior,
approximations are examined in Section IV. Concluding remarks are
offered in Section V.
II. FOUNDATION FOR THE ILLUSTRATIVE EXAMPLE
A. The General Equilibrium Model
Following AvW, consider a model in which representative consumers
in each region have preferences over goods differentiated by region of
origin. Goods are aggregated to a single region-specific variety. Each
region is given an endowment of its variety, which it trades for foreign
varieties. Ad valorem trade frictions induce substitution into varieties
that are from home and nearby regions.
We formulate the multiregion competitive exchange equilibrium as a
mixed complementarity problem (MCP). The MCP is essentially the
Mathiesen (1985) formulation of an Arrow-Debreu general equilibrium
(also see Rutherford 1995b for an introduction to MCPs). The key
advantage of using the Mathiesen formulation here is that it represents
the entire multiregion general equilibrium compactly, as a set of 4n
conditions in 4n unknowns (where n is the number of regions). (4)
Income for a region, [Y.sub.i], is given by the product of its
exogenous endowment, [q.sub.i], and the net (of trade cost or f.o.b.)
price of output, [P.sub.i]:
(1) [Y.sub.i] = [P.sub.i][q.sub.i].
The second set of conditions require market clearing for each
region-specific product such that the quantity endowed equals the sum of
demands by each region j:
(2) [q.sub.i] = [summation over j][X.sub.ij](P, [Y.sub.j],
[t.sub.j], [y.sub.j]).
Demand for commodity i by region j ([X.sub.ij]) is a function of P,
the vector of f.o.b, prices across the regions; [Y.sub.j], the income
level in region j; [t.sub.j], the vector of trade cost wedges faced by
region j; and [[gamma].sub.j], a vector of structural parameters that
identify the preferences of region j.
Consistent with expenditure minimization in each region, the third
set of equilibrium conditions set the true-cost-of-living index,
[E.sub.i], equal to the unit expenditure function:
(3) [E.sub.i] = [e.sub.i](P, [t.sub.i], [[gamma].sub.i]),
where the utility function is assumed to be linearly homogeneous.
The final set of conditions require balance between the nominal value of
utility and income:
(4) [U.sub.i][E.sub.i] = [Y.sub.i].
Together, Conditions (1) through (4) are a complete multiregion
general equilibrium that can be solved numerically for relative prices
([P.sub.i] and [E.sub.i]), regional welfare levels ([U.sub.i]), and
income levels ([Y.sub.i]). (5) As an artifact of the equilibrium, we can
recover the individual bilateral trade flows using the demand functions
evaluated at the equilibrium:
(5) [X.sup.*.sub.ij] = ([P.sup.*], [Y.sup.*.sub.j], [t.sub.j],
[[gamma].sub.j]).
A convenient property of the equilibrium defined by Equations (1)
through (4) is that it is fully identified by endowments, trade
frictions, and an explicit representation of the unit expenditure
functions for each region. (6) Rarely are equilibrium systems dependent
on such a modest set of unknown parameters.
B. Identifying the Functions
As a starting point for identifying the expenditure functions,
first consider a set of consistent utility functions for each region:
(6) [U.sub.j] = [U.sub.j]([X.sub.j]/[t.sub.j], [[gamma].sub.j]).
Utility in a region is a function of the vector of f.o.b, imports
to that region scaled by the trade cost factors ([X.sub.j]/[t.sub.j]);
this operationalizes the formulation of iceberg transportation costs.
(7) The other argument in the utility function is the vector of
structural parameters, [[gamma].sub.j].
AvW (2003) assume that the utility function takes on a constant
elasticity of substitution (CES) functional form. The CES form is
popular because of its curvature properties and empirical tractability.
Typically, a CES representation of Equation (6) would include a
distribution parameter ([[alpha].sub.ij], which indicates the preference
of consumer j for goods from region i) and a substitution elasticity
([[sigma].sub.j], which indicates how responsive consumer j is to
relative price changes):
(7) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII].
Notice that we designate an element that will be informed by the
data with a hat (e.g., [??]); these are estimates.
Rutherford (1995a) introduced a more convenient form of the CES
function, the calibrated share form, which describes the function in
terms of benchmark (fitted) values:
(8) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII],
where the benchmark value shares are given by
(9) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII].
The calibrated share form is a simple algebraic transformation that
decomposes [[??].sub.ij]. It is important to notice our use of notation
in the terms that enter Equations (8) and (9). The superscript 0
indicates the level a variable takes at the benchmark equilibrium. For
example, the [[??].sup.0.sub.ij] are the estimated reference export
quantities at the benchmark equilibrium. In our empirical equations, (8)
and (9), and those that follow, we are careful to distinguish the
estimated parameters ([[??].sup.0.sub.i], [[??].sup.0.sub.ij], and
[[??].sup.0.sub.ij]), the model variables ([P.sub.i] and [X.sub.ij]),
and the exogenous instruments ([t.sub.ij]).
The calibrated share form is useful in that it allows us to
separately classify the parameters in terms of their role in the order
of approximation. Following tradition, we will refer to the
[[??].sub.ij] as first-order parameters and the [[??].sub.j] as
second-order parameters. (8) Adopting the separable CES form is
consistent with a restricted second-order approximation to the true
function because all the second-order curvature parameters (the
cross-elasticities of substitution) are equal and invariant. (9) An
exercise, such as AvW's estimation, which identifies the
[[??].sub.ij] but not the [[??].sub.j] gives us a first-order
approximation to the functions. (10)
The unit expenditure functions, under the CES approximation, are
given by
(10) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII].
The corresponding demand functions are
(11) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII].
In computational applications, the calibrated share forms of the
expenditure and demand functions are preferable because they provide a
simple parameter and functional check that is independent of
second-order curvature.
With the expenditure function specified in Equation (10), and
provided estimates of each parameter, the general equilibrium is
operational as a comparative static or simulation tool. We simply
substitute Equations (10) and (11) into Equations (3) and (2),
respectively. Table 1 lists all the symbolic elements of the general
equilibrium. Again, the equilibrium consists of 4n variables. There are
n + [n.sup.2] exogenous instruments that characterize endowments and
trade frictions. Parameter requirements include 3n + [2n.sup.2]
primitives that represent the fitted benchmark and an additional n
substitution elasticities. (11) Many of these parameters are routinely
eliminated or identified by assumption. In the following section, we
show how adopting AvW's simplifying assumptions and least squares
procedure identifies the elements of [??].
III. ECONOMETRIC CALIBRATION AND GENERAL EQUILIBRIUM SIMULATION
This section illustrates how the estimation made by AvW (2003) can
be interpreted as a calibration of the general equilibrium. First, we
outline the assumptions and least squares procedure necessary to arrive
at a full set of benchmark calibration parameters from the data. The
data include trade flows among U.S. states and Canadian provinces, state
and province incomes, and bilateral distances. (12) Second, once the
parameters are established, we bring them to the original model to
complete the calibration procedure. We purposefully return to the
extensive form represented in Conditions (1)-(4). The final step is to
perform the counterfactual of interest: the integration of the U.S. and
Canadian economies by removing the effect of the border.
A. Parametric Identification
Conditions (1)-(4) are useful as an empirical tool only after the
elements of [??] are estimated using data. What follows is an
illustration of how one might arrive at a set of parameters consistent
with the set derived by AvW, but from the perspective of a calibration
exercise. The data are limited, so some of the elements must be
identified directly through parametric or structural assumptions.
As a first step in reducing data requirements, we identify some
parameters by simply making the benchmark price normalization explicit.
Estimates of income at the benchmark, [[??].sup.0.sub.i], and exchange
rates are widely published so it seems logical to adopt the convention
that common nominal units equal real units at the benchmark. This allows
us to set [q.sub.i] = [[??].sup.0.sub.i] for each region. Income and,
therefore, endowment quantities are in units of U.S. dollars (at the
benchmark). (13) Given our choice of units and by the first equilibrium
condition [[??].sup.0.sub.i] = 1 for all i, which is a convenient
normalization. Any other convention for measuring real units can be
adopted without loss of generality. We remove the hat on income because
these are assumed to be primary data rather than estimates (i.e.,
[[??].sup.0.sub.i] = [Y.sup.0.sub.i]). (14)
Another convenient simplification is to assume symmetric trade
costs, that is, [[??].sup.0.sub.ij] = [[??].sup.0.sub.ji]. This
assumption decreases the number of parameters that need to be
identified. More importantly, for AvW, it enables them to arrive at a
relatively simple benchmark reduced form. (15) Additionally, the trade
cost is assumed to follow a log-linear form such that,
(12) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII].
The estimated border cost factor, represented by
[[??].sup.0.sub.ij], equals one plus the tariff equivalent. The observed
bilateral distance is given by [d.ij], and [??] is the elasticity of the
total cost factor with respect to distance. Although convenient for
AvW's theory, the form of Equation (12) is another ad hoc
approximation. (16) Notice that Equation (12) indicates that absolute
trade costs will be affected by the units in which distance is measured,
which is potentially problematic. The indeterminacy is resolved by
choosing an ad hoc normalization.
A key identifying, and restrictive, assumption of the econometric
calibration is one of identical tastes across regions. (17) This
significantly reduces the number of parameters needed. AvW used this
assumption to derive a reduced form that is free of share parameters.
Essentially, under identical tastes, one can use the information that in
a frictionless world, each region j would consume its income share of
each regional commodity i. The estimates of the distribution parameters
in the CES functions are, thus, replaced by information on local
(benchmark) income shares.
With the simplifications established (and assuming a common
elasticity across all regions, [[??].sub.i] = [sigma], AvW derived the
reduced form for the local unit expenditure functions,
(13) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII];
and the local nominal demand functions
(14) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII].
Note that given our benchmark price normalization, the nominal flow
([[??].sup.0.sub.i][[??].sup.0.sub.ij]) equals the real flow
([[??].sup.0.sub.ij]) at the benchmark. Thus, we have simplified
Equation (14) by removing [[??].sup.0.sub.i] from the left-hand side.
With some slight rearranging and in log form, Equation (14) becomes the
structural gravity equation presented by AvW:
(15) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII],
where k is a regression constant that should approximate -ln
[[summation].sub.m] [Y.sup.0.sub.m]. The dummy variable [a.sub.ij] is
one if the shipment from i to j does not involve a border. The estimate
[[??.sup.0] - 1 is interpreted as the ad valorem tariff equivalent of
the border cost (where [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN
ASCII]).
Direct measures of some of the trade flows ([X.sup.0.sub.st]) are
observed. The regional pair (s, t) is a member of the set of potential
bilateral pairs, {(s, t)|s [member of] I, t [member of] j}. (18) The
[X.sup.0.sub.st] are assumed to be primary data but rampant with
well-behaved measurement error. To reconcile Equation (15) with these
observations, AvW define an objective function that is the sum of
squared errors between the direct observations and the model prediction.
Calibration involves the minimization of this objective subject to the
reduced form of the unit expenditure functions, Equation (13). The
nonlinear programming problem is
[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]
subject to:
[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]
The coefficient [[beta].sub.1] equals (1 - [sigma])[??] and the
coefficient [[beta].sub.2] equals (1 - [sigma])ln [[??].sup.0].
Estimation requires observations on distance for each (i, j) pair and
the construction of the border dummy for each pair. Given the set of
assumptions necessary to arrive at the reduced form and the solution
values for 131, 132, and the [[??].sup.0(1 - [sigma]).sub.j] from the
programming problem, we can recover all the elements of [??].
Comparative static experiments can then be simulated via the equilibrium
conditions.
A key insight our calibration perspective offers is the absence of
variables (the [Y.sub.i], [E.sub.i], and [X.sub.ij]) in the reduced form
model, characterized in Equations (13) and (14). Rather, only the
estimated levels of these variables at the benchmark equilibrium appear
(the [Y.sup.0.sub.i], [E.sup.0.sub.i], and [[??].sup.0.sub.ij]). Once
the nonlinear program (NLP) is solved, the first moments of these
estimates are exogenous, and they enter the system of equations as
parameters. The estimation (calibration) procedure brings the data to
the economic model and is, therefore, devoid of endogenous elements. The
reduced system, Equations (13) and (14), does not constitute an
operational general equilibrium model, despite being locally consistent
with the benchmark general equilibrium.
AvW (2003) conducted comparative static calculations using the
reduced system, Equations (13) and (14). Despite sharing common data and
parameters, these calculations are not consistent with our general
equilibrium results--for two reasons. First, AvW treat regional incomes
as exogenous in the trade equation. (19) While endowment quantities are
appropriately held fixed in a counterfactual exercise, the purchasing
power of the endowment will change as border costs are removed. Border
removal induces significant terms of trade effects. AvW miss these
effects when they control country size in the counterfactual using
measured incomes (AvW 2003, 183). Our extensive form method (which
distinguishes the parameter, benchmark income, from the endogenous
variable, income) accommodates the direct impact income changes have on
trade flows.
Second, the system of Equations (13) and (14) is only locally
consistent with the proposed general equilibrium (Balistreri 2006).
AvW's normalization (AvW 2003, Footnote 12) determines the
geographic pattern of preferences conditional on the observed
[Y.sup.0.sub.i], the estimated [[??].sup.0.sub.i], and an arbitrary
choice of endowment units. Adopting Equation (14) in a counterfactual
implicitly changes either preferences or the unit of measure. Balistreri
(2006) shows that if preferences are held fixed (and incomes calculated
correctly), AvW's measure of trade flows will deviate from the
theory consistent measure by an endogenous scalar (which might be
interpreted as a shift in the numeraire). (20) In contrast to AvW's
method, our system requires the choice of a numeraire, and we hold this
numeraire fixed across the counterfactual experiment.
B. Completing the Calibration
Although all the information has been processed and each
identifying assumption itemized, calibration is only complete once the
data have been brought back to the extensive form theory. Our goal is to
identify each of the parameters listed in Table 1. Following AvW's
adoption of [sigma] = 5, the benchmark price indexes are recovered from
the direct measures of [[??].sup.0(1-[sigma]).sub.j] as they are
estimated in the least squares problem. (21) The [[??].sup.0.sub.ij] are
re. covered using Equation (12) and the estimated distance and border
coefficients. Following Balistreri and Hillberry (2006), distance is
normalized to minimize the resources devoted to melt, subject to no
negative trade costs (the minimum estimated [[??].sup.0.sub.ij]. is
normalized to one). The scale on distance has no relative implications,
but it does change the level of the estimated benchmark price indexes
and measures of resources devoted to trade costs. (22)
The elements on the right-hand side of Equation (14) are now
determined, so we can use Equation (14) to recover the benchmark flows,
[[??].sup.0.sub.ij]. Notice that we cannot use Equation (15) to recover
the [[??].sup.0.sub.ij] if k is inconsistent with observed incomes. If k
is not consistent with the theory, then the sum of exports will not add
up to the total endowment. (23) Finally, we recover the [[??].sub.ij]
from Equation (9) and the scale parameter [[phi].sub.j] =
[[??].sub.j]/[[??].sup.0.sub.j]. All the items in the parameter vector,
[??], presented in Table 1 are thus identified. The system is calibrated
and available for counterfactual analysis.
C. Counterfactual Results
Counterfactual simulations are computed by evaluating the system
with different endowments or trade costs. (24) Specifically, we compute
the general equilibrium with border frictions eliminated, such that
[t.sub.ij] = [d.sup.[??].sub.ij]. This simulates integration of the
Canadian and U.S. economies. Table 2 presents the numeric benchmark
equilibrium and percent changes in each of the variables using
AvW's central elasticity ([sigma] = 5), AvW's coefficient
estimates ([[beta].sub.1] = -0.79 and [[beta].sub.2] = -1.65), and
choosing Alabama output as the numeraire. Utility is numeraire
independent, but changes in income and the prices depend explicitly on
our arbitrary choice of numeraire (the model is homogeneous degree zero
in prices).
To analyze the effect of border removal on international versus
intranational trade, we recover the disposition of each region's
output ([q.sub.i]) using the demand equations. The endowed quantity for
each region is either consumed in the United States, consumed in Canada,
or it melts in transit. Table 3 presents these results.
IV. IMPLICATIONS FOR THE CALIBRATED MODEL AND ALTERNATIVE
CALIBRATIONS
In the context of the econometric application, we highlight three
lessons illuminated by our calibration perspective. First, structural
integrity must be maintained throughout the calibration. We show that a
lack of discipline concerning the regression intercept generates a bias
in the structural fitted flows and taints the structural interpretation
of other regression coefficients. Second, we take a partial step toward
traditional calibration techniques by operationalizing AvW's
proposed inclusion of home bias in tastes. This consumes at least one
degree of freedom (outside the regression analysis) and generates an
unidentified first-order approximation. Information on trade costs must
be added for an operational system. Our third point is that if some
estimates of trade costs are required (because the regression does not
identify these), the econometric method uses the available information
inefficiently. A more efficient calibration is achieved using the
traditional calibration method to achieve an exact fit on observed trade
flows.
A. Calibration Bias and Stochastic Efficiency
Equations (14) and (15) highlight a key difference between the
economic model and the econometric procedure proposed by AvW. The
economic theory motivating the regression equation imposes a restriction
on the regression constant: it should equal -ln [[summation].sub.i]
[Y.sup.0.sub.i]. Failure to impose this restriction biases econometric
estimates relative to the theory, which requires equality between the
value of aggregate demand and aggregate income. The bias manifests
itself in the different implied benchmark trade flows depending on
whether we compute them using Equation (14) and Equation (15). The
theory cannot accommodate a free intercept, k, without violating a key
adding-up requirement. (25)
Figure 1 illustrates the tradeoffs between calibration bias and
stochastic efficiency. We generate the log difference between the fitted
and the actual trade flows for three sets of fitted flows. (26) Columns
1 through 3 show the error distributions for the econometric procedure,
a calibration based on estimates generated by the econometric exercise,
and a theory consistent estimation/calibration, respectively. (27)
The figure demonstrates two points. First, structurally consistent
estimation can reduce econometric efficiency when theory imposes
restrictions on the parameter estimates. The distribution of errors
under structurally consistent estimation (Column 3) is larger than in
econometric estimation (Column 1). It is notable, however, that the
econometric efficiency loss that occurs when the regression constant is
restricted to its theory consistent value is small.
[FIGURE 1 OMITTED]
Second, parameter estimates taken from estimation procedures
inconsistent with the theoretic structure of an economic model generate
bias in subsequent calibrations that use the parameter estimates as
inputs. The distribution of error terms in the implied calibration
(Column 2) is identical to that of the econometric estimation, it is
simply shifted higher. By allowing the estimation model to overstate
aggregate income, the econometric procedure understates the trade costs
necessary to fit trade flows within the model. A calibration based on
trade costs from the estimating procedure substantially overstates
interregional trade.
In contrast, the error terms from theory consistent estimation (a
constrained regression constant) are replicated in calibration. (28) The
error distribution in Column 3 is identical, whether one uses a
structurally consistent estimation procedure or a calibration using
structural parameters generated by structurally consistent estimation.
It is for this reason that we advocate estimation procedures that are
fully consistent when the goal is parameterizing a model for subsequent
counterfactual analysis. We also emphasize that structural estimation is
not a new procedure, it is simply calibration renamed.
B. Systematic Taste Bias
AvW cautioned that we should not accept their estimates of
[[??].sup.0.sub.i] in their literal interpretation as consumer price
indexes (AvW 2003, p. 176 and Footnote 17) because measured trade
resistance, attributed to borders and distance, need not indicate a
price markup. (29) AvW proposed home bias in preferences as a source of
trade resistance. We adopt AvW's suggestion of structural taste
bias, which is consistent with their regression analysis. In this
context, the first-order calibration is underidentified. The ambiguity
in the system is only resolved through a direct measure of the trade
frictions that identifies these frictions independent of geographic
regularities in the CES distribution parameters.
To illustrate the importance of our assumption about taste bias,
consider a new benchmark parameter, [??] [member of] [0, 1], that
represents the proportion of measured border resistance that is not due
to taste bias. (30) The only modifications are to our definition of the
reference distortions. Equation (12) becomes
(16) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII].
At the benchmark, the instruments must be set to their estimated
local values ([t.sub.ij] = [[??].sup.0.sub.ij]), and as before, the
counterfactual involves a computation of the equilibrium under
[t.sub.ij] = [[d.sup.[??].sub.ij]
Using Equation (16) to compute tlae benchmark distortions fully
operationalizes the structural taste bias suggested by AvW (2003). The
advantages of the calibrated share form should be noted: no
modifications are required in our functional representation of the
equilibrium (the benchmark value shares are unchanged and there is no
need to change the scaling). The benchmark distortions and value of the
instruments change, but the calibrated share form allows us to directly
input these changes to the first-order information without the tedious
recalibration of the scale-dependent distribution parameters in the
standard form represented in Equation (8).
It is apparent from Equation (16) that [??] (or equivalently, a
direct measure of the border cost) will determine the magnitude of the
price changes embodied in the counterfactual, and [sigma] determines the
responses to those price changes. As a method of second-order
approximation, the AvW econometric method is silent on the two most
important pieces of information for welfare analysis ([??] and [sigma]).
Welfare analysis depends critically on these parametric assumptions,
which cannot be informed by the proposed regression. (31)
We illustrate the importance of good measures of [??] in Figure 2.
If one assumes no taste bias, the welfare effects of border removal are
maximized conditional on a given [sigma]. At the values preferred by AvW
(2002), of [sigma] = 5 and [??] = 1 (no taste bias), the tariff
equivalent of the border charge is 51%. On the other extreme, if we
simply measure duties collected divided by trade flows, the ad valorem
rate in 1993 was less than 0.4%. (32) Using this rate as the border
charge (and [sigma] = 5), the proportion of border resistance attributed
to taste bias is 99% (marked in the figure as Measured Tariffs). Using
measured tariffs, the welfare impact on Canada of removing the border is
only 0.1%. Including nontariff barriers in the calculated border charge
will likely place [??] in the middle of its range. Our point is that
freeing [??] implies that trade flow data cannot be used in the gravity
framework to infer trade costs.
Thus, we contend that the econometric calibration outlined in
Section III does not complete even the first-order approximation to the
general equilibrium because it fails to quantify key first-order
information on the benchmark trade costs (or equivalently, it fails to
quantify the taste bias). The reduced form estimation is consistent with
the broad class of models covered on the taste-bias interval, [??]
[member of] [0, 1]. All these calibrations are equally as efficient as
the AvW procedure, when measured by the sum of squared residuals.
C. Improved Stochastic Efficiency in a Generalized Calibration
Once one admits that taste bias must be considered--in order to
allow the model to better approximate a world with production
specialization and nontraded goods, for example--there is no particular
need to add the restrictions imposed by the stochastic form of the
regression. The regression no longer uniquely identifies the
[[??].sup.0.sub.ij]. If the taste parameters are free to be asymmetric,
then we can fit the trade flows exactly. That is, we use the observed
trade flows rather than the fitted trade flows to compute the structural
parameters. Effectively, this reallocates the assumed cross-sectional
error terms in the econometric model to an idiosyncratic taste bias.
From an econometric perspective, this is akin to adding pairwise fixed
effects. With the restriction of identical tastes (or structural taste
bias) lifted, there is at least one taste parameter for each observed
trade flow. With no degrees of freedom, it is trivial to calibrate the
system to a consistent equilibrium that produces an R-squared of one for
the observed trade flows.
[FIGURE 2 OMITTED]
To fit the data more efficiently, we simply compute the value
shares based on the observed trade flows:
(17) [[??].sub.st] = [X.sup.0.sub.st]/[Y.sup.0.sub.t].
We make the nontrivial assumption that the trade flows contain no
measurement error (as we did with income). This directly accommodates
the observed pattern of trade. The cost of this direct approach is that
summary measures of how trade reacts to distance and borders, on
average, are not directly reported. The benefit of this approach is that
the benchmark replicates the (first order) observations.
The first-order calibration is not complete, however, without trade
flows for the unobserved pairs and a measure of the trade costs. For
simplicity and comparability with the econometric calibrations, we
assume that trade costs are those implied by the regression analysis
(with [??] = 1). Thus, we use the regression coefficients as a source of
descriptive information on average costs. (33) Obviously, it would be
more appropriate to find information on each of the bilateral trade
costs, however, using the regression coefficients to compute the
[[??].sup.0.sub.ij] offers a consistent point of departure to compare
the more efficient calibration with the results presented above.
The unobserved trade flows are problematic because there might be
any number of combinations that are consistent with an equilibrium that
includes the observed flows (satisfying the R-squared equals one
criteria). To solve this problem, we again appeal to the descriptive
properties of the original regression. Using the regression coefficients
as given, we minimize the sum of squared deviations between the fitted
value of the unobserved flows and the right-hand side of Equation (14),
choosing values for the unobserved pairs and subject to the adding-up
constraints. The programming problem is given by
[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]
subject to:
[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]. (NLP2)
Any number of modifications might be added to this programming
problem to accommodate additional information. For example, some of the
unobserved pairs might be restricted to zero flow if we believe that
there is actually no trade (this is no longer inconsistent with the
theory because some of the idiosyncratic share parameters might be
zero). Once the fitted values from the programming problem are used to
compute the remaining value shares, the calibration is complete.
Using the implied trade costs from the original model (a 51% tariff
equivalent), Table 4 shows a comparison between the welfare impacts of
border removal from the econometric calibration and the welfare impacts
of border removal from the direct calibration. The econometric
calibration, based on AvW's proposed econometric model, has
significant implications on the size of the welfare impact on Canada and
masks many of the interprovincial distributional impacts of border
removal.
As one might expect, the dispersion in welfare effects is greatly
reduced in the econometric calibration because the model is calibrated
to the fitted (or average) flows. When we calibrate to the observed
trade flows, a rich story about the pattern of trade emerges. For
example, the simulated impact on Quebec's welfare of removing the
U.S.-Canada border drops from 29% to 19% when we use the actual data.
The differences are directly attributable to errors in the fitted
values implied by the econometric calibration. In the fitted values, the
proportion of Quebec's Gross Domestic Product (GDP) that is
exported across the border is 25.9%, but in the actual observations, the
proportion of Quebec's GDP that is exported across the border is
15.7%. Ontario in contrast has fitted border flows that are almost
identical to the observed flows (30.3% vs. 30.4%). Therefore, the
alternative approaches to calibration produce comparable welfare impacts
for Ontario but not for Quebec. The general pattern in Table 4 is an
overstatement of welfare impacts under the original calibration, and
this is attributed to the biased fitted flows generated by the
atheoretic intercept (illustrated in Figure 1).
A further point to take away from Table 4 is that the welfare
impacts of economic integration are sizeable in the last column, despite
our use of traditional calibration techniques. AvW (2002) contended that
their approach is more plausible than traditional calibration methods
because they estimate nontariff barriers to be many times larger than
formal trade barriers (of course, this requires the identical-taste
assumption). They go on to critique traditional calibrated models as
understating the effects of the North American Free Trade Agreement. We
find the explanation of the differences trivial: the critiqued
computational studies focused on experiments that removed much smaller
barriers. The traditional approach is to directly measure trade barriers
(tariff and nontariff, if available) and attribute any unexplained
border resistance to idiosyncratic taste parameters. In contrast, AvW
assume all trade resistance at the border to be a border charge. Table 4
shows that given comparable border charges (and comparable response
parameters), the traditional calibration technique yields comparable
aggregate results.
V. CONCLUSIONS
Contemporary economic analysis includes two broad traditions of
fitting economic models to data. While the analytical objectives of
calibration and estimation have traditionally differed, recent
applications highlight the need to consider them in a unified framework.
Under consistent identifying assumptions, both approaches generate the
same structural parameters necessary to relate exogenous changes to
endogenous outcomes.
For many structural econometric studies, the core assumptions of
the economic model are without question. The goal is to identify an
estimable reduced form of the equilibrium system. We emphasize the
importance of returning to the model's extensive form if reduced
form estimates are to be given their structural interpretation.
Operationalizing an economic model requires identification of a full set
of exogenous structural parameters.
In contrast to the unified framework proposed by Dawkins,
Srinivasan, and Whalley (2001), AvW (2002) argue that estimated models
are superior to calibrated simulation models. Our illustration that
estimation is calibration moves the focus of analysis onto the key
identifying structural assumptions that make the fitting procedure
possible and beyond the particular label placed on the fitting
procedure. The econometric procedure proposed by AvW generates results
that seem to contradict similar calibrated models, but we show that this
is due to specific structural and parametric restrictions not found in
traditional simulation models. Our contribution, however, is broader
than our specific example. Viewing empirical investigations from the
unified perspective, that structural estimation is calibration, will aid
in applying structural assumptions consistently and in directing
assumptions toward more efficient use of limited data.
ABBREVIATIONS
AvW: Anderson and van Wincoop
CES: Constant Elasticity of Substitution
CGE: Computable General Equilibrium
GDP: Gross Domestic Product
MCP: Mixed Complementarity Problem
NLP: Nonlinear Program
REFERENCES
Anderson, J. E., and E.van Wincoop. "Borders, Trade, and
Welfare," in Brookings Trade Forum: 2001, edited by S. M. Collins
and D. Rodrik. Washington, DC: Brookings Institution Press, 2002,
207-30.
--. "Gravity with Gravitas: A Solution to the Border
Puzzle." American Economic Review, 93, 2003, 170-91.
Balistreri, E. J. "A Note on the Local and Global Consistency
of Anderson and van Wincoop's Gravity Model." Working Paper,
Colorado School of Mines, 2006.
Balistreri, E. J., and R. H. Hillberry. "Trade Frictions and
Welfare in the Gravity Model: How Much of the Iceberg Melts?"
Canadian Journal of Economics, 39, 2006, 247-65.
Brown, D. K., and R. M. Stern. "Computational Analysis of the
U.S.-Canada Free Trade Agreement: the Role of Product Differentiation and Market Structure," in Trade Policies for International
Competitiveness, edited by R. C. Feenstra. Chicago, IL: University of
Chicago Press, 1989, 217-50.
Cheng, I.-H, and H. J. Wall. "Controlling for Heterogeneity in
Gravity Models of Trade and Integration." Federal Reserve Bank of
St. Louis Review, 87, 2005, 49-63.
Dawkins, C., T. N. Srinivasan, and J. Whalley.
"Calibration," in Handbook of Econometrics, Vol. 5, Chapter
58, edited by J. J. Heckman and E. E. Learner. Amsterdam: Elsevier,
2001, 3655-705.
Deaton, A., and J. Muellbauer. Economics and Consumer Behavior.
Cambridge: Cambridge University Press, 1980.
Feenstra, R. C. Advanced International Trade: Theory and Evidence.
Princeton, NJ: Princeton University Press, 2004.
Francois, J. "Flexible Estimation and Inference Within General
Equilibrium Systems." Conference Proceedings: The Fourth Annual
Conference on Global Economic Analysis, GTAP, Vol. 1, 2001.
Harrison, G. W., R. C. Jones, L. J. Kimbell, and R. M. Wigle.
"How Robust is Applied General Equilibrium Analysis?" Journal
of Policy Modeling, 15, 1992, 99-115.
Harrison, G. W., and H. D. Vinod. "The Sensitivity Analysis of
Applied General Equilibrium Models: Completely Randomized Factorial Sample Designs." Review of Economics and Statistics, 74, 1992,
357-62.
Hertel, T., D. Hummels, M. Ivanic, and R. Keeney. "How
Confident Can We Be in CGE-Based Assessments of Free Trade
Agreements?" National Bureau of Economic Research Working Paper No.
10477, 2004.
Hillberry, R. H., M. A. Anderson, E. J. Balistreri, and A. K. Fox.
"Taste Parameters as Model Residuals: Assessing the 'Fit'
of an Armington Trade Model." Review of International Economics,
13, 2005, 973-84.
Hoover, K. D. "Facts and Artifacts: Calibration and the
Empirical Assessment of Real-Business-Cycle Models." Oxford
Economic Papers, 47, 1995, 2444.
Hummels, D. "Toward a Geography of Trade Costs." Working
Paper, Purdue University, 2001.
Learner, E. E., and J. Levinsohn. "International Trade Theory:
The evidence," in Handbook of International Economics, Vol. 3,
Chapter 26, edited by G. M. Grossman and K. Rogoff. Amsterdam: Elsevier,
1995, 1339-94.
Liu, J., C. Arndt, and T. W. Hertel. "Parameter Estimation and
Measures of Fit in A Global, General Equilibrium Model." Journal of
Economic Integration, 19, 2004, 626-49.
Lopez-de-Silanes, F., J. R. Markusen, and T. F. Rutherford.
"The Auto Industry and the North American Free Trade
Agreement," in Modeling Trade Policy." Applied General
Equilibrium Assessments of North American Free Trade, Chapter 8, edited
by J. F. Francois and C. R. Shiells. Cambridge: Cambridge University
Press, 1994, 223-55.
Mathiesen, L. "Computation of Economic Equilibria by a
Sequence of Linear Complementarity Problems." Mathematical
Programming Study, 23, 1985, 144-62.
de Melo, J. "Computable General Equilibrium Models for Trade
Policy Analysis in Developing Countries: A Survey." Journal of
Policy Modeling, 10, 1988, 469-503.
Perroni, C., and T. F. Rutherford. "Regular Flexibility of
Nested CES Functions." European Economic Review, 39, 1995, 335-43.
Rose, A. K., and E. van Wincoop. "National Money as a Barrier
to International Trade: The Real Case for Currency Union." American
Economic Review, 91, 2001, 386-90. Papers and Proceedings of the Hundred
Thirteenth Annual Meeting of the American Economic Association.
Rutherford, T. F. "Constant Elasticity of Substitution
Functions Some Hints and Useful Formulae." mimeo, University of
Colorado, 1995a.
--. "Extensions of GAMS for Complementarity Problems Arising
in Applied Economic Analysis." Journal of Economic Dynamics and
Control, 19, 1995b, 1299-1324.
Shoven, J. B., and J. Whalley. "Applied General-Equilibrium
Models of Taxation and International Trade: An Introduction and
Survey." Journal of Economic Literature, 22, 1984, 1007-51.
Trefler, D. "Trade Liberalization and the Theory of Endogenous
Protection: An Econometric Study of U.S. Import Policy." Journal of
Political Economy, 101, 1993, 138-60.
(1.) See Harrison et al. (1992), Harrison and Vinod (1992), and
Hertel et al. (2004).
(2.) See Trefler (1993) or Rose and van Wincoop (2001).
(3.) Myriad constant returns general equilibrium models appear in
the trade and tax policy literature (the early work is surveyed by de
Melo 1988 and Shoven and Whalley 1984). Studies that incorporate new
theories of scale or variety effects include Lopez-de-Silanes, Markusen,
and Rutherford (1994) and Brown and Stern (1989).
(4.) More generally, an MCP is a powerful numeric tool that
directly accommodates complementary slack conditions that arise in
economics (Rutherford 1995b).
(5.) The system (1) through (4) represents the equilibrium as it
might be solved numerically. The 4n unknowns are the [Y.sub.i],
[U.sub.i], [E.sub.i], and [P.sub.i]. Only relative prices are
determined, however, so one of the market clearance conditions is
removed (by Walras's law), and we assign the associated price as
the numeraire.
(6.) The demand functions embedded in Equation (2) also need to be
identified, but it is a routine calculus exercise to recover these from
the unit expenditure function.
(7.) AvW (2003) define the nominal trade flow from i to j as the
product of the f.o.b. price, the trade cost factor, and the net quantity
consumed ([P.sub.i][t.sub.ij][c.sub.ij]). This also equals the product
of the f.o.b. price and the export quantity ([P.sub.i][X.sub.ij]). The
net (of iceberg melt) quantity consumed is therefore
[X.sub.ij][t.sub.ij]. We simply make this substitution and directly
define utility as a function of the bilateral export quantities scaled
by their respective trade costs.
(8.) The zero-order scale parameter [[??].sub.j], is set equal to
the benchmark utility level, which from Equation (4) is benchmark income
divided by the benchmark consumer price index, [[??].sub.j] =
[[??].sup.sub.j]/[[??.sup.0.sub.j].
(9.) Perroni and Rutherford (1995) formally characterized the
orders of approximation approach to functional identification in their
analysis of the (second order) flexibility of nested CES functions.
(10.) The second-order curvature parameters (the [[??].sub.j]) used
by AvW (2003) are assumed, not informed by their data. We adopt the same
approach in this analysis.
(11.) We do not include the familiar summary parameters in our
count of primitive parameters. In general, theoretic and econometric
exercises focus on these summary parameters, where as calibration
exercises focus on their primitives (the estimated benchmark fitted
values). There are 2n + [n.sup.2] summary parameters: the zero-order
[[??].sub.i], the first-order [[??].sub.ij], and the second-order
[[??].sub.i]. This obscures the fact that many more primitive estimates
are needed to identify the summary parameters, and our count assumes
that each of the substitution elasticities has only a single underlying
primitive. Realistically, the count of primitives should be expanded to
include multiple first-order fitted observations (a minimum of two
different observations of the first-order information are needed to
identify the [[??].sub.i]).
(12.) Our illustrative example follows AvW's two-country
application, which assumes that the United States and Canada are the
only countries and that states and provinces are the relevant geographic
divisions for region specific varieties.
(13.) It is important to emphasize that the benchmark price
normalization is only valid locally. Away from the benchmark, we
normalize on the price of a single region's endowment commodity,
this commodity serves as numeraire.
(14.) In making this seemingly innocent assumption about physical
units and endowment values, we have implicitly asserted that there is no
error associated with the measurement of income or its conversion into
common nominal units. Our failure to account for uncertainty in these
underlying procedures may not bias our subsequent analysis but seriously
undermine the validity of statistical inference. After all, many degrees
of freedom have been consumed in the generation of what we accept as
primary data. Following the tradition in most of empirical economics, we
acknowledge this problem and ignore it.
(15.) AvW explain that although symmetry is assumed, the
econometric model cannot distinguish between this equilibrium and one in
which there are asymmetries that produce the same average trade
resistance (AvW 2003, Footnote 11).
(16.) For example, Hummels (2001) argues that an additive form is
more sensible. As with many theories that support the gravity
literature, the origin of Equation (12) is more likely the log-linear
regression, not the most plausible microfoundations.
(17.) In deriving their reduced form, AvW assume homogeneity in
both relative and absolute tastes across regions (the cardinalization of
utility is maintained across regions). Subsequently, AvW suggest
structural taste bias as an alternative to homogeneity: we explore this
suggestion in Section IV. For the equations here to be consistent with
absolute taste homogeneity, one could normalize the utility functions by
a positive monotonic transformation. For example, multiplying Equation
(8) by
[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]
normalizes the scale of utility across regions. With this
modification, the equilibrium Condition (4) becomes
[U.sub.i][E.sub.i]/[[??].sub.i] = [Y.sub.i]. Relative homogeneity in
utility is achieved by holding the distribution parameters constant
across the regions. For example,
[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]
In subsequent analysis, we adopt AvW's reduced form for
calibration, noting that their cardinalization is different than ours
but the resulting demand systems and, therefore, the extensive forms are
isomorphic.
(18.) AvW (2003) reduce the full set of bilateral pairs by: flows
that do not have observations, flows that are observed to be zero, and
any measured internal flows.
(19.) AvW calculate changes in relative income as they apply to
their multilateral resistance indexes, Equation (13). They do not adjust
the regional income terms as they appear in Equation (14).
(20.) For a full presentation of the local versus global
consistency of AvW's model and analysis, see Balistreri (2006).
Balistreri shows that the numeraire shift will be large if the original
numeraire commodity is associated with a region largely affected by the
shock. For example, in the border removal experiment, Balistreri
reported that if Alberta's endowment commodity is used as the
numeraire, AvW's gravity equation understates trade by 67%. In
contrast, if Alabama's endowment commodity is used as the
numeraire, AvW's measure understates trade by 4%. These
calculations depend on [sigma] = 5 (and a proper account of income
changes).
(21.) Our arguments in this article do not depend qualitatively on
the value of [sigma], but the reader is warned that subsequent
quantitative illustrations depend on [sigma] = 5. This is the value AvW
(2003) prefer based on their reading of the literature.
(22.) Under an agnostic stance on absolute trade costs, the
benchmark general equilibrium is unidentified and the regression loses
its ability to inform the structure. The estimating system, Equation
(15) subject to Equation (13), is consistent with the general
equilibrium at the benchmark, but this system alone cannot inform us
about the economic variables away from that benchmark. Moving from
regression coefficients to an operational economic model requires a
normalization that defines the absolute sizes of the [[??].sup.0.sub.ij]
Our normalization is based on the illogic of negative distance-related
costs.
(23.) AvW (2003) did not report information obtained on the
estimated constant k, which is inevitably inconsistent with its
structural interpretation, given direct measures of benchmark income.
Empirical estimates of k from the least squares problem imply an
aggregate income that is more than 3.5 times larger than observed (by
summing across the [Y.sup.0.sub.i]). If one were interested in testing
the theory rather than calibrating the theoretic model, the model is
easily rejected based on a hypothesis test of the similarity between k
and its theory consistent value, -ln [[summation].sub.i]
[[Y.sup.0.sub.i]. The implications of these inconsistencies are further
explored in Section IV.
(24.) The relatively small nonlinear system (of only 164 equations)
is solved using PATH, a complementarity problem algorithm, available in
the GAMS software. The code is available upon request.
(25.) Unlike AvW's method, some traditional econometric
methods of estimating demand systems (e.g., the Linear Expenditure
System or the Almost Ideal Demand System) automatically impose adding up
(Deaton and Muellbauer 1980). Adding up is only achieved in the AvW
estimation by restricting the regression intercept.
(26.) It is important to note that the log difference in trade
flows is not the residual being minimized in the regression Equation
(15). The log difference in trade flows is the more common residual in
the broader gravity literature.
(27.) The plots follow the conventions of STATA software. The
central line within each box represents the median value of the
distribution. The box includes estimates within the interquartile range (those between the 25th and 75th percentile). The whiskers extend beyond
the box in each direction at a distance of 1.5 times the interquartile
range. Observations outside the whiskers are outliers represented as
individual data points.
(28.) The regression coefficients change substantially when
structure is imposed on the constant term: [[beta].sub.1] = -1.44 and
[[beta].sub.2] = -1.85 under true structural estimation.
(29.) The [[??].sup.0.sub.i] are always interpreted as the price of
the composite good (the composite good is regional units of utility),
but its value relative to the benchmark f.o.b. prices of endowments
depends on the arbitrary scale of utility. To interpret this measure
literally as a consumer price index, the benchmark units of the
composite good need to be comparable with endowment units (another
special cardinalization of utility). In this case when trade frictions
that cause an f.o.b. to c.i.f. price wedge (pecuniary costs as AvW call
them) are removed, the consumer price indexes revert to unity. Like
other issues relating to the cardinalization of utility, the
interpretation of [[??].sup.0.sub.i] as a consumer price index, as
opposed to the relative price of utils, is largely irrelevant in the
more general discussion of regularities in trade resistance due to
borders and distance. AvW's point is that some portion of the
resistance em bodied in the measured [[??].sup.0.sub.ij] might be due to
things other than transport or border policy that vary with distance and
country, respectively. Our conclusions from an earlier article support
this, more general, interpretation of trade resistance (Balistreri and
Hillberry 2006).
(30.) We only examine border-related taste bias, but it is also
reasonable, and prudent, to think that distance-related resistance also
includes a taste component. We concentrate on the border bias because of
its relevance to welfare effects in the counterfactual simulation of
border removal. For this illustration, we also make a stark
simplification that the portion of border costs that are due to taste
bias are constant across regions.
(31.) The large welfare benefits of integration advertised by AvW
(2002) follow from the assumption that all measured trade resistance at
the border is a border charge.
(32.) Measured tariffs are calculated from the USITC Interactive
Tariff and Trade DataWeb. This illustrative calculation only measures
the ad valorem rate on U.S. imports from Canada.
(33.) An econometric analogue to this procedure is conducted by
Cheng and Wall (2005). They estimate a model with country-pair fixed
effects and then regress those fixed effects on distance and other
geographic variables to measure the average effect of the geography
variables on trade.
EDWARD J. BALISTRERI and RUSSELL H. HILLBERRY *
* We thank James E. Anderson and Eric van Wincoop for providing
their data. This article was largely completed while the authors were
employed with the U.S. International Trade Commission. The opinions and
conclusions are solely those of the authors.
Balistreri: Assistant Professor, Division of Economics and
Business, Colorado School of Mines, Golden, CO 80401-1887. Phone
303-384-2156, Fax 303-273-3416, E-mail ebalistr@mines.edu
Hillberry: Senior Lecturer, Department of Economics, Economics and
Commerce Building, University of Melbourne, Parkville, VIC 3010,
Australia. Phone +61 3 8344 5354, Fax +61 38344 6899, E-mail rhhi@
unimelb.edu.au
TABLE 1 Scope of the AvW General Equilibrium with CES Preferences
Variables
[Y.sub.i] = incomes
[E.sub.i] = unit expenditure index
[P.sub.i] = prices (f.o.b.)
[U.sub.i] = utility levels
Instruments
[q.sub.i] = endowments
[t.sub.ij] = trade frictions
Structural Parameters ([gamma])
[Y.sup.o.sub.i] = benchmark incomes
[E.sup.o.sub.i] = benchmark unit expenditure indexes
[[PHI].sup.o.sub.i] = [Y.sup.o.sub.i]/[E.sup.o.sub.i] (zero-order
summary parameters)
[X.sup.o.sub.ij] = benchmark trade flows
[P.sup.o.sub.i] = benchmark prices (f.o.b.)
[t.sup.o.sub.ij] = benchmark trade frictions
[[theta].sub.ij] = ([P.sup.o.sub.i][X.sup.o.sub.ij])/[Y.sup.o.sub.j]
(first-order summary parameters)
[[sigma].sub.i] = elasticities of substitution (second-order summary
parameters)
TABLE 2 Benchmark Equilibrium and Simulated Border Removal
([sigma] = 5)
Cost-of-Living
Income ([Y.sub.i]) Index ([E.sub.i])
Benchmark Percent Percent
($US Change Change
billion) (%) Benchmark (%)
Province
Alberta 56.3 14 2.1 -16
British Columbia 62.9 10 1.9 -12
Manitoba 16.7 19 2.1 -20
New Brunswick 9.5 16 2.1 -18
Newfoundland 6.4 19 2.4 -20
Nova Scotia 12.4 16 2.1 -17
Ontario 194.3 14 1.8 -16
Prince Edward Island 1.6 17 2.2 -19
Quebec 107.1 12 1.9 -14
Saskatchewan 15.3 18 2.2 -19
State
Alabama 83.0 0 1.6 -1
Arizona 85.0 0 1.8 -1
California 843.1 0 1.4 -1
Florida 300.7 0 1.5 -1
Georgia 170.9 0 1.5 -1
Idaho 22.4 1 1.8 -2
Illinois 312.3 0 1.5 -1
Indiana 129.7 0 1.5 -1
Kentucky 79.9 0 1.5 -1
Louisiana 94.7 0 1.6 -1
Massachusetts 174.0 0 1.5 -1
Maryland 124.6 0 1.4 -1
Maine 25.1 1 1.6 -2
Michigan 217.3 0 1.5 -1
Minnesota 114.6 0 1.6 -1
Missouri 118.3 0 1.5 -1
Montana 16.1 1 1.8 -2
North Carolina 168.6 0 1.5 -1
North Dakota 12.7 1 1.7 -2
New Hampshire 27.2 I 1.6 -2
New Jersey 243.9 0 1.4 -1
New York 541.1 0 1.4 -1
Ohio 256.6 0 1.5 -1
Pennsylvania 283.1 0 1.4 -1
Tennessee 116.7 0 1.5 -1
Texas 453.0 0 1.6 -1
Virginia 170.0 0 1.5 -1
Vermont 13.0 1 1.6 -2
Washington 136.4 1 1.7 -2
Wisconsin 117.7 0 1.6 -1
Rest of United States 988.6 0 1.8 -1
Price of Output
([P.sub.i]) Utility ([U.sub.i])
Percent Percent
Change Change
Benchmark (%) Benchmark (%)
Province
Alberta 1.0 14 26.5 36
British Columbia 1.0 10 32.9 26
Manitoba 1.0 19 7.8 49
New Brunswick 1.0 16 4.6 42
Newfoundland 1.0 19 2.7 50
Nova Scotia 1.0 16 5.9 39
Ontario 1.0 14 107.3 35
Prince Edward Island 1.0 17 0.7 45
Quebec 1.0 12 57.5 29
Saskatchewan 1.0 18 7.1 46
State
Alabama 1.0 0 53.2 1
Arizona 1.0 0 48.4 1
California 1.0 0 595.1 0
Florida 1.0 0 195.1 1
Georgia 1.0 0 111.9 1
Idaho 1.0 1 12.7 2
Illinois 1.0 0 210.3 1
Indiana 1.0 0 86.7 1
Kentucky 1.0 0 52.8 1
Louisiana 1.0 0 59.0 1
Massachusetts 1.0 0 119.6 1
Maryland 1.0 0 88.0 1
Maine 1.0 1 15.2 3
Michigan 1.0 0 142.6 2
Minnesota 1.0 0 71.1 2
Missouri 1.0 0 76.4 1
Montana 1.0 1 9.0 3
North Carolina 1.0 0 110.3 1
North Dakota 1.0 1 7.4 2
New Hampshire 1.0 1 17.5 2
New Jersey 1.0 0 168.2 1
New York 1.0 0 386.7 1
Ohio 1.0 0 174.0 1
Pennsylvania 1.0 0 196.7 1
Tennessee 1.0 0 75.9 1
Texas 1.0 0 283.7 1
Virginia 1.0 0 114.1 1
Vermont 1.0 1 8.2 3
Washington 1.0 1 82.6 3
Wisconsin 1.0 0 75.3 1
Rest of United States 1.0 0 552.5 2
TABLE 3 Disbursement of Endowments ([sigma] = 5)
Consumed in
Canada
Endowment
([q.sub.i] Benchmark Counterfactual
Province
Alberta 56.3 19.3 5.7
British Columbia 62.9 33.0 12.8
Manitoba 16.7 4.6 1.0
New Brunswick 9.5 3.0 0.8
Newfoundland 6.4 1.5 0.3
Nova Scotia 12.4 4.4 1.2
Ontario 194.3 79.4 24.0
Prince Edward 1.6 0.5 0.1
Island
Quebec 107.1 47.7 16.7
Saskatchewan 15.3 4.3 1.0
State
Alabama 83.0 0.4 1.6
Arizona 85.0 0.4 2.0
California 843.1 2.1 9.8
Florida 300.7 1.2 5.2
Georgia 170.9 0.8 3.4
Idaho 22.4 0.2 0.8
Illinois 312.4 1.5 6.8
Indiana 129.7 0.8 3.4
Kentucky 79.9 0.5 2.1
Louisiana 94.7 0.4 1.8
Massachusetts 174.0 1.3 5.7
Maryland 124.6 0.7 3.1
Maine 25.1 0.4 1.6
Michigan 217.4 1.9 8.6
Minnesota 114.6 0.8 3.6
Missouri 118.3 0.6 2.7
Montana 16.1 0.2 0.7
North Carolina 168.6 0.9 4.2
North Dakota 12.7 0.1 0.5
New Hampshire 27.2 0.3 1.3
New Jersey 243.9 1.6 7.4
New York 541.1 3.9 17.7
Ohio 256.6 1.8 7.9
Pennsylvania 283.1 2.1 9.2
Tennessee 116.7 0.6 2.6
Texas 453.0 1.7 7.7
Virginia 170.0 1.0 4.5
Vermont 13.0 0.2 0.8
Washington 136.4 1.7 8.1
Wisconsin 117.7 0.8 3.6
Rest of United 988.6 6.8 30.0
States
Consumed in the
United States
Benchmark Counterfactual
Province
Alberta 4.1 15.7
British Columbia 3.9 17.6
Manitoba 1.7 5.4
New Brunswick 0.9 3.0
Newfoundland 0.6 1.8
Nova Scotia 1.0 3.7
Ontario 16.5 64.7
Prince Edward 0.2 0.5
Island
Quebec 7.3 31.4
Saskatchewan 1.4 4.7
State
Alabama 37.7 36.3
Arizona 32.9 31.4
California 548.4 537.4
Florida 147.0 142.4
Georgia 81.1 78.2
Idaho 8.4 7.8
Illinois 153.8 148.1
Indiana 61.5 58.8
Kentucky 36.8 35.2
Louisiana 42.3 40.8
Massachusetts 105.6 100.2
Maryland 75.9 72.9
Maine 10.4 9.4
Michigan 99.3 93.2
Minnesota 48.6 45.9
Missouri 53.6 51.4
Montana 5.8 5.4
North Carolina 80.6 77.2
North Dakota 4.7 4.4
New Hampshire 12.8 11.9
New Jersey 132.3 126.0
New York 323.3 307.4
Ohio 126.8 120.6
Pennsylvania 152.5 144.9
Tennessee 52.6 50.5
Texas 205.1 198.5
Virginia 85.9 82.1
Vermont 5.6 5.1
Washington 69.3 63.1
Wisconsin 51.4 48.7
Rest of United 348.9 328.6
States
Transport
Melt
Benchmark Counterfactual
Province
Alberta 32.9 34.9
British Columbia 26.1 32.5
Manitoba 10.5 10.3
New Brunswick 5.6 5.7
Newfoundland 4.3 4.2
Nova Scotia 7.0 7.5
Ontario 98.5 105.7
Prince Edward 1.0 1.0
Island
Quebec 52.1 59.0
Saskatchewan 9.6 9.6
State
Alabama 45.0 45.1
Arizona 51.6 51.6
California 292.6 296.0
Florida 152.6 153.1
Georgia 89.0 89.3
Idaho 13.8 13.7
Illinois 157.0 157.5
Indiana 67.5 67.5
Kentucky 42.7 42.7
Louisiana 52.0 52.1
Massachusetts 67.2 68.1
Maryland 48.0 48.5
Maine 14.3 14.1
Michigan 116.1 115.6
Minnesota 65.2 65.1
Missouri 64.2 64.2
Montana 10.1 10.0
North Carolina 87.0 87.2
North Dakota 7.9 7.8
New Hampshire 14.1 14.0
New Jersey 109.9 110.5
New York 213.9 216.1
Ohio 128.1 128.2
Pennsylvania 128.6 129.0
Tennessee 63.5 63.5
Texas 246.2 246.9
Virginia 83.1 83.4
Vermont 7.2 7.1
Washington 65.4 65.1
Wisconsin 65.5 65.4
Rest of United 632.9 630.1
States
TABLE 4 Welfare Impacts of Economic Integration under Alternative
Calibrations
Equivalent Variation
Econometric Exact-Fit
Benchmark GDP Calibration Calibration
Province ($US billion) (%) (%)
Alberta 56 36 28
British Columbia 63 26 14
Manitoba 17 49 20
New Brunswick 9 42 24
Newfoundland 6 50 78
Nova Scotia 12 40 14
Ontario 194 35 36
Prince Edward Island 2 45 64
Quebec 107 29 19
Saskatchewan 15 46 19
GDP-weighted average 34 27