文章基本信息

标题：Semiparametric estimation of land price gradients using large Data sets.
作者：Bryan, Kevin A. ; Sarte, Pierre-Daniel G.
期刊名称：Economic Quarterly
印刷版ISSN：1069-7225
出版年度：2009
期号：January
语种：English
出版社：Federal Reserve Bank of Richmond
摘要：Traditional urban theory typically predicts land values that form a smooth and convex surface centered at a central business district (CBD) (Mills 1972 and Fujita 1989). The fact that land values are highest near the city center reflects a trade off between commuting costs and agglomeration externalities at the CBD. As distance from the city center increases, so do commuting costs for workers employed at the CBD. Agglomeration effects, however, such as knowledge sharing or decreased shipment costs from a common port, are highest near the CBD. In equilibrium, therefore, the price of residential land tends to be bid up most forcefully close to the city center where commuting costs are lowest. In empirical work, the shape of the land price surface is often estimated using a parametric regression that includes a measure of Euclidian distance from the CBD or a polynomial function of location data. The parameter associated with distance from the CBD, then, captures the rate at which land prices decline as one moves away from the city center and toward the rural outskirts. Though this is a straightforward method to obtain estimates of the rate of price decline, parametric methods can be misleading for two reasons.
关键词：Real estate;Real estate industry;Real property

Semiparametric estimation of land price gradients using large Data sets.

Bryan, Kevin A. ; Sarte, Pierre-Daniel G.

Traditional urban theory typically predicts land values that form a smooth and convex surface centered at a central business district (CBD) (Mills 1972 and Fujita 1989). The fact that land values are highest near the city center reflects a trade off between commuting costs and agglomeration externalities at the CBD. As distance from the city center increases, so do commuting costs for workers employed at the CBD. Agglomeration effects, however, such as knowledge sharing or decreased shipment costs from a common port, are highest near the CBD. In equilibrium, therefore, the price of residential land tends to be bid up most forcefully close to the city center where commuting costs are lowest. In empirical work, the shape of the land price surface is often estimated using a parametric regression that includes a measure of Euclidian distance from the CBD or a polynomial function of location data. The parameter associated with distance from the CBD, then, captures the rate at which land prices decline as one moves away from the city center and toward the rural outskirts. Though this is a straightforward method to obtain estimates of the rate of price decline, parametric methods can be misleading for two reasons.

First, as noted by Seyfried (1963) among others, cities are "not a featureless plain." Bodies of water, mountains, and geography more generally all distort the land price surface by influencing potential commuting patterns. Second, and more importantly, there is growing evidence, both theoretical and empirical, that the monocentric city of Mills (1972), for example, is being replaced by the polycentric city, where employment subcenters lead to land price gradients of a form that may be difficult to uncover parametrically. Anas, Arnott, and Small (1998) survey this literature, while Redfearn (2007) provides an example of the employment density surface in Los Angeles. A parametric model of such a city may smooth over important employment subcenters and high-price suburbs. As such, nonparametric estimates of the land price surface allow for a more robust description of the data.

Estimation of land gradients using nonparametric or semiparametric methods is somewhat involved relative to parametric regressions. In part, to economize on computations, early work in this area has tended to use only vacant lot sales (Colwell and Munneke 2003), but the sparseness of that data can lead to overly smooth price gradients. Furthermore, vacant lot sales are not as informative when considering land prices outside of dense urban cores, as there may be large areas without any nearby sale during the period studied. In contrast, the number of residential house sales in a given period can be substantial. This article, therefore, reviews, a method for constructing land price gradients using a potentially large set of housing sales data. Drawing on work by Yatchew (1997) and Yatchew and No (2001), we estimate a semiparametric hedonic housing price equation where the contribution of housing attributes to home prices is obtained parametrically, but the component of home prices that varies with location is not assumed to lie in a given parametric family.

Using data from 2002-2006, we apply this method to the city of Richmond, Virginia, and three surrounding counties. The region under consideration covers approximately 1,218 square miles, comprises nearly one million people, and has boundaries that are agricultural in nature. Since our technique uses home transaction sales, and not simply vacant land, we are able to construct a land gradient from over 100,000 observations. Surprisingly, given the recent trend toward polycentric cities in the United States, we find that the price surface in Richmond is largely monocentric, with land prices falling from over $100 per square foot (in 2006 constant dollars) around the CBD to less than $1 per square foot in the rural outskirts. Though the CBD is the dominant feature on the price surface, larger suburbs, such as Mechanicsville, Ashland, Short Pump, and Midlothian, and corridors along Interstates 64 and 95, are easily identified. Furthermore, the presence of these subcenters distorts estimates of parametric surfaces even when they assume one dominant center.

An exponential function fitted to our estimates of land prices reveals that prices fall, on average, at the rate of 2.8 percent per mile as one moves away from Richmond's CBD. Put another way, land prices fall by 1/2 every 25 miles. This rate is significantly higher than the rate of decline estimated with a least squares regression of home prices on housing characteristics and a measure of distance from the CBD, which finds prices falling at only 1 percent per mile. This difference arises because conventional parametric methods do not allow for local variations in housing prices and, consequently, achieve a considerably worse fit over a large area. In particular, the parametric regression is associated with a much poorer fit of the data relative to its semiparametric counterpart.

The technique for estimating land prices proposed in this article makes no assumptions about the geography of the Richmond area or the structure of the land price surface. In a parametric estimation of land prices, the inclusion of variables to represent location within a certain county or distance from an identified "employment subcenter" (such as Chicago O' Hare airport in Colwell and Munneke [1991]) is essential to achieving a reasonably accurate land price surface. However, identifying locations such as an employment subcenter can be a difficult and arbitrary task (Giuliano and Small [1991] and McMillen [2001]). Moreover, independent of commuting considerations, proximity to geographical features such as lakes may enhance the value of certain locations. By construction, this feature of land prices cannot be captured by parametric methods based on distance from a CBD. In contrast, because no arbitrary decisions about the functional form of the land surface need to be made with the method used in this article, it can be directly applied to any urban region of any shape and size.

The rest of the article proceeds as follows. In Section 1, we describe the data and features of the Richmond area. Section 2 describes the empirical model and discusses how semiparametric land price estimates are computed. In Section 3, we construct the land price surface and compare our estimates with those constructed using simpler polynomial and distance-from-CBD estimates. Section 4 concludes.

1. DATA DESCRIPTION

This article estimates the land price gradient from a full sample of residential sales in the city of Richmond and three nearby counties--Hanover, Henrico, and Chesterfield--from 2002-2006. Richmond is a mid-sized regional center with a population just over 200,000, lying 100 miles south of Washington, D.C., at the intersection of Interstates 95 and 64. The urban core was well developed by the late 19th century, when the city served as the Confederate capital during the Civil War. As such, the mean age of the housing stock in the city itself is more than 66 years. Hanover County lies due north of Richmond, with a largely rural population of less than 100,000, though significant suburbs do line the Interstate 95 corridor. Henrico County lies both to the west and to the east of Richmond, with a population just under 300,000, and is home to a number of quickly growing suburbs surrounding Interstate 64, notably the areas around Short Pump and Mechanicsville. Chesterfield County, with a population over 300,000, lies to the south of Richmond and is primarily made up of low density suburban areas, along with a few notable small towns such as Chester and Midlothian. A map of Richmond and surrounding counties is given in Figure 1. All told, the region includes almost one million people residing in over 1,218.5 square miles. Aside from the far southern end of Chesterfield County, which abuts the cities of Colonial Heights and Petersburg, the edge of this region consists of rural farmland.

[FIGURE 1 OMITTED]

We acquired a full record of property sales, with matched housing characteristics, from the city and counties. These characteristics include the furnished square footage of a house, the number of years since the house was first built, its plot acreage, and the number of bathrooms available. We also include binary variables that indicate whether a house has air conditioning, whether its exterior is made of brick, vinyl, or wood, and whether it is heated using oil, hot water, or central air. Before carrying out the estimation, we filter the data along several dimensions. First, all nonresidential properties were removed, as the parametric portion of our estimation requires data on, for instance, livable square footage. Next, we remove houses that appear to have improperly entered data--this includes houses with construction dates before 1800, houses with sales prices of less than one dollar, houses that appear to have been sold in a lot where no breakdown of the sales price per house. is available, and houses with plot acreage lower than .02 acres (871 square feet).

We geocode data from street addresses in ArcGIS using SPCS NAD83 coordinates, which, unlike simple latitude and longitude, allow easy calculation of Euclidean distance in feet between any two points. In some cases, the geocoder was unable to positively identify a street address; unidentified houses are left out of the final sample. Descriptive statistics of our data are reported in Table 1.

Table 1 Data Summary

Variable Mean St. Dev. Min. Max.

Sales Price (a) 210,068.63 152,035.95 6.46 5,639,195.45
Age 33.14 28.61 1 207
No. of Bathrooms 2.13 0.90 0 72
Air Conditioning 0.66 0.47 0 1
Square Footage 1,876.0 903.4 319 63,233
Lot Acreage 0.70 2.44 0.02 98.77
Brick Exterior 0.20 0.40 0 1
Vinyl Exterior 0.31 0.46 0 1
Wood Exterior 0.18 0.38 0 1
Gas Heating 0.09 0.29 0 1
Oil Heating 0.17 0.38 0 1
Hot Water Heating 0.14 0.35 0 1
Central Air Heating 0.13 0.33 0 1

Notes: (a) Expressed in constant 2006 dollars.

2. THE EMPIRICAL MODEL

This section sets up the basic framework we use throughout the remainder of the article. We denote the city of Richmond (and its four surrounding counties) by C and a location in the city by l = (x, y) [member of] C, where x and y are Cartesian coordinates. We denote the (log) per-square-foot price of a home in Richmond by p. Our analysis begins with the following hedonic price equation,

p = X[beta] + f(l) + [epsilon], (1)

where X is a k-element vector of conditioning housing characteristics such that cov(X|l) = [[SIGMA].sub.X|l], f(l) is the component of a home price directly related to location, and [epsilon] is a random variable such that E([epsilon]|l, X) = 0 and var ([epsilon]|l) = [[sigma].sub.[epsilon].sup.2]. The matrix X consists of all of the variables from Table 1 and quadratic terms for lot acreage and square footage, as Brownstone and De Vany (1991), among others, find that land price per acre is a concave function of parcel size. The coefficients, [beta], capture the reduced-form effects of particular housing attributes, such as the size of the living area of a house or whether air conditioning is available, on home prices. Moreover, since p--X[beta] represents housing prices purged of the contribution from specific attributes, we think of f(l) as capturing the value of land per-square-foot at a given location. While this general semi-log specification is standard in the analysis of real estate data, some differences exist regarding the functional form that describes the function f(*). One option is to specify f(*) as a polynomial function of location data, as in Galster, Tatian, and Accordino (2006),

f(x, y) = [[alpha].sub.0]x + [[alpha].sub.1] y + [[alpha].sub.3] [x.sup.2] + [[alpha].sub.4] [y.sub.2] + [[alpha].sub.5] x y. (2)

Substituting equation (2) into equation (1), one can then consistently estimate the coefficients [beta] and [[alpha].sub.i], i = 1,..., 5, using least squares.

An alternative approach that uses least squares estimation is to parameterize f(*) as a function of distance from the CBD, as in Zheng and Khan (2008),

f(x, y) = [[alpha].sub.d] [square root of (term) [(x - [x.sub.c]).sup.2] + [(y - [y.sub.c]).sup.2], (3)

where [l.sub.c] = ([x.sub.c], [y.sub.c]) denotes the location of the CBD. Recalling that p is measured in log units, [[alpha].sub.d] then captures the exponential rate of change in land values as one moves away from the CBD.

In contrast to either of these approaches, this article does not assume that f(l) lies in a given parametric family. The only restriction that we shall impose on f(l) is that it satisfies a Lipschitz condition,

||f([l.sub.a]) - f([l.sub.b])|| < L||[l.sub.a] - [l.sub.b]||, L [greater than or equal to] 0. (4)

Semiparametric Regression

Estimating the nonparametric component of equation (1), f(l), requires that we first address estimation of the parametric effects, [beta]. One strategy would be to estimate equation (1) in two stages, first ignoring the nonparametric component, f(l), in order to obtain estimates of [beta] by regressing p on X, and then applying nonparametric methods to purged home prices, p - X[bar.[beta]], where [bar.[beta]] denotes the previously obtained estimates of [beta]. However, since the reduced form model contains a component related to location that is being ignored, estimates of [beta] obtained in this way will be biased when housing attributes, X, are correlated with location, l. Rather, a two-step estimation strategy must somehow "get rid" of the nonparametric component in the first step.

Let n denote the number of observations in our data set. A popular approach, pioneered by Robinson (1988), recognizes as a first step that equation (1) implies that

p - E (p|l) = [X - E(X|l)] [beta] + [epsilon]. (5)

In other words, the conditional differencing of equation (1) gets rid of the nonparametric component. Robinson (1988) then shows that by replacing E (p|l) and E (X|l) with nonparametric kernel estimates (to be described below) [^.E](p\l) and [^.E](X|l), respectively, and then regressing p - [^.E](p\l) on [X - [^.E](X|l)], yields estimates of [beta] that are [square root of n] consistent. Unfortunately, this method can be quite onerous since separate nonparametric regressions are required for each housing attribute in X, where both the number of relevant housing attributes and the number of observations are large in our case. To avoid this problem, we summarize instead a differencing method developed more recently by Yatchew (1997) and Yatchew and No (2001), and adopted, for example, in Rossi-Hansberg, Sarte, and Owens (2008).

The basic idea behind differencing the data works as follows. We would like to re-order our data, ([p.sub.1], [X.sub.1], [l.sub.1]), ([p.sub.2], [X.sub.2], [l.sub.2]),..., ([p.sub.n], [X.sub.n], [l.sub.n]) so that the l's are close, in which case differencing tends to remove the nonparametric effects. To get a sense of the implications of differencing, suppose that locations constitute a uniform grid on the unit square (the re-scaling is without loss of generality). Each point may then be thought of as taking up an area of 1/n and the distance between adjacent observations is therefore [1/[square root of n]]. Suppose further that the data have been re-ordered so that ||[l.sub.1] - [l.sub.i-1]|| = [1/[square root of n]]. First-differencing of (1) then yields

[p.sub.i] - [p.sub.i-1] = ([X.sub.i] - [X.sub.i-1])[beta] + f([l.sub.i]) - f([l.sub.i-1]) + [[epsilon].sub.i] - [[epsilon].sub.i-1]. (6)

Assuming that equation (4) holds, the difference in nonparametric components in (6) vanishes asymptotically. Yatchew (1997) then shows that the ordinary least squares estimator of [beta] using the differenced data (i.e., by regressing [p.sub.i] - [p.sub.i-1] on [X.sub.i] - [X.sub.i-1]) is also [square root of n] consistent. This estimator of [beta], however, achieves only 2/3 efficiency relative to the one produced by Robinson's method. This can be improved dramatically by way of higher-order differencing. Specifically, define [DELTA]p to be the (n - m) x 1 vector whose elements are [[[DELTA]p].sub.i] = [[SIGMA].sub.s]=0.sup.m][[omega].sub.s][P.sub.i-s], [DELTA]X to be the (n - m) x. k matrix with entries [[[DELTA]X].sub.ij] = [[SIGMA].sub.s=0.sup.m][[omega].sub.s][X.sub.i-s,j], and similarly for [DELTA]s. The parameter m governs the order of differencing and the [omega]'s denote constant differencing weights. Equation (6) can then be generalized as

[delta]p = [delta]X[beta] + [m.summation over (s=0)]] [[omega].sub.s] f([l.sub.i-s]) + [delta] [epsilon], i = m + 1,..., n, (7)

where the following two conditions are imposed on the differencing coefficients, [[omega].sub.0],..., [[omega].sub.m]:

[m.summation over (s=0)]] [[omega].sub.s] = 0 and [m.summation over (s=0)]] [[omega].sub.s.sup.2] = 1. (8)

The first condition ensures that differencing removes the nonparametric effect in (1) as the sample size increases and the re-ordered l's get closer to each other. The second condition is a normalization restriction that implies that the variance of the transformed residuals in (7) is the same as the variance of the residuals in (1). When the differencing weights are chosen optimally, the difference estimator [[beta].sub.[DELTA]] obtained by regressing [[DELTA].sub.p] on [[DELTA].sub.X] approaches asymptotic efficiency by selecting m sufficiently large. (1) In particular, Yatchew (1997) shows that

[^.[beta].sub.[delta]] [~.sup.A] N([beta], (1 + 1/2m) [[sigma].sub.[epsilon].sup.2]/n [[SIGMA].sub.X|l.sup.-1)], [S.sub.[DELTA].sup.2] = 1/n [[n.summation over (i=1)] [([DELTA][p.sub.i] - [DELTA][X.sub.i] [^.[beta]]).sup.2] [[right arrow].sup.P] [[sigma].sub.[epsilon].sup.2], and (9)

[[^.[SIGMA].sub.u, [DELTA]] = 1/n [DELTA]X'[DELTA]X [[right arrow].sup.P] [[SIGMA].sub.X|l.

These results will allow us to do inference on [^.[beta]].sub.[DELTA]]. By equation (9), the [R.sup.2] statistic associated with our original empirical specification (1) can be estimated as [R.sup.2] = 1 - [[s.sub.[DEALTA].sup.2]/[s.sub.p.sup.2]], In our estimation exercise, we use m = 10, which produces coefficient estimates that are approximately 95 percent efficient when using optimal differencing weights.

Finally, note that because locations lie in [R.sup.2], the initial re-ordering of the l's is not unambiguous in this case. A Hamiltonian cycle over distance is the ordering of housing sale coordinates such that the sum of differenced distances is minimized. However, computing a Hamiltonian cycle for over 100,000 points is not yet tractable on a personal computer. (2) In this article, we reorder locations using a nearest-neighbor algorithm that finds an approximate Hamiltonian cycle in the following way: First, we select an arbitrary starting location from which we then find the location of a sale in our data set nearest to it in Euclidean distance. From this second location, we then find the nearest third sale location among the set of remaining observations (i.e., those not already identified). This process is repeated until every sale location has been selected. (3) Figure 2 displays the path chosen by our algorithm. The median distance between points in our sample is 86 feet. As this figure is darker in areas where there are more residential sales, it also serves as a rough guide to residential density in the Richmond area. The origin represents the CBD of Richmond City.

[FIGURE 2 OMITTED]

Nonparametric Kernel Estimation of f(l)

Denote by z the price of a home "purged" of its contribution from housing characteristics, where z, is obtained using first stage estimates, z = p - X[^.[beta].sub.[delta], and construct the data ([z.sub.1], [l.sub.1]), ([z.sub.2], [l.sub.2]),..., ([Z.sub.n],[l.sub.n]). Then, because [^.[beta]].sub.[DELTA]] is a consistent estimator of [beta], the consistency of f(l) obtained using standard kernel estimation methods applied to purged home prices remains valid.

The Nadaraya-Watson kernel estimator of f at location [l.sub.j] is given by

f([l.sub.j]) = [n.sup.-1] [n.summation over (i=1)] [W.sub.hi] ([l.sub.j] [z.sub.j]. (10)

In other words, the value of land at location [l.sub.j] is a weighted-average of the z's in our data sample. The weight, [W.sub.hi]([l.sub.j]), attached to each purged price,

[z.sub.i], is given by

[W.sub.hi] ([l.sub.j] = [[[K.sub.h] ([l.sub.j] - [l.sub.i])]/[[n.sup.-1] [[SIGMA].sub.i=1.sup.n] [K.sub.h] ([l.sub.j] - [l.sub.i])]], (11)

where

[K.sub.h] (u) = [h.sup.-1] K (u/h),

and K([PSI]) is a symmetric real function such that [integral] K([PSI])\d [PSI] < [infinity] and [integral] K([PSI])d[PSI] = 1. Thus, we may choose to attach greater weight to observations on prices of homes located near [l.sub.j] rather than far away by suitable choice of the function K. In particular, as in much of the literature, our estimation is carried out using the Epanechnikov kernel,

K(u/h) = 3/4 (1-[(u/h).sup.2]) I(|u/h| [less than or equal to] 1), (12)

where I (*) is an indicator function that takes the value 1 if its argument is true and 0 otherwise. The distance between location [l.sub.j] and some other location [l.sub.j] in Richmond is simply measured as a Euclidean distance in feet. The kernel in (12) then implies that prices of homes located more than a distance of h feet from [l.sub.j] will receive a zero weight in the estimation of f([l.sub.j]). In that sense, the bandwidth, h, has a very natural interpretation in this case. In practice, the estimation of f(l) is affected to a greater degree by the choice of bandwidth rather than the choice of kernel. (4)

How then does one choose what bandwidth is appropriate? A seemingly natural method for choosing the bandwidth is to minimize the sum of squared residuals,

MSE = [n.sub.-1] [n.summation over (i=1)] [[[z.sub.j] - f([l.sub.i])].sup.2]

However, because [z.sub.i] is used when estimating f([l.sub.i]) the mean squared error can be arbitrarily reduced by decreasing the bandwidth until all weight in f([l.sub.i]) is effectively placed on [z.sub.i]. To avoid this problem, the cross-validation method proposes that the bandwidth parameter be chosen by minimizing the sum of squared residuals from an alternative kernel regression in which [z.sub.i] is dropped in the estimation of f([l.sub.i]) Hence, we select h so that it solves

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (13)

where

[[~.f].sub.n] ([l.sub.j]) = [n.sub.-1] [n.summation over.sub.i [not equal to] j] [W.sub.hi] ([l.sub.j]) [Z.sub.j].

We estimate equation (7) using 103,543 observations over the period 2002-2006. All prices are deflated using the consumer price index and measured in 2006 constant dollars. We include among our conditioning variables, X, a set of time dummies associated with the sale date of a home that captures secular citywide increases in real home prices, where 2006 is set as the base year.

3. FINDINGS

This section reviews our findings. Before describing the results from the semiparametric estimation, we first present estimates from the polynomial specification of Galster, Tatian, and Accordino (2006), equation (2), and the parameterized distance function of Zheng and Khan (2008), equation (3), as benchmarks.

Table 2 presents estimates from the specification where the value of land is modeled as a quadratic function of location data, as in equation (2), under the heading "Parametric Model 1." The estimation of the coefficients is carried out using least squares. Virtually all housing characteristics are statistically significant at the 5 percent critical level and most are significant at the 1 percent level. The coefficients associated with the sale date are significant over and above prices being measured in constant dollars. In particular, the findings suggest a general increase in real home prices over our sample period (recall that 2006 is set as the base year). In addition, the regression achieves a relatively good fit for cross-sectional data, with an [R.sup.2] of about .50.

Table 2 Modeling Land Prices as Functions of Local Data

 Parametric Model 1 Parametric Model 2

Variable Coeff. t-Statistics Coeff. t-Statistics

2002 -0.41 -52.37 -0.40 -50.10
2003 -0.32 -42.09 -0.31 -40.11
2004 -0.22 -29.32 -0.22 -27.81
2005 -0.09 -12.39 -0.09 -11.37
Age (a) 0.30 25.40 0.26 20.37
No. of Bathrooms 0.13 30.81 0.16 35.21
Air Conditioning 0.33 61.35 0.36 65.07
Sq. Ft. (b) 0.15 31.55 0.18 35.55
(Sq. Ft.) (2c) -0.38 -20.37 -0.45 -23.27
Acreage -0.39 -222.58 -0.40 -225.31
(Acreage) (2d) 0.54 123.50 0.56 124.35
Brick Exterior 0.04 6.02 0.04 5.52
Vinyl Exterior 0.06 10.05 -0.01 -1.48
Wood Exterior -0.08 -11.01 -0.07 -9.21
Gas Heating 0.22 25.16 0.21 22.94
Oil Heating 0.04 4.56 0.18 25.03
Hot Water Heating 0.24 29.21 0.36 46.12
Central Air Heating 0.17 21.67 0.20 23.34
x 2.45 10.04
y -3.65 -31.88
[x.sup.2] -0.09 -13.87
[y.sup.2] -0.13 -37.07
xy -0.03 -3.88
Distance from CBD -0.01 -14.63
[R.sup.2] 0.50 0.47

 Semiparaimletric Model

Variable Coeff. t-Statistics

2002 -0.41 -74.64
2003 -0.33 -60.93
2004 -0.23 -43.07
2005 -0.10 -19.13
Age (a) -0.40 -29.65
No. of Bathrooms 0.04 11.76
Air Conditioning 0.10 21.26
Sq. Ft. (b) 0.11 25.29
(Sq. Ft.) (2c) -0.25 -17.61
Acreage -0.31 -161.19
(Acreage) (2d) 0.38 106.39
Brick Exterior -0.01 -1.79
Vinyl Exterior -0.02 -4.37
Wood Exterior -0.02 -3.99
Gas Heating 0.09 12.79
Oil Heating 0.06 6.96
Hot Water Heating 0.07 8.25
Central Air Heating 0.03 4.06
x
y
[x.sup.2]
[y.sup.2]
xy
Distance from CBD
[R.sup.2] 0.77

Notes: (a) Measured in 100 years; (b) measured in 1,000 sq. ft.; (c)
measured in 100 million sq. ft. squared; (d) measured in 100 acres
squared.

Of central interest in the first two columns of Table 2 are the parameters that govern the value of land associated with the Cartesian location data. The coefficients of the polynomial in equation (2) are all highly significant, with the exception of the cross-term, which is statistically significant at the 10 percent critical level only. Figure 3 shows the land value surface associated with these parameters. The origin roughly represents the CBD of the city of Richmond, at the intersection of 7th Street and Canal Street. This location corresponds to coordinates within an area generally considered the employment center of Richmond, with high employment density and a preponderance of commercial and office buildings. The polynomial estimate of land prices, however, has a peak nearly 20 miles away from the CBD, roughly located in a far western suburb known as Short Pump. Since the polynomial in equation (2) permits, at most, one local interior maximum, this parametric regression imposes a land price surface typical of a monocentric city. Essentially, the polynomial will choose a maximum in an area for which there exist many house sales with a fairly high land value, even if there are other local maxima (near the CBD) on the true land price surface with higher prices but fewer sales.

[FIGURE 3 OMITTED]

Table 2 also presents estimates from the parametric specification where land prices (per square foot) are assumed to decline at an exponential rate with distance from the CBD, equation (3), under the heading "Parametric Model 2." The results generally mimic those of our first parametric model, both in terms of the coefficients associated with the different housing attributes and their statistical significance. For example, an additional bathroom adds approximately $0.13 to the log per-square-foot price of a home under the first specification in Table 2 and $0.16 under the second parameterization. This translates to an additional bathroom, adding $2.39 and $2.68, respectively, to the per-square-foot price of an average home. Note, however, that the parametric specification for Model 2 achieves a slightly worse fit than that for Model 1, with an [R.sup.2] statistic of 0.47.

Under the parametric specification including distance from the CBD, we find that the log price per square foot of land declines at a rate of .0103 per mile as one moves away from Richmond's CBD. This translates into a price per square foot of land that falls exponentially at a rate of 1.03 percent per mile as distance from the CBD increases. Alternatively, this exponential rate of decay implies that land prices fall by 1/2 approximately every 67 miles. This rate of decline in land prices is significantly slower than those estimated by Zheng and Khan (2008) for Beijing, and Colwell and Munneke (1997) for Chicago. The difference occurs because our estimation exercise extends over an area much larger than that covered in the latter two papers, extending 50 miles in diameter in our case, using a similar specification. Because differences in geography are potentially much more pronounced over a larger area, the restriction embedded in the parametric specification related to location will be more stringent and its fit becomes poorer. Figure 4 shows the land price surface associated with our second parametric model. By construction, given the specification in equation (3), this surface reaches a peak at the location that we defined as the center of the CBD, [l.sub.c] = ([x.sub.c], [y.sub.c]) in equation (3). As in the other parametric regression estimated above, this specification imposes a single peak on the land price surface as expected for a monocentric city. Compared to Figure 3, Figure 4 suggests considerably less variation in land prices throughout the city, with a peak price of $7.60 per square foot at the CBD and approximately $3.80 at the boundaries of greater Richmond. These figures translate to $33,106 and $16,553, respectively, for a 0.1 acre lot.

[FIGURE 4 OMITTED]

Given the variation in home prices in greater Richmond depicted in Table 1, this relatively narrow range in estimated land prices implied by Model 2 is potentially surprising. Moreover, the fact that these estimates stem from a parametric regression whose fit is slightly worse than that associated with the first parametric model in Table 2 should leave us somewhat skeptical. As we now discuss, the alternative specification where f(l) in equation (1) is treated nonparametrically yields a significantly better fit, and implies a much more varied and greater range in land prices.

In contrast to the findings from the parametric approaches we have just reviewed, the last two columns of Table 2 present estimates from the semi-parametric method described in Section 2. As before, virtually all variables are statistically significant at the 1 percent critical level, but the magnitude of the coefficients differs somewhat from those of our first two parametric specifications. For example, an additional bathroom now contributes .038 to the log per-square-foot price of a home as opposed to .13 for Model 1 and .16 for Model 2. The difference stems from the fact that we now estimate the component of home prices associated with location nonparametrically. In particular, observe that this semiparametric specification achieves a notice-ably better fit relative to the previous two parametric specifications with an [R.sup.2] statistic of 0.77 instead of 0.50. Put another way, the semiparametric method adopted here improves the fit of the parametric regressions carried out earlier by almost 60 percent.

Figure 5 illustrates the land price surface obtained using kernel estimation. Evidently, this surface differs considerably from those shown in Figures 3 and 4 along at least two important dimensions. First, this surface displays more variation in land prices across different areas of Richmond than could be obtained from parametric estimates. Observe, for instance, that the West End of Richmond is generally characterized by higher land prices than the area east of the city. The surface also displays multiple local peaks in prices associated with different parts of the city. Second, although the nonparametric estimation identifies one main peak, land prices where this peak is located are as high as $130 per square foot, in contrast to $10 per square foot using the polynomial specification in (2). A typical 0.1 acre lot in the most expensive neighborhoods of Richmond, therefore, is estimated at around $566,280 as opposed to $43,560 obtained earlier with the polynomial parameterization.

[FIGURE 5 OMITTED]

The main reason underlying these differences arises because nonparametric estimation relies on local averaging of the data--sharp peaks and valleys are much more easily discovered with a nonparametric estimation. More specifically, the bandwidth that minimizes the cross-validation criterion in equation (13) is around 5,000 feet in our case. In other words, in estimating land prices at any given location, our procedure uses data within 5,000 feet of that location, with weights that decay quadratically in (12) as one moves away from the point of estimation.

Figure 6 shows the contour map corresponding to the land price surface shown in Figure 5. A main peak is clearly visible just to the north and west of the CBD and corresponds to an area of expensive row houses known as the Fan District in Richmond. Prices in that neighborhood are as high as $80 to $ 130 per square foot.

[FIGURE 6 OMITTED]

In contrast, land prices near the boundaries of Richmond range from only $ 1 to $2 per square foot and capture the opportunity cost of land related to agricultural activity at those locations. Local peaks in prices in Figure 6 are also visible five to 15 miles north and west of the city, around areas known as Short Pump and the West End more generally. These areas lie around Interstate 64 and consist of a number of newer suburban neighborhoods generally made up of single-family homes. The contour plot also shows evidence of local peaks extending north from the CBD around the Interstate 95 corridor and mid-sized towns such as Ashland and Mechanicsville. Finally, a series of local maxima can be found 12 miles west and 6 miles south of the CBD, in a region featuring a number of small lakes and golf courses.

Interestingly, despite the variations in land values shown in Figures 5 and 6, our findings suggest that Richmond remains largely a monocentric city. The more recent expansion in activity in the areas of Short Pump, west of the city, is associated with higher land prices on average compared to other areas located a similar distance away from Richmond's CBD. On the whole, however, land values are highest in the older sections of Richmond near the center of the city.

To compare our estimated land prices with alternative estimates, we obtain land price assessments from Chesterfield County, the county that lies in the southern portion of our region. (5) For tax purposes, Chesterfield County computes assessments both for underlying land value and for improvements. The land value assessment is updated every year and is based on "comparable" vacant land sales from other regions in the county. Though this method allows for much less local variation than our semiparametric approach, it provides a rough estimate of land costs per square foot in the county. Over the period studied, land assessments average just under $3 per square foot, with a number of plots assessed at under $1 per square foot, and the most expensive county land assessed at $20 to $30 per square foot. Table 3 displays characteristics of each distribution. The most expensive plots were located near the golf courses and lakes southwest of Richmond's CBD that were identified as a local maximum in the semiparametric estimation. The assessed value of land tends to be $3 to $6 per square foot less than our semiparametric estimation. This difference reflects, in part, housing characteristics that we are unable to control for in the first step of the nonparametric estimation. For data availability reasons, we cannot remove every piece of a house--for instance, there is no dummy for the number of fireplaces, whether the lot is fenced, the particular layout of the house, the quality of interior materials, etc. The fact that a house exists on every land plot in our sample means that our definition of land is more precisely that of a plot with a zero square foot, zero bathroom house with some unidentified exterior and heating type. Practically, this means that houses in our sample have already installed utility hookups and already have the appropriate zoning for a residential house.

Table 3 Estimates versus Assessments

(Per Square Foot) Assessments Estimate

Mean 2.46 9.50
Median 2.02 9.15
Maximum 30.75 33.89
Standard Deviation 1.98 5.18

Unless the house is of very poor quality, a plot with those characteristics will be more valuable than empty land, and, therefore, our nonparametric land price estimates will be somewhat biased upwards. However, whatever bias exists will be evident across all locations, so that differences in the price of land at one location relative to other locations should be similar in both the land assessments and our estimated land price; this indeed appears to be the case for Chesterfield County.

Finally, recall that the parametric specification, including distance from the CBD, delivers a small range of land values and a very shallow price gradient. Figure 7 shows an estimate of the land gradient obtained by projecting our estimates of (log) per-square-foot land prices at a given location onto distance from the CBD at that location. Put another way, we estimate the following equation,

[FIGURE 7 OMITTED]

[^.f] (l) - [[alpha].sub.0] + [[alpha].sub.d] d(l) + e, (14)

where [^.f](l)] denotes our nonparametric estimate of (log) land value at location l, d(l) is the distance to the center of the CBD from l, and [[alpha].sub.d] represents an exponential rate of change in prices. Our findings suggest a substantially faster rate of decay in land prices (the solid line) than estimated previously for Richmond using the parametric specification in equation (3) (the dashed line). The parametric specification estimated earlier gave a decline of 1.03 percent per mile. In contrast, we now find that [[alpha].sub.d] is approximately--.0278, as opposed to --.0103 in Table 2. This implies that land prices decline at a rate of 2.78 percent per mile on average as one moves away from Richmond's CBD. Alternatively, land prices fall by 1/2 approximately every 25 miles as distance from the CBD increases.

4. CONCLUDING REMARKS

The complexity of the urban price surface means that the assumptions that prices decline monotonically from a CBD or reflect a simple polynomial function of location data are not innocuous. Transit corridors, bodies of water, parkland, golf courses, employment subcenters, and other topographical features can have significant effects on land prices around a city. While these features could in theory be controlled for, it is not straightforward to identify what features or employment centers might be worth identifying.

Drawing on recent work by Yatchew (1997) and Yatchew and No (2001), the semiparametric procedure outlined in this article allows for an approach that does not require a priori assumptions regarding what features of the landscape might affect land prices. It also allows a very large data set--that of all housing transactions in a region--to be used when estimating the land price gradient. Since this procedure does not, unlike earlier work on land prices, rely on local knowledge, it can be applied wholesale to any region or city.

Empirically, an application of this semiparametric approach to land price estimation in Richmond, Virginia, identifies local maxima in the land price surface principally along the Interstate 64 and 95 corridors, in the suburbs of Ashland and Short Pump, and around the lakes and golf courses south of Midlothian. The most expensive land in the region, by a large margin, lies in the historic district of the Fan located close to the CBD; prices in the Fan per acre are over 100 times more expensive than rural land in the surrounding counties.

REFERENCES

Anas, Alex, Richard Arnott, and Kenneth A. Small. 1998. "Urban Spatial Structure." Journal of Economic Literature 36 (September): 1,426-64.

Applegate, David L., Robert E. Bixby, Vasek Chvatal, and William J. Cook. 2006. The Traveling Salesman Problem: A Computational Study. Princeton, N.J.: Princeton University Press.

Brownstone, David F., and Arthur De Vany. 1991. "Zoning, Returns to Scale, and the Value of Undeveloped Land." Review of Economics & Statistics 73 (November): 699-704.

Colwell, Peter F., and Henry J. Munneke. 1997. "The Structure of Urban Land Prices." Journal of Urban Economics 41 (May): 321-36.

Colwell, Peter F., and Henry J. Munneke. 2003. "Estimating a Price Surface for Vacant Land in an Urban Area." Land Economics 79 (February): 15-28.

DiNardo, John, and Justin Tobias. 2001. "Nonparametric Density and Regression Estimation." Journal of Economic Perspectives 15 (Fall): 11-28.

Fujita, Masahisa. 1989. Urban Economic Theory. Cambridge, U.K.: Cambridge University Press.

Galster, George, Peter Tatian, and John Accordino. 2006. "Targeting Investments for Neighborhood Revitalization." Journal of the American Planning Association 72 (Autumn): 457-74.

Giuliano, Genevieve, and Kenneth A. Small. 1991. "Subcenters in the Los Angeles Region." Regional Science and Urban Economics 21 (July): 163-82.

McMillen, Daniel P. 2001. "Nonparametric Employment Subcenter Identification." Journal of Urban Economics 50 (November): 448-73.

Mills, Edwin S. 1972. Studies in the Structure of the Urban Economy. Baltimore: Johns Hopkins University Press.

Redfearn, Christian. 2007. "The Topography of Metropolitan Employment: Identifying Centers of Employment in a Polycentric Urban Area." Journal of Urban Economics 61 (May): 519-41.

Robinson, P.M. 1988. "Root-N-Consistent Semiparametric Estimation." Econometrica 56: 931-54.

Rosenkrantz, Daniel J., Richard E. Stearns, and Philip M. Lewis II. 1977. "An Analysis of Several Heuristics for the Traveling Salesman Problem." SIAM Journal of Computing 6: 563-81.

Rossi-Hansberg, Esteban, Pierre-Daniel G. Sarte, and Raymond E. Owens, III. 2008. "Housing Externalities: Evidence from Spatially Concentrated Urban Revitalization Programs." Federal Reserve Bank of Richmond Working Paper 08-03.

Seyfried, Warren R. 1963. "The Centrality of Urban Land Values." Land Economics 39 (August): 275-84.

Yatchew, Adonis. 1997. "An Elementary Estimator of the Partial Linear Model." Economics Letters 57 (December): 135-43.

Yatchew, Adonis, and Joungyeo Angelo No. 2001. "Household Gasoline Demand in Canada." Econometrica 69 (November): 1,697-709.

Zheng, Siqi, and Matthew E. Khan. 2008. "Land and Residential Property Markets in a Booming Economy: New Evidence from Beijing." Journal of Urban Economics 63 (March): 743-57.

(1) Optimal differencing weights. [[omega].sub.0],..., [[omega].sub.m], solve min [delta] = [[SIGMA].sub.k = 1.sup.m][([[SIGMA].sub.s] [[omega].sub.s][[omega].sub.s]+k).sup.2] subject to the constraints in (7). See Proposition 1 in Yatchew (1997).

(2) The Hamiltonian cycle is the solution to the famous Traveling Salesman Problem (TSP). As of 2007, the largest TSP ever solved on a supercomputer involved 85,900 points, which is smaller than our problem (Applegate et al. 2006).

(3) See Rosenkrantz, Stearns, and Lewis (1977). Although the starting point is arbitrary, it has little implications for our findings.

(4) See DiNardo and Tobias (2001).

(5) In many counties, full records of assessment data are not free to obtain; the Chesterfield Assessment Office, however, was able to send us land assessments linked to our housing sales data.

Kevin A. Bryan and Pierre-Daniel G. Sarte

We wish to thank Nadezhda Malysheva, Yash Mehra, and Ray Owens for their comments. We also thank Sarah Watt for helpful research assistance. The views expressed in this article do not necessarily represent those of the Federal Reserve Bank of Richmond or the Federal Reserve System. All errors are our own. E-mail: pierre.sarte@rich.frb.org.