Exposure Measurement Error in Time-Series Studies of Air Pollution: Concepts and Consequences
Scott L. ZegerMisclassification of exposure is a well-recognized inherent limitation of epidemiologic studies of disease and the environment. For many agents of interest, exposures take place over time and in multiple locations; accurately estimating the relevant exposures for an individual participant in epidemiologic studies is often daunting, particularly within the limits set by feasibility, participant burden, and cost. Researchers have taken steps to deal with the consequences of measurement error by limiting the degree of error through a study's design, estimating the degree of error using a nested validation study, and by adjusting for measurement error in statistical analyses. In this paper, we address measurement error in observational studies of air pollution and health. Because measurement error may have substantial implications for interpreting epidemiologic studies on air pollution, particularly the time-series analyses, we developed a systematic conceptual formulation of the problem of measurement error in epidemiologic studies of air pollution and then considered the consequences within this formulation. When possible, we used available relevant data to make simple estimates of measurement error effects. This paper provides an overview of measurement errors in linear regression, distinguishing two extremes of a continuum-Berkson from classical type errors, and the univariate from the multivariate predictor case. We then propose one conceptual framework for the evaluation of measurement errors in the log-linear regression used for time-series studies of particulate air pollution and mortality and identify three main components of error. We present new simple analyses of data on exposures of particulate matter [is less than] 10 [micro]m in aerodynamic diameter from the Particle Total Exposure Assessment Methodology Study. Finally, we summarize open questions regarding measurement error and suggest the kind of additional data necessary to address them. Key words: air pollution, design methods, exposure, measurement error, time-series. Environ Health Perspect 108:419-426(2000). [Online 24 March 2000] http://ehpnet1.niehs.nih.gov/docs/2000/108p419-426zeger/abstract.html
Misclassification of exposure has long been recognized as an inherent limitation of epidemiologic studies of the environment and disease (1). For many agents of interest, exposures take place over time and in multiple locations so that it is difficult to accurately estimate the relevant exposures for individual study participants, particularly within the limits set by feasibility, participant burden, and cost. In general, exposure measurement error tends to blunt the sensitivity of epidemiologic studies for detecting the effects of environmental agents, although the specific impact of exposure error on effect estimates depends on several factors including the study design, the types of error, and the relationships between the outcome and the independent variables (1,2). As the problem of exposure error has become well recognized, researchers have taken steps to control its consequences by limiting the degree of error through careful study design and data collection, by estimating the degree of error using a nested validation study, and by making adjustments for measurement error in statistical analyses.
In this paper, we address the problem of exposure error in observational ecologic time-series studies of air pollution and health. The pollution of outdoor air is a public health concern throughout the world. For decades, epidemiologic studies have been a cornerstone of our approach to investigating the health effects of air pollution and have been a principal basis for setting regulations to protect the public against adverse health effects. Two broad types of observational study designs have been used in research on air pollution: ecologic or aggregate-level studies, either cross-sectional or time-series in design, and individual-level studies, primarily of the cross-sectional or cohort designs. In ecologic studies, population-level indicators of exposure are typically drawn from centrally sited air pollution monitors. In individual-level cross-sectional and cohort studies, exposure estimates for individual participants may be based on centrally located monitors, on the combination of central monitors with personal records of environments where participants spend time, or on personal exposure monitoring (3).
Regardless of study design, any pollution exposure assessment strategy introduces some degree of exposure measurement error. For example, in the Six Cities Study (4,5), a prospective cohort study of air pollution and respiratory health and mortality, exposure estimates for persons from each of the six cities were based on centrally sited monitors. Exposures were further characterized for samples of participants using personal monitors and monitors placed in their homes; the resulting data provide an understanding of the components of error associated with using the central site data for all participants.
The problem of measurement errors in predictor variables in regression analysis has been carefully studied in the statistics and epidemiologic literature for several decades. Fuller (6) summarized early research on linear regression with so-called "errors-in-x" variables. Carroll et al. (7) extended this literature to generalized linear models including Poisson, logistic, and survival regression analyses. Thomas et al. (2) presented an overview of the exposure error or misclassification problem from the general epidemiologic perspective. Spiegelman et al. (8), Willett (9), and Pierce et al. (10) provided recent illustrations of statistical approaches to measurement error in epidemiologic research.
In one of the early papers on the topic of exposure error in studies of air pollution, Shy et al. (11) described the problem and addressed its consequences in an epidemiologic framework. Goldstein and Landovitz (12,13) recognized that a single monitoring station may not adequately represent a geographic area and conducted an analysis of correlations among concentration data from several monitors in New York City. In the ensuing decades, there has been deepening understanding of measurement error in general and of its potential implications for the study of air pollution (14,15).
During the 1990s, substantial new evidence, largely from ecologic time-series analyses of air pollution and mortality, showed that daily variation in ambient measures of particulate air pollution within the current standards of the U.S. Environmental Protection Agency was associated with daily mortality levels (16). There are strong concerns about interpreting these associations in view of potential errors in the exposure measurements. In a series of papers, Lipfert and Wyzga (17) and Lipfert (18,19) suggested that the central monitoring data used in the time-series analyses have uncertain relationships with the exposures of individuals in the study communities; they further argued that those errors vary among pollutants, complicating interpretation of any multipollutant models. Lipfert and Wyzga (17) referred specifically to an analysis by Schwartz et al. (20) that attributed effects on mortality to fine rather than coarse particles, based in part on the results of multivariable models which included variables for both particulate measures.
A number of exposure assessment studies found sizable differences between actual personal exposures to particles and estimates based on central monitor values (21). Some investigators have questioned whether the observed associations are plausible given these findings. However, Schwartz et al. (20) responded that as the number of deaths per day is calculated over the population, the relevant exposure measure is the mean of personal exposures on that day, which is probably more tightly correlated with central station monitoring than individual exposures. Janssen et al. (22) reported that much of the variation in particulate matter [is less than or equal to] 10 [micro]m in aerodynamic diameter ([PM.sub.10]) measurements is between people and that the longitudinal correlation between average and ambient [PM.sub.10] measures is relatively high. The debate over measurement error and its consequences has taken place, however, without the development of a more comprehensive formulation of the problem.
Because exposure measurement error may have substantial implications for interpreting epidemiologic studies on air pollution, particularly the time-series analyses, we developed one systematic conceptual formulation of the problem of exposure error in epidemiologic time-series studies of air pollution and considered the possible consequences for relative risk estimation. We used available and relevant data to obtain rough estimates of the magnitudes of the effects of measurement error for one city.
Overview of Measurement Error Effects in Regression Models
The fundamental concepts of how exposure error can affect an epidemiologic study of pollution and health can be shown by considering the effects of exposure measurement error in a standard linear Gaussian regression model. The effects in Gaussian models have been discussed in full detail elsewhere (2,6,7,23,24). For simplicity, consider a regression of the health response (e.g., log mortality rate on day t) and predictors (e.g., [PM.sub.10], [O.sub.3], and weather):
[1] [y.sub.t] = [Alpha] + [[Beta].sub.x][x.sub.t] + [[Epsilon].sub.t]
where [Alpha] and [[Beta].sub.x] are regression coefficients to be estimated, and [[Epsilon].sub.t] represents residual error that is assumed to be independent of [x.sub.t]. Here [[Beta].sub.x] is the expected change in mortality per unit change in true exposure. Given observations ([x.sub.t], [y.sub.t]), t = 1, ..., T and appropriate assumptions about the distribution of the residuals, ordinary least-squares estimation provides optimal (unbiased and minimum varianced) estimates of the regression coefficients.
Now we assume that instead of the true exposure levels [x.sub.t] we have only an imperfect measure of exposure, denoted [z.sub.t]. The overall difference between [x.sub.t] and [z.sub.t] comprises multiple components of error, including differences between individual- and population-average exposures; between population-average exposures and ambient levels at central sites; and between actual ambient levels and the measurements of those levels. Suppose we regress the health outcome [y.sub.t] on the imperfect [z.sub.t] rather than [x.sub.t], which is unavailable:
[2] [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]
How will [[Beta].sub.z] differ from [[Beta].sub.x]?
To answer this question, we first assume that [z.sub.t] is a surrogate for [x.sub.t], which means that, given [x.sub.t], there is no additional information in [z.sub.t] about [y.sub.t] We then can distinguish two fundamentally distinct types of relationships between the true and measured exposures, which represent poles of a measurement error continuum. The first type is referred to as the classical error model (7), in which we assume that z is an imperfect measure of x, so that the average z within each x stratum equals x [E(z|x) = x]. Then it follows that the measurement error z - x is uncorrelated with the true value x. This classical model is a reasonable one for the difference between measured ambient levels of pollution and the true values for a measuring device that is unbiased. That is, when the true level of pollution is x, an unbiased instrument will measure x on average, even if individual measurements z differ from x.
The second type of model for measurement error is the Berkson error model (2). In this model, we assume that the average value of the true exposure x within each stratum of measured level z equals z [E(x|z) = z]. This Berkson model is appropriate when z represents a measurable environmental factor that is shared by a group of participants whose individual exposures x might vary because of time-activity patterns. For example, z might be the spatially averaged ambient level of a pollutant without major indoor sources and x might be the personal exposures that, when averaged across people, match the ambient level.
Classical and Berkson models for exposure measurement errors represent two extremes of a continuum. Most exposure errors combine elements of each, but because the consequences on risk assessment of classical and Berkson errors differ, it is useful to consider each in turn. In the case of the Berkson error, if we regress [y.sub.t] on [z.sub.t], rather than on [x.sub.t], the estimate [[Beta].sub.z] is an unbiased estimate of the coefficient [[Beta].sub.x] that would be obtained by regressing [y.sub.t] on the actual exposure [x.sub.t]. Having [z.sub.t] rather than [x.sub.t] does not lead to bias in the regression coefficients under the surrogacy assumption. The exposure measurement error increases the variance of the regression coefficient, however, because having [z.sub.t] rather than [x.sub.t] is obviously not as informative about the coefficient [[Beta].sub.x] Bias is not introduced, however. The same is true if the average x at each value of z differs from z by a fixed amount a, i.e., E(x|z) = z - a.
In contrast, under the classical error model [[Beta].sub.z] obtained by regressing [y.sub.t] on the imperfect measure exposure [z.sub.t] is a biased estimate of [[Beta].sub.x] In the simple linear regression with one explanatory variable, [[Beta].sub.z] is expected to be smaller than [[Beta].sub.x], or attenuated. The degree of attenuation increases as the variance of the exposure error increases. Again, a constant difference in the expected values of the two measures does not change this result.
It is useful to establish these results on the effects of exposure error on simple linear regression coefficients and helpful to do so in advance of considering a multiple regression case. The model of interest is Equation 1, but because [x.sub.t] is unobserved we instead might regress [y.sub.t] on [z.sub.t] (Equation 2).
The question is how will [[Beta].sub.z] from Equation 2 estimate [[Beta].sub.x] in Equation 1. Under the Berkson model, [x.sub.t] is assumed to vary about [z.sub.t], so that by Equation 1,
[3] E([y.sub.t]|[z.sub.t]) = [Alpha] + [[Beta].sub.x] E([x.sub.t]|[z.sub.t]) = [Alpha] + [[Beta].sub.x][z.sub.t].
Comparing Equation 3 and Equation 2 shows that [[Beta].sub.z] = [[Beta].sub.x] in the Berkson error case; that is, [[Beta].sub.z] is an unbiased estimate of [[Beta].sub.x]. Adding a constant to one exposure variable only affects the intercept.
Under the classical model, [z.sub.t] is assumed to vary about [x.sub.t] or E([z.sub.t]|[x.sub.t]) = [x.sub.t], which does not imply E([x.sub.t]|[z.sub.t]) = [z.sub.t]. If we further assume that [x.sub.t] and [z.sub.t] - [x.sub.t] are jointly normally distributed, it can be shown that
E([y.sub.t]|[z.sub.t] = [Alpha]** + c[[Beta].sub.x][z.sub.t],
where c is an attenuation factor between 0 and 1 given by c = var([x.sub.t]/|[var([x.sub.t]) + var([[Delta].sub.t])] where [[Delta].sub.t] = [z.sub.t] - [x.sub.t] is the exposure error. Again, a constant difference between the two exposure measures only changes the intercept.
Thus, the estimated regression coefficient is biased toward zero. In one pertinent case, [[Beta].sub.x] = 0, the naive estimate [[Beta].sub.z] is unbiased with E([[Beta].sub.z])= [[Beta].sub.x] = 0; that is, under the classical error model, measurement error does not lead to spurious associations if there is truly no association. Random variation, of course, can produce such associations by chance, as it can when there is no measurement error. How-ever, the probability of such false positive associations (the type 1 error rate) remains the same.
Realistic models for estimating the effects of air pollution on mortality have elements of both classical and Berkson error models. In general, the effect of such exposure errors is intermediate between the two extreme models. The effect of measurement error, therefore, likely depends on the direction and magnitude of the correlation of measurement errors with the measured exposures and not just on the variance of the measurement errors.
More complex multipollutant models are often applied in an attempt to estimate the independent effect of a pollutant present in a mixture with other pollutants. For example, in an analysis of air pollution and mortality in Philadelphia, Kelsall et al. (25) regress mortality on as many as five pollutants. Because little empirical evidence about the simultaneous errors in multiple pollutants is currently available, we only lay a foundation that can inform the design of future studies, as discussed in "Summary and Research Recommendations." Confining our attention to the classical and the Berkson error cases, we again assume a linear regression model of the form given by Equation 1, where [x.sub.t] now represents a vector of exposure variables, with a corresponding vector of regression coefficients [[Beta].sub.x], and [z.sub.t] denotes a vector of measurements of each exposure variable. In the Berkson error case, the assumption that [x.sub.t] is an imprecise version of [z.sub.t] or E([x.sub.t]|[z.sub.t]) = [z.sub.t] still assures that the estimates of the regression coefficients are unbiased, as in the univariate instance. Under the classical error model, however, the multiple regression extension is not so straightforward. We again assume that [z.sub.t] is an imprecise measure of [x.sub.t] i.e., E([z.sub.t]|[x.sub.t]) = [x.sub.t]. To compute E([x.sub.t]|[z.sub.t]), the average [x.sub.t] at each [z.sub.t], let V denote the covariance matrix of [x.sub.t] and let T denote the covariance matrix of the difference [[Delta].sub.t] = [z.sub.t] - [x.sub.t], and, as before, we assume that [Delta] and x are independent. The matrix generalization of the earlier result is that [[Beta].sub.z] = [[Beta].sub.x]C, where C = T [(T + V).sup.-1]. Now it is no longer true that [[Beta].sub.zj] [is less than] [[Beta].sub.xj] for each component (j) and estimates or regression coefficients can be biased toward or away from the null; that is, positive associations can be produced when the component is correlated with at least one component having a nonzero effect, even though the true coefficient for a particular component is zero.
Table 1 illustrates the magnitude of bias that can result from regressing [y.sub.t] on two predictors [z.sub.1t] and [z.sub.2t] instead of on [x.sub.1t] and [x.sub.2t]. This example might refer to estimating the effects of [PM.sub.10] and [O.sub.3] on mortality when ambient values (z values) instead of personal exposure (x values) are available. We assume [z.sub.1t] = [x.sub.1t] + [[Delta].sub.1t] [z.sub.2t] = [z.sub.2t]+ [[Delta].sub.2t] [V.sub.11] = var([x.sub.1t]) = [V.sub.22] = var([x.sub.2t]) = 1. Table 1 presents the expected values for the estimated regression coefficients when the true values are both one ([[Beta].sub.x1] = [[Beta].sub.x2] = 1) for varying values of the correlation between [x.sub.1t] and [x.sub.2t], the variances of [[Delta].sub.1t] and [[Delta].sub.2t], and the correlation between the measurement errors [[Delta].sub.1t] and [[Delta].sub.2t]. At present, there is little empirical evidence about the nature or size of the correlations between pairs of pollutant measurements and Table 1 is intended to illustrate the consequences of measurement error in the two-predictor model.
Table 1. Predicted bias in bivariate regression coefficients under different correlations (corr) between the true exposures and measurement errors with indicated variances (var) when both variables have a true effect: [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]
Corr ([x.sub.1], Var Var [x.sub.2]) ([[Delta].sub.1]) ([[Delta].sub.2]) 0.0 1.0 1.0 0.5 1.0 1.0 -0.5 1.0 1.0 0.0 1.0 1.0 0.0 1.0 1.0 0.0 0.5 2.0 0.5 0.5 2.0 0.5 0.5 2.0 0.5 0.5 2.0 0.5 0.5 2.0 0.5 0.5 2.0 0.5 0.5 2.0 0.5 0.5 2.0 E ([MATHEMATICAL Corr Corr EXPRESSION NOT ([x.sub.1], ([[Delta].sub.1], REPRODUCIBLE [x.sub.2]) [[Delta].sub.2]) IN ASCII]) 0.0 0.0 0.50 0.5 0.0 0.60 -0.5 0.0 0.33 0.0 0.5 0.40 0.0 -0.5 0.67 0.0 0.0 0.67 0.5 0.0 0.71 0.5 0.3 0.66 0.5 0.5 0.64 0.5 0.7 0.64 0.5 -0.5 0.83 0.5 -0.7 0.91 0.5 -0.9 1.00 E ([MATHEMATICAL Corr EXPRESSION NOT ([x.sub.1], REPRODUCIBLE [x.sub.2]) IN ASCII]) 0.0 0.50 0.5 0.60 -0.5 0.33 0.0 0.40 0.0 0.67 0.0 0.33 0.5 0.53 0.5 0.27 0.5 0.21 0.5 0.14 0.5 0.50 0.5 0.57 0.5 0.66
We assume var([x.sub.1]) = var([x.sub.2]) = 1.
The first line of Table 1 refers to an example in which there is no correlation between [x.sub.1t] and [x.sub.2t] and there is equal variability of the two exposure errors [[Delta].sub.1t] and [[Delta].sub.2t], and these errors are not correlated; that is, the error in one predictor does not predict the error in the other. Here, there is an equal degree of attenuation in the coefficients for the two variables. With unequal variances but no correlation, i.e., the sixth row, the degree of attenuation is greater for the variable with greater variance. If the exposures are correlated but the errors are uncorrelated (the second and third rows), the two effect estimates are similarly altered with the direction of the effect depending on the sign of the correlation. Introducing correlation between the errors, i.e., the fourth and fifth rows, has an effect that depends on the pattern of correlation. The bottom half of Table 1 shows more complex patterns with differing patterns of correlation and variation of the two errors. Some of the scenarios introduce substantially different effects of the two variables, but none yield effect estimates above the true value of one, even with more extreme differences in error variances or the two correlations.
Table 2 also addresses the consequences of measurement error in a two-variable model, but in this example only one variable ([x.sub.2]) has a true effect; the other exposure ([x.sub.1]) has no effect on the health outcome (y). Either correlation between [x.sub.1t] and [x.sub.2t] or their errors can introduce an apparent effect of [x.sub.1] on y. Some scenarios of variance and correlation even bring the apparent effects of the two variables quite close (e.g., the tenth and eleventh rows), but in every case, including more extreme situations than shown, the estimate for the true predictor ([[Beta].sub.2]) is always larger than for the null predictor ([[Beta].sub.1]).
Table 2. Predicted bias in bivariate regression coefficients under different correlations (corr) between the true exposures and measurement errors with indicated variance (var) when only one variable has a true effect: [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]
Corr ([x.sub.1], Var Var [x.sub.2]) ([[Delta].sub.1]) ([[Delta].sub.2]) 0.0 0.5 2.0 0.0 0.5 2.0 0.0 0.5 2.0 0.5 0.5 2.0 -0.05 0.5 2.0 0.5 0.5 2.0 0.5 0.5 2.0 0.5 0.5 2.0 0.5 0.5 2.0 0.5 0.5 2.0 0.5 0.5 2.0 E ([MATHEMATICAL Corr Corr EXPRESSION NOT ([x.sub.1], ([[Delta].sub.1], REPRODUCIBLE [x.sub.2]) [[Delta].sub.2]) IN ASCII]) 0.0 0.0 0.00 0.0 0.5 -0.12 0.0 -0.5 0.12 0.5 0.0 0.06 -0.05 0.0 -0.06 0.5 0.3 -0.01 0.5 0.5 -0.07 0.5 0.7 -0.15 0.5 -0.5 0.17 0.5 -0.7 0.21 0.5 -0.9 0.26 E ([MATHEMATICAL Corr EXPRESSION NOT ([x.sub.1], REPRODUCIBLE [x.sub.2]) IN ASCII]) 0.0 0.33 0.0 0.35 0.0 0.35 0.5 0.29 -0.05 0.29 0.5 0.28 0.5 0.29 0.5 0.29 0.5 0.33 0.5 0.36 0.5 0.39
We assume var([x.sub.1]) = var([x.sub.2]) = 1.
Some general conclusions can be offered concerning multipollutant models under this simple classical error model.
Conclusion 1. There is a general tendency for the coefficient from the regression on [z.sub.t] to be smaller than the corresponding coefficient from the regression on [x.sub.t], i.e., [[Beta].sub.zj] [is greater than] [[Beta].sub.xj] if all [[Beta].sub.xj] [is greater than] 0.
Conclusion 2. The degree of attenuation of each coefficient depends, in large part, on its measurement error variance relative to the variance of the true exposure--i.e. [T.sub.jj]/[V.sub.jj]. Thus, the coefficients for variables that are measured with considerable error will be attenuated more than those of variables with less error.
Conclusion 3. Depending on the correlation structure of the attenuation matrix C, some of the effect of one variable, [[Beta].sub.xj] may be transferred to the estimate of another variable's effect, [[Beta].sub.zk]. Such transfers of effect are generally from a more poorly measured variable to a better measured variable. However, for such transfers to be large, the true exposure variables or their measurement errors need to be substantially correlated.
Conclusion 4. As a consequence of conclusion 3, the estimate of a parameter can be biased away from the true value. However, this type of bias generally arises only with a very strong negative correlation between the measurement errors (e.g., rows 9-11 of Table 2).
Conclusion 5. Also as a consequence of conclusion 3, there will generally be spurious associations for a variable [x.sub.j] that, in fact has no effect only if [x.sub.j] is substantially correlated with one or more variables which actually have an effect. Generally, the correlation among the errors has a larger influence on the bias than the correlation among the true pollutant levels.
These conclusions are obtained from and therefore pertain to the classical linear regression model with two predictors, assuming that [z.sub.t] is a surrogate for [x.sub.t] (nondifferential errors). The actual exposure measurement situation in the air pollution-mortality context is obviously more complex. First, log-linear, not linear, models are used, although the degree of nonlinearity is usually small in mortality studies. Second, the measurement errors are not purely of the classical non-differential type. For example, the degree of error for gaseous pollutants may depend on temperature or other covariates. Finally, errors may be multiplicative rather than additive. Nonetheless, the linear regression with classical measurement error is a leading case that provides insight into the major possible consequences of exposure errors.
Framework for Assessing Measurement Error Effects in Pollution-Mortality Studies
We now build on the fundamental concepts underlying statistical models of exposure measurement error and focus on the specific log-linear regressions used for assessing the pollutant-mortality association, controlling for weather variables. Our discussion is based on the premise that the ideal investigation of the health effects of air pollution would be conducted at the individual level with measurements of personal exposure to pollutants. However, exposure and mortality data are generally only available after aggregation to a municipal level; little or no data from indoor air monitoring are available. Finally, air pollutant measurements are imprecise and this imprecision has consequences for estimates of pollutant effects on mortality.
To investigate the effects of exposure error in the log-linear regressions widely used to assess the pollutant-mortality association, consider the following model for an individual's risk of mortality:
[4] [[Lambda].sub.it] = [[Lambda].sub.0it]exp([x.sub.it][[Beta].sub.x])
where [[Lambda].sub.it] is the risk of death for person i on day t; [[Lambda].sub.0it] is that individual's baseline risk in the absence of exposure, i.e., [x.sub.it] = 0, and exp([x.sub.it] [[Beta].sub.x]) is the relative risk of death associated with the explanatory variables [x.sub.it]. Let [y.sub.it] = 1 if person i dies on day t and 0 if that person does not. We typically observe the total number of deaths for a population
[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII],
where [n.sub.t] [approximately equals] n is the population size on day t. By Equation 4, the expected total number of deaths 5Lt in a community is
[5] [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]
In analyzing population-level data on mortality and air pollution, log-linear regressions of the following form have been fit
[6] [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII].
where s(t) is an arbitrary but smooth function of time introduced to control for the confounding of longer-term trends and seasonality, [z.sub.t] is the average of multiple monitor measurements of ambient pollution measurement for day t, and [u.sub.t] are other possible confounders such as temperature and dew point temperature on the same and previous days.
If the regression coefficient [[Beta].sub.x] for a pollutant in the personal risk model Equation 4 is the target for inference, how closely do estimates of [[Beta].sub.z] from model Equation 6 approximate [[Beta].sub.x]?
Figure 1 poses a model of the relationship between the personal exposure to a pollutant [x.sub.it] for person i on day t and the available ambient values [z.sub.t] measured with error by monitors. Assuming, for simplicity, a high degree of spatial homogeneity in ambient levels, personal exposure is contributed to by [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII], the true outdoor level, and [w.sub.it], the indoor level, which is also influenced by [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] from penetration of the pollutant in outdoor air into indoor spaces. For example, personal exposure to [PM.sub.10] is determined by the time spent outdoors, the concentration during that time, and by the concentrations in indoor environments that are determined by indoor sources such as cigarette smoking and the penetration of particles indoors because air is exchanged between the outdoors and the indoor environments. Figure 1 further shows that the personal risk of dying is influenced by an individual's baseline risk in addition to the unobserved personal exposure to pollutant [x.sub.it]. Only the measured ambient pollution data are observed and are therefore shown in a rectangular box.
[Figure 1 ILLUSTRATION OMITTED]
In considering the consequences for [[Beta].sub.z] as an estimate of [[Beta].sub.x] with an imprecise measure of ambient pollution [z.sub.t], rather than actual personal exposure [x.sub.it], it is useful to begin by decomposing the pollution measurement difference between [x.sub.it] and [z.sub.t] into three components:
[7] [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII].
Here, ([x.sub.it] - [[bar]x.sub.t]) is the error due to having aggregated rather than individual exposure data; ([MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]) is the difference between the average personal exposure and the true ambient pollutant level; and ([MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]) represents the difference between the true and the measured ambient concentration.
The first term ([x.sub.it] - [[bar]x.sub.t]) is an example of Berksonian error so that, in a simple linear model, having aggregate rather than individual exposure does not itself lead to bias into the regression coefficient. The second term ([MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]) is not Berksonian and is likely to be a source of bias. The final term ([MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]) is largely of the Berkson type if the average of the available monitors [z.sub.t] is an unbiased estimate of the true spatially averaged ambient level [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII].
We can now further study the effects of these three terms on risk estimation by substituting the decomposition in Equation 7 into Equation 5. After some straightforward calculations detailed in the "Appendix," the expected number of deaths on day t can be written
[8] [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII].
Here [[Beta].sub.x] is the personal log-relative risk of interest from Equation 5. Note the approximation Equation 8 retains only linear terms in the expansion of an exponential function. The second-order terms are an order of magnitude smaller and are ignored to simplify the exposition. For studies of particulate pollution effects on mortality, the effect sizes are on the order of 1 or 2% so that ignoring second-order terms should not qualitatively affect the results. In studies of morbidity, higher order terms may be more important.
The total baseline risk ([n.sub.t][[bar][Lambda].sub.0t]) almost certainly varies smoothly over time because it is an average risk over a large population. Hence, it will be appropriately controlled for in log-linear regressions by inclusion of the smooth s(t) in Equation 6. We now consider [z.sub.t][[Beta].sub.x] and the three components of error in turn.
The first error term [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] is the difference between the baseline risk-weighted average personal exposure and the unweighted average personal exposure. It derives from the Berkson error ([x.sub.it] - [[bar]x.sub.t]) and produces no bias in the linear unaggregated model. This difference due to risk weighting in our log-linear model with person-specific baseline risks is likely to be small and to vary slowly over time. Hence, it can be adequately controlled by inclusion of the smooth function s(t) in the log-linear regression of [y.sub.t] on [z.sub.t] One scenario in which this difference would vary from day to day and therefore not be adequately controlled would occur if the more frail individuals were to follow pollution reports (or a correlate such as weather) and reduce their exposures to ambient air on high pollution days by, for example, staying indoors. Current warning systems for air pollution alerts are intended to reduce exposures of susceptible persons in this fashion.
The second error term [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] is non-Berksonian and has the greatest potential to introduce bias in the estimate [[Beta].sub.z] when [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] is correlated with [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]. Even if the terms are uncorrelated so that [[Beta].sub.z] will be a roughly unbiased estimate of [[Beta].sub.x], it will reduce efficiency relative to a study in which [x.sub.t] is available because [z.sub.t] and [[bar]x.sub.t] - [z.sub.t] share the same coefficient in Equation 8.
The difference [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] between average personal exposures and the true ambient value can be analyzed further by considering an individual personal exposure [x.sub.it], Because individual i's exposure on day t derives either from indoor or ambient sources, we can write [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] where [I.sub.it] is the concentration of pollutant generated by indoor sources such as tobacco smoke and pets and [[Alpha].sub.it] is his or her fraction of exposure from ambient sources that take place either outdoors or result from the penetration of ambient pollution indoors. It follows that [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] where [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]. That is, the average personal exposure is proportional to the ambient level offset by the effects of the population average of the non-ambient indoor sources.
Wilson and Sub (26) argued that the daily population average concentrations of fine particles derived from indoor sources [[bar]I.sub.t] are approximately independent of ambient levels [z.sub.t] across time. When this is true, failure to measure indoor sources will not introduce further bias in the estimation of [[Beta].sub.x] because the deviations due to indoor air exposure are a second example of Berkson error, and the errors will tend to cancel one another out when averaged over the population. Never-theless, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] is only proportional to [[bar]x.sub.t] so that even if [[bar][Alpha].sub.t] varied little over time ([[Alpha].sub.t] [approximately equals] [Alpha]), the coefficient [[Beta].sub.z] from a regression of [y.sub.t] on [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] would estimate [Alpha][[Beta].sub.x], not [[Beta].sub.x]. Hence, if 20% of daily exposure results from indoor sources independent of the ambient levels, the regression on ambient levels will yield coefficients that are roughly 20% smaller than would have occurred with actual personal exposures. However, this may be the appropriate coefficient for policymakers seeking an estimate of the effect of an inarguable measure of ambient levels. This, however, assumes that particles from indoor sources and outdoor sources are identical; that is, they are similar in composition and toxicity. If this is not the case, then the two types of particles are more appropriately treated as separate pollutants, and the personal exposure measure desired would be [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII], the personal exposure to particles from outdoor sources. Studies that use sulfates as a tracer for particles from outdoor sources indicate that indoor/outdoor ratios are [is less than] 1. Because people spend most of their time indoors, this suggests that [[Alpha].sub.it] will be [is less than] 1 and that the second term in Equation 8 will be negatively correlated with [z.sub.t], and will bias the estimated coefficient downward. This also illustrates that the model is not restricted to cases where E(x) = E(z).
The final of the three error terms in Equation 8, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII], represents the instrument measurement error in the ambient levels; like [x.sub.it] - [[bar]x.sub.t], it is close to the Berkson type. This term would tend to be cancelled out by spatial averaging across multiple unbiased ambient monitors in a region. For example, Kelsall et al. (25) averaged daily total suspended particulate data from up to nine monitors in their analysis of the effects of particles on mortality in Philadelphia. However, in many cities there is only one monitor or a few monitors operating concurrently. Even with a small number of monitors, longer term drift in instruments will not substantially affect estimates of [[Beta].sub.x] because the time--series models control for such trends by inclusion of s(t) in Equation 6. For this final error term to cause substantial bias in [[Beta].sub.z], the error [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] must be strongly correlated with [z.sub.t] at shorter time scales. Further investigations of this correlation in cities with many monitors are warranted.
We have discussed three components of measurement error: a) an individual's deviation from the risk-weighted average personal exposure; b) the difference between the average personal exposure and the true ambient level; and c) the difference between the measured and the true ambient levels, which includes spatial variation and instrument error. We argue that the first and third components are of the Berkson type and therefore are likely to have smaller effects on the relative risk estimates. However, the second component can be a source of substantial bias if, for example, there are short-term associations of the contributions of indoor sources with ambient concentrations. We present one simple analysis of the Particle Total Assessment Methodology (PTEAM) data (27) that illustrates how we can further study the effects of the most important second component.
Evaluating Potential Measurement Error Bias in Pollutant-Mortality Relative Risk Estimates
The "Framework for Assessing Measurement Error Effects in Pollution-Mortality Studies" can be used, in combination with data on the components of error, to quantify the consequences of exposure measurement error. One of the few available data sets with ambient and personal measurements will be used to illustrate one approach. We used daily measurements of personal exposure for persons followed in the PTEAM Study (27) to quantify the difference between concentration measured by an ambient monitor and the average of personal exposures. We studied one approach for estimating the size of bias in estimated [PM.sub.10]-mortality regression coefficients [[Beta].sub.z] as an estimate of the true relative risk for personal exposure [[Beta].sub.x] with data from one or a few ambient monitors rather than personal exposure data for [PM.sub.10].
Data from the PTEAM Study. The PTEAM Study (27,28) generated a daily measurement of personal exposure to [PM.sub.10] for a sample of 178 nonsmoking residents of Riverside, California, 10 years of age or older for the period 22 September through 9 November 1990. In addition, a daily average [PM.sub.10] value from an ambient monitor positioned near the homes was also collected; Pellizzari and Spengler (29) provided details on the methods used to collect these data.
We used the PTEAM Study data to estimate the correlation between the daily [PM.sub.10] concentration for the ambient monitor [z.sub.t] and the difference between the average personal exposure and concentration measured by the ambient monitor [[bar]x.sub.t] - [z.sub.t]. These estimates correctly account for the varying number of observations on a given day; however, the average personal exposure value is based on relatively few measurements and is therefore more variable across time than the actual mean exposure. Equation 8 includes a weighted average of personal exposures, with weights determined by the baseline personal risk for each individual. Those weights were unavailable in the PTEAM Study and hence, we used an unweighted average. Figure 2 displays a time-series plot of the daily ambient values and the average personal exposures. The correlation across time of these two series is estimated to be 0.58 [95% confidence interval (CI), 0.35-0.74]. This correlation is much greater than the more widely cited cross-sectional correlation from the same study. It would likely be even greater if the mean personal exposure was calculated on a larger number of persons each day. The corresponding correlation across time between the ambient monitor concentrations and the daily differences between the personal and ambient values is -0.63 (CI, -0.77 to -0.42). Hence, the hypothesis that the measurement error [[bar]x.sub.t] - [z.sub.t] is uncorrelated with [z.sub.t] is not consistent with the PTEAM Study data. Some bias in the regression coefficient is therefore expected. Because the correlation of [[bar]x.sub.t] - [z.sub.t] and [z.sub.t] is negative, the coefficient [[Beta].sub.z] in the regression on [z.sub.t] will tend to underestimate the co-efficient in the regression on [[bar]x.sub.t] in a single-pollutant analysis. We now assess the size of the bias that will result from this measurement error.
[Figure 2 ILLUSTRATION OMITTED]
Addressing the Bias in [PM.sub.10]-Mortality Regression Coefficients
The PTEAM Study results or other, perhaps more appropriate, data sets on the difference between average risk-weighted personal exposure and ambient monitor concentrations, can be used to estimate bias in the results of log-linear regression models.
If available, we would have used the average personal exposure series, [[bar]x.sub.t], for at-risk residents of each city in the standard log-linear regression model rather than [z.sub.t] as was used in the original analyses. We would then have compared the regression coefficients obtained when [[bar]x.sub.t] is the predictor with those using [z.sub.t] to assess the bias.
Obviously, [[bar]x.sub.t] is not available except in special circumstances. However, from the PTEAM Study data (shown in Figure 2) or similar data, we can estimate the relationship of [[bar]x.sub.t] and [z.sub.t], for example, by assuming:
[9] [[bar]x.sub.t] = [[Theta].sub.0] + [[Theta].sub.1][z.sub.t] + [[Epsilon].sub.t]
where [[Theta].sub.0] and [[Theta].sub.1] are the intercept and slope to be estimated from the available data. We can then use the fitted Equation 9 to predict the unobserved [[bar]x.sub.t] from the available [z.sub.t] and then use the predicted value [[bar]x.sub.t] as the desired exposure values when estimating the pollution--mortality relative risk [[Beta].sub.x]. In fact the estimate of [[Beta].sub.x] has the simple form [[Beta].sub.x] = [[Beta].sub.z]/[[Theta].sub.1]. This well-known approach to adjust for exposure measurement error is called regression calibration (7). As an illustration, we applied this strategy to a regression of daily mortality on ambient concentrations of [PM.sub.10] for Riverside, California, for the period 1987-1994. We estimated [[Theta].sub.0] = 59.95 (SE = 7.21), [[Theta].sub.1] = 0.60 (SE = 0.080), and var([Epsilon]) = 22.4.
Calibration is easy to implement and apply. Its limitations are that confidence intervals for [[Beta].sub.x] depend on large sample theory and that it does not extend easily to situations where multiple sources of information about the [[bar]x.sub.t], [z.sub.t] relationship are available.
It is simple to overcome the possible limitations of calibration by using a simulated value [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] rather than the predicted value [[bar]x.sub.t] from Equation 9. That is, we use Equation 9 to simulate the average personal exposure, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII], from the ambient exposure, [z.sub.t], for a city or period of interest when [[bar]x.sub.t] is not available, under the assumption that the estimated [[Theta].sub.s] and var([Epsilon]) are applicable. This simulated series [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] is then used instead of [z.sub.t] in the log-linear regression. The result is one estimate of [[Beta].sub.x]--call it [[Beta].sub.x]. If we then repeatedly simulate [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] and fit the log-linear regression for each to obtain [[Beta].sub.x], we obtain a distribution of [[Beta].sub.x]s. The difference between the mean of the simulated [[Beta].sub.x]s and the [[Beta].sub.z] derived from the log-linear regression of mortality on [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] is a measure of the bias resulting from having [z.sub.t] rather than the average personal exposure for that city. By simulating [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] rather than using a fixed predicted value [[bar]x.sub.t], we properly account for nonlinearities and sources of variation in [[Beta].sub.x] and can extend the analysis to more complicated situations.
Figure 3 shows the distribution of the [[Beta].sub.x]s for Riverside (solid curve). Also shown is the normal approximation of the likelihood function for the coefficient [[Beta].sub.z] from the log-linear regression of mortality directly on [z.sub.t] (dotted curve). Solid and dotted lines are at the centers of these distributions. We find that the [[Beta].sub.x]s have a mean 1.42% increase in mortality (CI, -0.11-2.95) per 10-unit change in [PM.sub.10]. In comparison, the estimate of [[Beta].sub.z] from the usual log-linear model (dashed vertical line) is [[Beta].sub.z] = 0.84% (CI, -0.06-1.76). Hence, measurement error has biased the result toward the null. Second, the distribution of the [[Beta].sub.x]s is more dispersed than the distribution of [[Beta].sub.z]. This is because we have taken into account the variability due to having [z.sub.t] not [[bar]x.sub.t], i.e., arising from var([[Epsilon].sub.t]) in Equation 9. The results are very similar to what we obtain from calibration.
[Figure 3 ILLUSTRATION OMITTED]
This calculation assumes that the estimated relationship between [x.sub.t] and [z.sub.t] for the PTEAM Study is the true one, and hence, we ignored a second component of uncertainty due to estimation of the relationship between [[bar]x.sub.t] and [[bar]z.sub.t] from the finite sample size of the PTEAM Study data taken at one site and at a particular time period. That is, even if we assume that the relationship between [[bar]x.sub.t] and [z.sub.t] is known, estimating the association of mortality with [[bar]x.sub.t] is less precise than with [z.sub.t] given only [z.sub.t] in that particular city. Of course, the relationship of [[bar]x.sub.t] and [z.sub.t] is not precisely known and needs to be quantified further. Dominici et al. (30) provided a more complete analysis of the bias in [[Beta].sub.z] as an estimate of [[Beta].sub.x] using the PTEAM Study and four other data sets and a more complete statistical model. Their findings were qualitatively similar to those presented here. Finally, our assessment of bias assumed that the health effects of personal exposure to particles originating outdoors and indoors are the same. To assume otherwise would require substantially more detailed data and modeling.
Summary and Research Recommendations
The differences between true personal exposure for every individual ([x.sub.it]) and measured ambient concentrations, averaged over a few fixed imprecise monitors ([z.sub.t]), are inherently complex, as is the effect of this exposure measurement error on estimates of pollution-mortality relative risks. Nonetheless, it is useful and imperative to analyze these effects in light of our current understanding of the measurement process. This paper presented one framework for doing so. We distinguished two extremes of a continuum of types of measurement errors: Berkson and classical errors. The former is likely to create little bias in mortality-relative risk estimates; the latter has more serious consequences.
We posited a relative risk model in which an individual's hazard of death on a given day is expressed as a function of his or her personal exposure, which is decomposed to highlight three types of exposure errors. We then aggregated the model to produce the model for the expected total deaths in a population used in most time-series analyses. This model showed that a risk-weighted average personal exposure measure is the desired exposure measure. The likely consequence of using ambient concentrations instead is to underestimate the pollution effects. In contrast, differences between individual exposures on a given day and the risk-weighted average of personal exposures are examples of Berkson error and are not likely to cause substantial bias in coefficients from time-series morbidity studies. Our analysis suggested that the largest biases in inferences about the mortality-personal exposure relative risk will occur because of the more complex errors between ambient and average personal exposure measures. If indoor sources produce particles of similar composition and toxicity as outdoor source particles, indoor sources may be a major component of this error. Finally, we used the best available data (from the PTEAM Study in Riverside) with both personal exposure and ambient time-series data to quantify the size of this error. Our analysis indicated that the coefficient obtained from regressing mortality on measured ambient levels ([z.sub.t]) is smaller than what we expect if we regress mortality on average personal exposure ([[bar]x.sub.t]).
For tractability and clarity, we conducted a first-order analysis of exposure errors and ignored possible second- and higher order effects in which daily fluctuations in the variance of personal exposures across a population or in the covariations among the measurement errors could introduce additional biases. Second-order terms will be insignificant in studies of particulate effects on mortality where the first-order terms are on the order of percent. Such higher order analyses for other studies of, for example, morbidity, are beyond the scope of this paper and will require substantially more detailed models and data. It is, however, possible that higher order effects are important; further investigation is necessary.
Epidemiologic research is necessarily limited by the quality of the health outcome and risk factor measurements (31). Time-series studies of the acute effects of air quality on mortality are subject to the limitations posed by the available measurements of pollution levels. The generic criticism--that measurement errors render the results of such time-series models uninterpretable--is inaccurate. The consequences of measurement error can be quantified, although only a few informative data sets are presently available. Differences between the average personal exposure and ambient measurements are the most likely source of substantial bias. Data should be collected for the comparison of risk-weighted average personal exposure with ambient levels in several cities with varying degrees of spatial heterogeneity in ambient levels, population composition, and indoor pollution sources. Given such data, models like those summarized by Dominici et al. (32) can be used to quantify more precisely the biases due to pollutant measurement errors.
This paper focuses on the effects on relative risk estimates of using [z.sub.t] (measured ambient particle levels) rather than [x.sub.it] (actual personal exposures in log-linear regressions). Such effects are important from a scientific perspective to quantify the health risks of exposure to particulate pollution. From a regulatory perspective, the effect of having the imprecise [z.sub.t] rather than the true ambient value [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] may be of greater interest because it is ambient levels that may or may not be regulated further. A more detailed error analysis of the [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] difference would investigate the spatial variation in particulate levels and how the number of monitors used to calculate [z.sub.t] reduced this source of measurement error.
In measurement errors in a single pollutant measure, [PM.sub.10], simultaneous errors in several pollutants can complicate the analysis. However, qualitative biases--that is, changes in the sign of a coefficient--can occur only when the measurement errors for different pollutants are highly correlated with one another. This level of correlation might arise if two or more pollutants are measured by the same instrument (e.g., different fractions of particulate matter) or if multiple instruments are housed in the same location, which is subject to atypical exposure patterns. The possibility nevertheless requires detailed investigation, because in this case the findings of epidemiologic studies could be misleading. Personal exposure studies that collect multiple exposures can provide the necessary data to investigate the effects of co-occurring errors using straightforward extensions of the approaches in "Framework for Assessing Measurement Error Effects in Pollution-Mortality Studies" and "Evaluating Potential Measurement Error Bias in Pollutant-Mortality Relative Risk Estimates."
We considered the effects of exposure measurement error on regression coefficients from log-linear models in which serial correlation is accounted for using flexible smoothing splines. An alternate analytic strategy is to fit a linear regression with time-series errors [ARIMA model (33)]. In certain specific time-series models, the degree of attentuation due to classical error might be reduced became to account for the autocorrelated errors, the ARIMA filters or smooths both the responses and the predictors that might reduce the degree of measurement error. Further research on this possibility is warranted.
The measurement error framework and the illustrative calculations discussed here make apparent several open questions and opportunities for additional data collection. These opportunities would enable more accurate quantification of the effects of measurement error in assessing the air pollution-mortality relationship. In relation to single-pollutant models, the two most important questions are a) Is the average personal exposure to pollutants from indoor sources correlated over time with ambient levels? and b) Does the difference between baseline risk-weighted average exposure and population average exposure vary slowly over time?
For models with multiple pollutants, the additional key question follows: How do the components of error identified in Equation 5 covary across pollutants? For example, how do the differences between actual ambient levels and the measured levels correlate across the different pollutants and how do these differences depend on the true values of other pollutants or covariates?
Wilson and Suh (26) conducted a meta-analysis of data from multiple sites and concluded, in answer to the first question above, that concentrations of fine particles originating from indoor sources are independent of ambient levels over time. To confirm this finding and to address the remaining key questions, additional research is warranted. A stratified sample of the population in several cities with diverse pollution sources and patterns should be drawn, with one stratum representing the entire population and the second representing the frail subgroup. Daily measurements of personal exposure and indicators of indoor sources should be collected for multiple pollutants for each person. Ambient levels should also be monitored. Decisions about the number of persons within each subgroup and the number of days of monitoring for each person should be made based on preliminary analyses of data from one city.
REFERENCES AND NOTES
(1.) Armstrong BK, Saracci R, White E. Principles of Exposure Measurement in Epidemiology. New York:Oxford University Press, 1992.
(2.) Thomas D, Stram D, Dwyer J. Exposure measurement error: influence on exposure-disease relationships and methods of correction. Annu Rev Public Health 14:69-93 (1983).
(3.) National Research Council, Committee on Advances in Assessing Human Exposure to Airborne Pollutants. Human Exposure Assessment for Airborne Pollutants: Advances and Opportunities. Washington, DC:National Academy Press, 1991.
(4.) Ferris BG Jr, Speizer FE, Spengler JD, Dockery OW, Bishop YM, Wolfson M, Humble C. Effects of sulfur oxides and respirable particles on human health: methodology and demography of populations in study. Am Rev Respir Dis 120:767-779 (1979).
(5.) Dockery DW, Pope C III, Xu X, Spengler JD, Ware JH, Fay ME, Ferris BG Jr, Speizer FE. An association between air pollution end mortality in six U.S. cities. N Engl J Med 329:1753-1759 (1993).
(6.) Fuller WA. Measurement Error Models. New York:John Wiley & Sons, 1987.
(7.) Carroll RJ, Ruppert D, Stefanski LA. Measurement Error in Nonlinear Models. London:Chapman and Hall, 1995.
(8.) Spiegelmen D, McDermott A, Rossner B. Regression calibration method for correcting measurement-error bias in nutritional epidemiology. Am J Clin Nutr 65:1179S-1186S (1997).
(9.) Willett W. Correction for the effects of measurement error. In: Nutritional Epidemiology (Willett W, ed). New York:Oxford University Press, 1998;301-320.
(10.) Pierce DA, Stram DO, Vaeth M. Allowing for random errors in radiation dose estimates for the atomic bomb survivor data. Radiat Res 123:275-284 (1990).
(11.) Shy CM, Kleinbaum DG, Morgenstern H. The effect of misclassification of exposure status in epidemiological studies of air pollution health effects. Bull N Y Aced Med 54:1155-1165(1978).
(12.) Goldstein IF, Landovitz L. Analysis of air pollution patterns in New York City. I: Can one station represent the large metropolitan area? Atmos Environ 11:47-52 (1977).
(13.) Goldstein IF, Landovitz L Analysis of air pollution patterns in New York City. II: Can one aerometric station represent the area surrounding it? Atmos Environ 11:53-57 (1977).
(14.) Navidi W, Thomas D, Stram D, Peters J. Design end analysis of multilevel analytic studies with applications to a study of air pollution. Environ Health Perspect 102(suppl 8):25-32 (1984).
(15.) National Research Council, Commission on Life Sciences, Board on Toxicology and Environmental Health Hazards, Committee on the Epidemiology of Air Pollutants. Epidemiology and Air Pollution. Washington, DC:National Academy Press, 1985.
(16.) Dockery DW, Pope CA III. Acute respiratory effects of particulate air pollution. Annu Rev Public Health 15:107-132 (1984).
(17.) Lipfert FW, Wyzga RE. Air pollution and mortality: the implications of uncertainties in regression modeling and exposure measurement J Air Waste Manag Assoc 47:517-523 (1987).
(18.) Lipfert F. Clean air skepticism. Science 278:19-20 (1997).
(19.) Lipfert FW. Air pollution and human health: perspectives for the '90s and beyond. Risk Anal 17:137-146 (1997).
(20.) Schwartz J, Dockery DW, Neas LM. Is daily mortality associated specifically with fine particles? J Air Waste Manag Assoc 46:927-939 (1996).
(21.) Wallace L. Indoor particles: a review. J Air Waste Manag Assoc 46:98-126 (1986).
(22.) Janssen NA, Hoek G, Brunekreef B, Harssema H, Mensink I, Zuidhof A. Personal sampling of particles in adults: relation among personal, indoor, end outdoor air concentrations. Am J Epidemiol 147:537-547 (1998).
(23.) Snedecor GW, Cochran WG. Statistical Methods. Ames, IA:Iowa State University Press, 1980.
(24.) Carroll RJ, Spiegelman CH, Lan KKG, Bailey KT, Abbott RD. On errors in variables for binary regression models. Biometrika 71:19-25 (1984).
(25.) Kelsell JE, Semet JM, Zeger SL, Xu J. Air pollution and mortality in Philadelphia, 1974-1988. Am J Epidemiol 146:750-762 (1997).
(26.) Wilson WE, Suh HH. Fine particles and coarse particles: concentration relationships relevant to epidemiologic studies. J Air Waste Manag Assoc 47:1238-1249 (1987).
(27.) Ozkaynak H, Xue J, Spengler J, Wallace L, Pellizzari E, Jenkins P. Personal exposure to airborne particles and metals: results from the Particle TEAM Study in Riverside, California. J Expos Anal Environ Epidemiol 6:57-78 (1996).
(28.) Mendelsohn R, Orcutt G. An empirical analysis of air pollution dose-response curves. J Environ Econ Manag 6:85-106 (1979).
(29.) Pellizzari E, Spengler J. Particle Total Exposure Assessment Methodology (PTEAM): Pilot Study, Volume II: Protocols for Environmental Sampling end Analysis. Work Plan for EPA Contract No. 68-02-4544, EPA Work Assignment 67, CARB Agreement No. A833-060. Research Triangle Park, NC:U.S. Environmental Protection Agency, 1990.
(30.) Dominici F, Semet J, Xu J, Zeger S. Combining evidence on air pollution and daily mortality from the largest 20 U.S. cities: e hierarchical modeling strategy. J R Stat Soc Ser C (1989).
(31.) Vedal S. Ambient particles and health: lines that divide. J Air Waste Manag Assoc 47:551-581 (1987).
(32.) Dominici F, Zeger S, Samet J. A Measurement Error Correction Model for Time-Series Studies of Air Pollution and Mortality. Technical Report. Baltimore, MD:Johns Hopkins University, 1999.
(33.) Fuller WA. Introduction to statistical time series. New York:Wiley & Sons, 1996.
Appendix
We start with a personal risk model (Equation 5) and a decomposition of the exposure error (Equation 7) to obtain
[A.1] [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII].
Now
[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]
where [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] is the average baseline risk across the population on day t, [n.sub.t] is population size on day t, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] is the baseline risk-weighted average exposure. We can now rewrite
[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII],
which, when substituted back into Equation A.1, gives Equation 8.
Address correspondence to S.L. Zeger, Johns Hopkins University, School of Public Health, 615 North Wolfe Street, Baltimore, MD 21205 USA. Telephone: (410) 955-3067. Fax: (410) 955-0958. E-mail: szeger@jhsph.edu
Research described in this article was conducted under contract to the Health Effects Institute (HEI), an organization funded jointly by the U.S. EPA (EPA R824835) and automotive manufacturers. Funding was also provided by the Johns Hopkins Center in Urban Environmental Health (5P30 ESO 3819-12).
The contents of this article do not necessarily reflect the views and policies of HEI, the EPA, or automotive manufacturers.
Received 1 July 1999; accepted 16 November 1999.
Scott L. Zeger,(1) Duncan Thomas,(2) Francesca Dominici,(1) Jonathan M. Samet,(1) Joel Schwartz,(3) Douglas Dockery,(3) and Aaron Cohen(4)
(1) Johns Hopkins University, School of Public Health, Baltimore, Maryland, USA
(2) Department of Preventive Medicine, University of Southern California School of Medicine, Los Angeles, California, USA
(3) Harvard University, Boston, Massachusetts, USA; (4) Health Effects Institute, Cambridge, Massachusetts, USA
COPYRIGHT 2000 National Institute of Environmental Health Sciences
COPYRIGHT 2004 Gale Group