首页    期刊浏览 2024年11月08日 星期五
登录注册

文章基本信息

  • 标题:Correction of Selection Bias in Survey Data: Is the Statistical Cure Worse Than the Bias?
  • 本地全文:下载
  • 作者:James A. Hanley
  • 期刊名称:American journal of public health
  • 印刷版ISSN:0090-0036
  • 出版年度:2017
  • 卷号:107
  • 期号:4
  • 页码:503-505
  • DOI:10.2105/AJPH.2016.303644
  • 语种:English
  • 出版社:American Public Health Association
  • 摘要:In previous articles in the American Journal of Epidemiology ( Am J Epidemiol . 2013;177(5):431–442) and American Journal of Public Health ( Am J Public Health . 2013;103(10):1895–1901), Masters et al. reported age-specific hazard ratios for the contrasts in mortality rates between obesity categories. They corrected the observed hazard ratios for selection bias caused by what they postulated was the nonrepresentativeness of the participants in the National Health Interview Study that increased with age, obesity, and ill health. However, it is possible that their regression approach to remove the alleged bias has not produced, and in general cannot produce, sensible hazard ratio estimates. First, we must consider how many nonparticipants there might have been in each category of obesity and of age at entry and how much higher the mortality rates would have to be in nonparticipants than in participants in these same categories. What plausible set of numerical values would convert the (“biased”) decreasing-with-age hazard ratios seen in the data into the (“unbiased”) increasing-with-age ratios that they computed? Can these values be encapsulated in (and can sensible values be recovered from) one additional internal variable in a regression model? Second, one must examine the age pattern of the hazard ratios that have been adjusted for selection. Without the correction, the hazard ratios are attenuated with increasing age. With it, the hazard ratios at older ages are considerably higher, but those at younger ages are well below one. Third, one must test whether the regression approach suggested by Masters et al. would correct the nonrepresentativeness that increased with age and ill health that I introduced into real and hypothetical data sets. I found that the approach did not recover the hazard ratio patterns present in the unselected data sets: the corrections overshot the target at older ages and undershot it at lower ages. This article was jointly published in the American Journal of Epidemiology ( Am J Epidemiol . 2017;185(6):409–411) and the American Journal of Public Health ( Am J Pub Health . 2017;107(4):503–505). The Editors of the American Journal of Public Health ( AJPH ) and the American Journal of Epidemiology ( AJE ) asked me to comment on hazard ratio calculations in two articles by Masters et al. 1,2 At issue are age-specific hazard ratios for the differences in mortality rates between obesity categories. The hazard ratio estimates in the AJE article 2 were derived using Cox regression models that allowed for different hazard ratios for different age intervals; those in the subsequent AJPH article 1 were obtained using smooth-in-age hazard ratio models. These estimates deserve scrutiny because they were derived from a statistical approach that the authors claimed corrected the observed hazard ratios for selection bias, 3 that is, a distortion caused by what they postulated was the nonrepresentativeness of the participants in the National Health Interview Study (NHIS) that increased with age, obesity, and ill health. In the AJPH article, the authors used these corrected hazard ratios to arrive at population attributable fractions (a secondary topic that is addressed in Appendices 1 and 2, available as supplements to the online version of this article at http://www.ajph.org ). The way Masters et al. arrived at the age-specific hazard ratios was addressed in their response to a letter to the editor in the AJE 4 and was also addressed in an unanswered letter to the editor in the AJPH . 5 Appendix 3 and Figure A (available as supplements to the online version of this article at http://www.ajph.org ) show my simple reanalyses of some of the NHIS data and the age-specific hazard ratios (which were very similar to those of Masters et al.) that I obtained using their correction for selection bias. However, it is possible that the way they removed the alleged bias has not produced, and in general cannot produce, sensible hazard ratio estimates. Masters et al. postulated that the nonrepresentativeness of the NHIS participants, which increased with age and ill health, distorted the age-specific mortality gradients across the body mass index (BMI) categories: Participants who were older at entry and in the higher-risk BMI categories would have been healthier than their same-age counterparts who did not participate. Ideally, to directly and accurately adjust the observed age-specific hazard ratios for this discrepancy, one would (1) add appropriate (specific to age at entry) numbers of nonparticipants to each of the BMI categories, (2) assign them higher mortality rates than those seen in persons in the same category who did participate, and (3) calculate a new (higher) rate for each category. It is difficult to see how estimates of these additional (age-at-entry–specific sets of) quantities implicated in the selection bias can be extracted from the NHIS data simply by using a regression model. Masters et al. claim to have removed the bias merely by adding a centered version of age at entry as an effect modifier in their regression model and then (after fitting) setting it to that central value (see Appendix 3 for details). Epidemiologists often set a known confounder to a typical central value and use a statistical model to standardize the comparison of interest. In the present case, Masters et al. relied on a piece of information that was recorded only for the participants to somehow fill in (impute) some never-estimated number of participants who were selectively missing. This new statistical cure for selection bias appears to be too simple to be true. In this commentary, I focus on three concerns. The first arises when one considers the plausible values to be used in the ideal adjustment described above; this sensitivity analysis would provide bounds for the correct age-specific hazard ratio estimates. For example, might one plausibly double the hazard ratios for those missing from the higher-risk BMI categories? It is unlikely that the majority of the 10% to 15% who chose not to participate in the 19 NHIS waves 6 were in these older and higher-risk categories. Moreover, not all of the missing minority are necessarily at higher risk than those in the same category who did participate. With all of these constraints, can an uncorrected hazard ratio of 1.2 at, for example, 80 years of age realistically become a corrected hazard ratio of 2.4? A second and very concrete concern is the age pattern of the “adjusted-for-selection” hazard ratios shown in Figure 2 of the article by Masters et al. in the AJE 2 and Figure 2 of the article by Masters et al. in the AJPH . 1 Without their correction, the hazard ratios are attenuated with age. With it, the hazard ratios at older ages are considerably higher but those at younger ages are well below one. The media coverage of these articles missed an important implication for persons younger than 50 or even 55 years of age. All other things being equal, should life insurance premiums be lower for those in an obese BMI category than for those in the normal-weight category? The third, and equally worrying, concern stems from two simulations in which the regression approach of Masters et al. did not fix the nonrepresentativeness that increased with age and ill health that I deliberately introduced. I introduced this selection bias into samples from known populations. In one scenario, the correct hazard ratios were higher at older ages, whereas in the other, they were lower at older ages. The simulations and results are described in detail in Appendix 4, and the R code is provided in Appendix 5 (available as supplements to the online version of this article at http://www.ajph.org ). The simulation for hazard ratios that are higher at older ages, shown in black in Figure 1 , is based on a purely mathematical population in which the true hazard ratios for exposed versus not exposed are approximately 1.1 at the lower end of the age category and 2.5 at the upper end. From it, I simulated 19 sample waves in which (increasingly with age at selection) exposed persons who did participate were less likely to die in the next six years than were their sampled exposed peers who did not participate, thereby attenuating the observed hazard ratio curve. Applied to these selective data, the approach proposed by Masters et al. was not able to recover the hazard ratio pattern present in the unselected data. The corrected age-specific hazard ratios overshot the target at older ages and undershot it at lower ages. Open in a separate window FIGURE 1— Attempt to Recover the True Hazard Ratio Pattern After Some Data Have Been Selectively Removed Note. Panel a: In a purely mathematical population, the true hazard ratios for exposed versus not exposed participants are approximately 1.1 at the lower end of the age category and 2.5 at the upper age (black line). From this population, 19 sample waves were simulated in which, increasingly with age at potential selection, exposed persons who did participate were less likely to die in the next six years than were their sampled unexposed peers who did not participate. This attenuated the observed hazard ratio curve (red line). Panel b: The selectivity that increases with age excludes some of the exposed persons who are so ill that they will die within six years. For example, the six red dots beginning at age 65 years (a potential age at entry) indicate what fractions of these frail 65-year-old individuals who die in the next six years were excluded from the survey wave. Applied to these selective data, the approach proposed by Masters et al. (panel a, blue line) was not able to recover the hazard pattern present in the unselected data. The second simulation was based on hazard ratio patterns seen in actual populations, in which the hazard ratio for cohort-type mortality rates in men versus women is typically greater than two at age 60 years, 1.5 at age 80 years, and near one at age 100 years (see Figures B and C and Appendix 4, available as supplements to the online version of this article at http://www.ajph.org ). Using the published cohort mortality rates for Finland, I simulated 19 sample waves in which, increasingly with age at selection, men who did participate were less likely to die in the next five years than were their sampled male peers who did not participate. Again, applied to these selective data, the approach of Masters et al. was not able to recover the “lower-at-older-ages” hazard ratio pattern present in the full data (Figure C and Appendix 4). The greater the selectivity I introduced, the more the corrected age-specific male-to-female hazard ratios overshot the target at older ages and undershot it at lower ages. The Appendices 1, 2, and 4 contains several implicit pleas to authors. I end with two explicit ones: that new ways to directly correct heretofore uncorrectable biases first be tested on simulated data generated by known parameter values and that reported model-based corrections not be so drastic that (as with the life insurance premiums) they seemingly correct the problem at one end of the age scale by creating one at the other end. I also make a plea to editors: instead of asking authors to report what software—and what version—they used to prepare the data and derive the reported results, might they ensure instead that the computer code used is publicly available?
国家哲学社会科学文献中心版权所有