首页    期刊浏览 2024年11月27日 星期三
登录注册

文章基本信息

  • 标题:Insights into the Angoff method: results from a simulation study
  • 本地全文:下载
  • 作者:Boaz Shulruf ; Tim Wilkinson ; Jennifer Weller
  • 期刊名称:BMC Medical Education
  • 印刷版ISSN:1472-6920
  • 出版年度:2016
  • 卷号:16
  • 期号:1
  • 页码:134-143
  • DOI:10.1186/s12909-016-0656-7
  • 出版社:BioMed Central
  • 摘要:Background In standard setting techniques involving panels of judges, the attributes of judges may affect the cut-scores. This simulation study modelled the effect of the number of judges and test items, as well as the impact of judges’ attributes such as accuracy, stringency and influence on others on the precision of the cut-scores. Methods Forty nine combinations of Angoff panels ( N = 5, 10, 15, 20, 30, 50, and 80) and test items ( n = 5, 10, 15, 20, 30, 50, and 80) were simulated. Each combination was simulated 100 times (in total 4,900 simulations). The simulation was of judges attributes: stringency, accuracy and leadership. Impact of judges attributes, number of judges, number of test items and Angoff’s second (compared to the first) round on the precision of a panel’s cut-score was measured by the deviation of the panel’s cut-score from the cut-score’s true value. Results Findings from 4900 simulated panels supported Angoff being both reliable and valid. Unless the number of test items is small, panels of around 15 judges with mixed levels of expertise provide the most precise estimates. Furthermore, if test data were not presented, a second round of decision-making, as used in the modified Angoff, adds little to precision. A panel which has only experts or only non-experts yields a cut-score which is less precise than a cut-score yielded by a mixed-expertise panel, suggesting that optimal composition of an Angoff panel should include a range of judges with diverse expertise and stringency. Conclusions Simulations aim to improve our understanding of the models assessed but they do not describe natural phenomena as they do not use observed data. While the simulations undertaken in this study help clarify how to set cut-scores defensibly, it is essential to confirm these theories in practice.
国家哲学社会科学文献中心版权所有