
Article information

  • Title: Detecting differential item functioning using generalized logistic regression in the context of large-scale assessments
  • Authors: Dubravka Svetina; Leslie Rutkowski
  • Journal: Large-scale Assessments in Education
  • Electronic ISSN: 2196-0739
  • Publication year: 2014
  • Volume: 2
  • Issue: 1
  • Pages: 1-17
  • DOI:10.1186/s40536-014-0004-5
  • Abstract: Background: When comparing student performance across countries or cultures, score comparability is essential. That is, the latent variable (i.e., the construct of interest) must be understood and measured equivalently across all participating groups or countries if inferences about performance are to be regarded as valid. Relatively few studies have examined an item-level approach to measurement equivalence, particularly in settings with a large number of groups. Methods: This simulation study examines differential item functioning (DIF) in the context of international large-scale assessment (ILSA) using a generalized logistic regression approach. Manipulated factors included the number of groups (10 or 20), the magnitude of DIF, the percent of DIF items, the nature of DIF, and the percent of groups affected by DIF. Results: The number of groups did not affect the performance of the method (high power and low Type I error rates); however, other factors affected its accuracy. Specifically, Type I error rates were inflated in non-DIF conditions, whereas they were very conservative in all DIF conditions. Power was generally high, particularly when DIF magnitude was large, with one exception: conditions in which DIF was introduced in the difficulty parameters and the percent of DIF items was 60. Conclusions: The findings present a mixed picture of the performance of the generalized logistic regression method with a large number of groups and large sample sizes. In the presence of DIF, the method successfully distinguished DIF from non-DIF items, as evidenced by low Type I error rates and high power. In the absence of DIF, however, the method yielded inflated Type I error rates.
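
The abstract evaluates the method by its Type I error rate and power. The sketch below shows one common way these two rates could be computed from per-item detection flags in a single simulated replication. It is an illustration only, not the authors' code: the function name `detection_rates` and the example flags are hypothetical, and the definitions assumed are Type I error as the share of non-DIF items flagged and power as the share of DIF items flagged.

```python
def detection_rates(flagged, has_dif):
    """Return (type_i_error_rate, power) for one replication.

    flagged : list of bool, True where the DIF test flagged the item
    has_dif : list of bool, True where DIF was actually simulated
    Both names are hypothetical; they are not taken from the article.
    """
    if len(flagged) != len(has_dif):
        raise ValueError("flagged and has_dif must have the same length")

    # Flagged items that truly had no DIF (false positives)
    false_pos = sum(f and not d for f, d in zip(flagged, has_dif))
    # Flagged items that truly had DIF (correct detections)
    true_pos = sum(f and d for f, d in zip(flagged, has_dif))

    n_dif = sum(has_dif)
    n_clean = len(has_dif) - n_dif

    # A rate is undefined (NaN) when its denominator is zero,
    # e.g. power in a condition with no DIF items at all.
    type_i = false_pos / n_clean if n_clean else float("nan")
    power = true_pos / n_dif if n_dif else float("nan")
    return type_i, power


# Made-up flags for six items, two of which carry simulated DIF
has_dif = [True, True, False, False, False, False]
flagged = [True, False, False, True, False, False]
type_i, power = detection_rates(flagged, has_dif)
print(type_i, power)
```

In a non-DIF condition every entry of `has_dif` is False, so only the Type I error rate is defined. This matches the abstract's separate treatment of non-DIF conditions.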