Article information

  • Title: Improving test security and efficiency of computerized adaptive testing for the Force Concept Inventory
  • Authors: Jun-ichiro Yasuda; Michael M. Hull; Naohiro Mae
  • Journal: Physical Review Physics Education Research
  • Electronic ISSN: 2469-9896
  • Publication year: 2022
  • Volume: 18
  • Issue: 1
  • Pages: 010112
  • DOI: 10.1103/PhysRevPhysEducRes.18.010112
  • Language: English
  • Publisher: American Physical Society
  • Abstract: This paper presents improvements made to a computerized adaptive testing (CAT)-based version of the FCI (FCI-CAT) with regard to test security and test efficiency. First, we will discuss measures to enhance test security by controlling for item overexposure, decreasing the risk that respondents may (i) memorize the content of a pretest for use on the post-test or (ii) share information about the items with their classmates who take the assessment later. Second, we will discuss measures to enhance test efficiency, so that a shorter test length can yield a desired accuracy and precision of the measurement. Specifically, we utilized collateral information in the form of a pretest proficiency estimate of each respondent for selecting items and estimating respondent proficiency level in the post-test. To shorten the total testing time further, we also allowed the test lengths to be different for the pre- and post-test. To analyze how these improvements affect the accuracy and precision (which we measure in terms of root-mean-square error) of Cohen’s d, we conducted a Monte Carlo simulation and a post hoc simulation. Then, we calculated the minimal test length of the FCI-CAT whose accuracy and precision are equivalent to that of the paper-and-pencil version of the FCI. Consequently, we obtained the following three findings: (i) By using collateral information, we can achieve the accuracy and precision of the full-length FCI with fewer items via the FCI-CAT. (ii) For a class size of 40, we can control for test security while still reducing the sum of the pre- and post-test lengths of the FCI-CAT to a total of 33 items (17 items on the pretest and 16 items on the post-test), thereby reducing the testing time to 55%. (iii) If one’s goal is to maximize test efficiency, the pretest length should be slightly larger than the post-test length. On the other hand, if the goal is to maximize test security, the pretest length should be smaller and the post-test length should be larger. If one desires a balance of these two goals, it would be reasonable to choose equal pre- and post-test lengths.
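
The abstract's central quantity, the root-mean-square error (RMSE) of Cohen's d estimated by Monte Carlo simulation, can be illustrated with a minimal sketch. This is not the authors' simulation: it does not model item selection, item exposure control, or collateral information. It assumes normally distributed true proficiencies and replaces the adaptive test with a single hypothetical parameter, `noise_sd`, standing in for the measurement error of a proficiency estimate (a shorter test would correspond to a larger value). The function names and all parameter values are illustrative assumptions. The last lines check the test-length arithmetic, assuming that the 55% figure compares 33 items with two administrations of the 30-item paper-and-pencil FCI.

```python
import math
import random
import statistics


def cohens_d(pre, post):
    """Cohen's d between two score lists, using the pooled standard deviation."""
    n1, n2 = len(pre), len(post)
    v1, v2 = statistics.variance(pre), statistics.variance(post)
    pooled_sd = math.sqrt(((n1 - 1) * v1 + (n2 - 1) * v2) / (n1 + n2 - 2))
    return (statistics.mean(post) - statistics.mean(pre)) / pooled_sd


def rmse_of_d(true_gain, noise_sd, class_size=40, replications=500, seed=0):
    """Monte Carlo RMSE of Cohen's d under a toy measurement model.

    Assumption (not from the paper): each proficiency estimate equals the
    true proficiency plus Gaussian noise with standard deviation noise_sd.
    """
    rng = random.Random(seed)
    squared_errors = []
    for _ in range(replications):
        # True pre- and post-test proficiencies for one simulated class.
        true_pre = [rng.gauss(0.0, 1.0) for _ in range(class_size)]
        true_post = [t + true_gain + rng.gauss(0.0, 0.5) for t in true_pre]
        # Noisy estimates, as a shortened test would produce.
        est_pre = [t + rng.gauss(0.0, noise_sd) for t in true_pre]
        est_post = [t + rng.gauss(0.0, noise_sd) for t in true_post]
        error = cohens_d(est_pre, est_post) - cohens_d(true_pre, true_post)
        squared_errors.append(error ** 2)
    return math.sqrt(statistics.mean(squared_errors))


# Larger measurement noise (a shorter test) gives a larger RMSE of d.
rmse_low_noise = rmse_of_d(true_gain=0.8, noise_sd=0.3)
rmse_high_noise = rmse_of_d(true_gain=0.8, noise_sd=0.6)

# Test-length arithmetic: 17 + 16 items against two 30-item administrations.
length_ratio = (17 + 16) / (2 * 30)
```

In this toy model the RMSE of d grows as `noise_sd` grows, which mirrors the trade-off the abstract describes between shortening the test and preserving the accuracy and precision of the full-length FCI.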