Journal: International Electronic Journal of Elementary Education
Print ISSN: 1307-9298
Publication year: 2019
Volume: 11
Issue: 5
Pages: 539-545
DOI: 10.26822/iejee.2019553350
Publisher: International Electronic Journal of Elementary Education
Abstract: Open-ended and multiple-choice questions are commonly placed on the same tests; however, the effects of using different
item types on test and item statistics are still debated. This study compares model and item fit statistics in a mixed-format test in which
multiple-choice and constructed-response items are used together. In this 25-item fourth-grade science test, administered to 2351 students in
35 schools in Turkey, items are calibrated separately and concurrently using different IRT models. An important aspect of this study is that the
effect of the calibration method on model and item fit is investigated on real data. First, the 1-, 2-, and 3-Parameter Logistic models are used
to calibrate the binary-coded items, while the Graded Response Model and the Generalized Partial Credit Model are used to calibrate the
open-ended ones. Then, combinations of dichotomous and polytomous models are applied concurrently. The model comparisons
showed that the combination of the 3PL and the Graded Response Model produced the best fit statistics.
Keywords: Item Response Theory; Model Comparison; Mixed Format Tests; Item Fit