文章基本信息

标题：HOW TO ASSESS THE RATERS OF HIGH-STAKES TESTS
其他标题：KUIDAS HINNATA SUURE PANUSEGA TESTIDE HINDAJAID
本地全文：下载
作者：Hille Pajupuu
期刊名称：Eesti Rakenduslingvistika Ühingu Aastaraamat
印刷版ISSN：1736-2563
电子版ISSN：2228-0677
出版年度：2007
卷号：3
页码：221-233
语种：English
出版社：Eesti Rakenduslingvistika Ühing (Estonian Association for Applied Linguistics)
摘要：In a situation where,among all tests,the proportion of high-stakes tests is constantly growing,while their results are increasingly used to pass judgements not only on examinees but also on teachers and teaching quality,and,with time,many of those tests have become obligatory,high demands should certainly be set on the sense of responsibility of anyone involved in test development or test use.Special attention should be paid to the quality of subjective ratings as the writing and speaking parts of a test may often account for half of the total score.If a testee should score lower or higher than their competence is worth,it may change their life as well as that of other people.The commonly used simple statistics (calculation of differences between the marks awarded by two raters,inter-rater correlation) may actually fail to take account of quite a lot of wrong credits if the raters are many and inadequately prepared.In order to reduce unfair assessment a method is suggested to identify poorly performing raters and to reassess their results in good time.The method is meant to be used in the case of double marking.Notably,a quality index is computed to show the degree of similarity between the credits given by the rater to be asessed and an expert rater,even if the two have never worked in a pair.It is assumed that the higher the similarity the fairer the credits.The article describes the general principles of the method suggested,pointing out its advantages over some other simple methods used for the same purpose.
其他摘要：Suure panusega testide (ingl high-stakes test) osatähtsuse kasv toob kaasa vajaduse pöörata rohkem tähelepanu testi subjektiiv hinnatavate osade hindamise kvaliteedile.Kasutatavate lihtsate statistiliste meetodite puhul võib jääda märkamata palju valestihindamisi juhul,kui hindajate arv on suur ning nende koolitus nõrk.Artikkel tutvustab meetodit,mille abil saab kindlaks teha valesti hinnanud hindajad ning nende hinnatud tööd õigeaegselt ümber hinnata.Meetod on mõeldud kasutamiseks kahekordse hindamise puhul,seda demonstreeritakse eesti keele algtaseme testi rääkimisosa hindamise näitel.*.
关键词：rater evaluation;rater consistency;rating;double marking;Estonian
其他关键词：hindajate hindamine;hindajate järjekindlus;subjektiivhindamine;kahekordne hindamine;eesti keel