Reliability Issues in High-Stakes Educational Tests

  • Cees A.W. Glas*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingChapterAcademicpeer-review

54 Downloads (Pure)

Abstract

High-stakes tests and examinations often give rise to rather specific measurement problems. Though nowadays item response theory (IRT) has become the standard theoretical framework for educational measurement, in practice, number-correct scores are still prominent in the definition of standards and norms. Therefore, in this chapter methods are developed for relating standards on the number-correct scale to standards on the latent IRT scale. Further, this chapter focuses on two related issues. The first issue is estimating the size of standard errors when equating older versions of a test to the current version. The second issue is estimating the local reliability of number-correct scores and the extra error variance introduced through number-correct scoring rather than using IRT proficiency estimates. It is shown that the first issue can be solved in the framework of maximum a posteriori (MAP) estimation, while the second issue can be solved in the framework of expected a posteriori (EAP) estimation. The examples that are given are derived from simulations studies carried out for linking the nation-wide tests at the end of primary education in the Netherlands.

Original languageEnglish
Title of host publicationTheoretical and Practical Advances in Computer-based Educational Measurement
PublisherSpringer
Pages213-230
Number of pages18
DOIs
Publication statusPublished - 2019

Publication series

NameMethodology of Educational Measurement and Assessment
ISSN (Print)2367-170X
ISSN (Electronic)2367-1718

Fingerprint

Dive into the research topics of 'Reliability Issues in High-Stakes Educational Tests'. Together they form a unique fingerprint.

Cite this