Two Statistical Tests for the Detection of Item Compromise

Wim J. van der Linden*

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

1 Citation (Scopus)
45 Downloads (Pure)

Abstract

Two independent statistical tests of item compromise are presented, one based on the test takers’ responses and the other on their response times (RTs) on the same items. The tests can be used to monitor an item in real time during online continuous testing but are also applicable as part of post hoc forensic analysis. The two test statistics are simple intuitive quantities as the sum of the responses and RTs observed for the test takers on the item. Common features of the tests are ease of interpretation and computational simplicity. Both tests are uniformly most powerful under the assumption of known ability and speed parameters for the test takers. Examples of power functions for items with realistic parameter values suggest maximum power for 20–30 test takers with item preknowledge for the response-based test and 10–20 test takers for the RT-based test.

Original languageEnglish
Pages (from-to)485-504
Number of pages20
JournalJournal of educational and behavioral statistics
Volume47
Issue number4
DOIs
Publication statusPublished - Aug 2022

Keywords

  • continuous testing
  • fixed-form testing
  • item compromise
  • item response theory
  • lognormal response-time model
  • statistical hypothesis testing
  • UT-Hybrid-D

Fingerprint

Dive into the research topics of 'Two Statistical Tests for the Detection of Item Compromise'. Together they form a unique fingerprint.

Cite this