Flexibility at the Price of Volatility: Concurrent Calibration in Multistage Tests in Practice Using a 2PL Model

  • Additional Information
    • Publication Information:
      Frontiers Research Foundation
    • Publication Year:
      2021
    • Collection:
      University of Zurich (UZH): ZORA (Zurich Open Repository and Archive)
    • Abstract:
      Multistage test (MST) designs promise efficient student ability estimates, an indispensable asset for individual diagnostics in high-stakes educational assessments. In high-stakes testing, annually changing test forms are required because publicly known test items impair accurate student ability estimation, and items with poor model fit must be continually replaced to guarantee test quality. This requires a large and continually refreshed item pool as the basis for high-stakes MST. In practice, the calibration of newly developed items to feed annually changing tests is highly resource intensive. Piloting based on a representative sample of students is often not feasible, given that, for schools, participation in actual high-stakes assessments already requires considerable organizational effort. Hence, under practical constraints, the calibration of newly developed items may take place on the go in the form of a concurrent calibration in MST designs. Based on a simulation approach, this paper focuses on the performance of Rasch vs. 2PL modeling in retrieving item parameters when items are, for practical reasons, non-optimally placed in multistage tests. Overall, the results suggest that the 2PL model performs worse than the Rasch model in retrieving item parameters when there is non-optimal item assembly in the MST, especially in retrieving parameters at the margins. The higher flexibility of 2PL modeling, where item discrimination is allowed to vary, seems to come at the cost of increased volatility in parameter estimation. Although the overall bias may be modest, single items can be affected by severe biases when using a 2PL model for item calibration in the context of non-optimal item placement.
    • File Description:
      application/pdf
    • ISSN:
      2504-284X
    • Relation:
      https://www.zora.uzh.ch/id/eprint/208420/1/feduc-05-572612.pdf; urn:issn:2504-284X
    • DOI:
      10.5167/uzh-208420
    • DOI:
      10.3389/feduc.2021.679864
    • Rights:
      info:eu-repo/semantics/openAccess ; Creative Commons: Attribution 4.0 International (CC BY 4.0) ; http://creativecommons.org/licenses/by/4.0/
    • Accession Number:
      edsbas.8CC6240B