Exploring the Impact of Rater Effects on Person Fit in Rater‐Mediated Assessments

Bibliographic Details
Published in: Educational Measurement: Issues and Practice, Vol. 39, No. 4, pp. 76-94
Main Author: Wind, Stefanie A.
Format: Journal Article
Language: English
Published: Washington: Wiley (Wiley Subscription Services, Inc.), 01.12.2020

Summary: Researchers have documented the impact of rater effects, or raters' tendencies to give different ratings than would be expected given examinee achievement levels, in performance assessments. However, the degree to which rater effects influence person fit, or the reasonableness of test-takers' achievement estimates given their response patterns, has not been investigated. In rater-mediated assessments, person fit reflects the reasonableness of rater judgments of individual test-takers' achievement over components of the assessment. This study illustrates an approach to visualizing and evaluating person fit in assessments that involve rater judgment using rater-mediated person response functions (rm-PRFs). The rm-PRF approach allows analysts to consider the impact of rater effects on person fit in order to identify individual test-takers for whom the assessment results may not have a straightforward interpretation. A simulation study is used to evaluate the impact of rater effects on person fit. Results indicate that rater effects can compromise the interpretation and use of performance assessment results for individual test-takers. Recommendations are presented that call researchers and practitioners to supplement routine psychometric analyses for performance assessments (e.g., rater reliability checks) with rm-PRFs to identify students whose ratings may have compromised interpretations as a result of rater effects, person misfit, or both.
ISSN: 0731-1745; 1745-3992 (online)
DOI: 10.1111/emip.12354
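
Illustrative sketch: the summary above describes simulating rater effects and examining person fit in rater-mediated assessments. The following Python code is a minimal sketch of that general idea, not the article's rm-PRF procedure. It assumes a rating-scale parameterization of a many-facet Rasch-style model with a rater-severity facet, hypothetical parameter values, a crude mean-squared-residual person-fit index, and a simple lenient-to-severe rater ordering as a stand-in for a person response function.

# Hedged illustrative sketch (not the paper's implementation): simulate ratings
# under a rating-scale formulation of a many-facet Rasch-style model with a
# rater-severity facet, then compute a crude person-fit index from residuals.
# All parameter values (counts, severities, thresholds, etc.) are hypothetical.
import numpy as np

rng = np.random.default_rng(0)

n_examinees, n_raters, n_tasks, max_score = 200, 5, 6, 4

theta = rng.normal(0.0, 1.0, n_examinees)       # examinee achievement
severity = rng.normal(0.0, 0.5, n_raters)       # rater severity (rater effect)
difficulty = rng.normal(0.0, 0.5, n_tasks)      # task difficulty
thresholds = np.array([-1.5, -0.5, 0.5, 1.5])   # rating-scale category thresholds


def category_probs(th, sev, diff):
    """Probabilities of scores 0..max_score for one examinee-rater-task combination."""
    steps = th - sev - diff - thresholds                # one "step" term per threshold
    psi = np.concatenate(([0.0], np.cumsum(steps)))     # cumulative logits, psi_0 = 0
    p = np.exp(psi - psi.max())                         # stabilized softmax
    return p / p.sum()


# Simulate observed and model-expected ratings for every examinee x rater x task.
ratings = np.zeros((n_examinees, n_raters, n_tasks))
expected = np.zeros_like(ratings)
scores = np.arange(max_score + 1)
for i in range(n_examinees):
    for r in range(n_raters):
        for t in range(n_tasks):
            p = category_probs(theta[i], severity[r], difficulty[t])
            ratings[i, r, t] = rng.choice(scores, p=p)
            expected[i, r, t] = scores @ p

# Crude person-fit index: mean squared residual per examinee (larger = worse fit).
person_misfit = ((ratings - expected) ** 2).mean(axis=(1, 2))

# Crude empirical profile for one examinee: mean observed rating across tasks,
# with raters ordered from most lenient to most severe.
order = np.argsort(severity)
profile_examinee_0 = ratings[0, order, :].mean(axis=1)

print("Five examinees with the largest misfit:", np.argsort(person_misfit)[-5:])
print("Examinee 0 mean rating by rater (lenient -> severe):", profile_examinee_0.round(2))

The article's approach examines such rater-ordered profiles with rater-mediated person response functions rather than this simplified residual summary; the sketch only illustrates the kind of data structure and rater-severity effects involved.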