Investigation of IRT-Based Equating Methods in the Presence of Outlier Common Items

Common items with inconsistent b-parameter estimates may have a serious impact on item response theory (IRT)—based equating results. To find a better way to deal with the outlier common items with inconsistent b-parameters, the current study investigated the comparability of 10 variations of four IR...

Full description

Saved in:
Bibliographic Details
Published inApplied psychological measurement Vol. 32; no. 4; pp. 311 - 333
Main Authors Huiqin Hu, Rogers, W. Todd, Vukmirovic, Zarko
Format Journal Article
LanguageEnglish
Published Los Angeles, CA SAGE Publications 01.06.2008
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Common items with inconsistent b-parameter estimates may have a serious impact on item response theory (IRT)—based equating results. To find a better way to deal with the outlier common items with inconsistent b-parameters, the current study investigated the comparability of 10 variations of four IRT-based equating methods (i.e., concurrent calibration, separate calibration with test characteristic curve [TCC] and mean/sigma [M/S] transformations, and calibration with fixed common item parameters [FCIP]) when outliers were either ignored or considered. Simulated data were generated for the common-item nonequivalent groups matrix design to reflect the manipulated factors: group ability differences and nonequivalent groups, number/score points of outliers, and types of outliers. When no outliers were present, the TCC and M/S transformations performed the best. When there were outliers, overall, the methods that considered them (except the M/S transformation with outliers weighted) resulted in a vast improvement compared to the methods that ignored them.
Bibliography:ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
ISSN:0146-6216
1552-3497
DOI:10.1177/0146621606292215