Comparing Performances (Type I error and Power) of IRT Likelihood Ratio SIBTEST and Mantel-Haenszel Methods in the Determination of Differential Item Functioning

Bibliographic Details
Published in	Educational Sciences: Theory & Practice, Vol. 14, no. 6, pp. 2186-2193
Main Authors	Kabasakal, Kübra Atalay; Gök, Bilge; Arsan, Nihan; Kelecioglu, Hülya
Format	Journal Article
Language	English
Published	Istanbul: EDAM (Educational Consultancy Ltd.), 01.01.2014
Subjects	Ability; Comparative Analysis; Differences; Error of Measurement; Foreign Countries; Item Response Theory; Mantel Haenszel Procedure; Sample Size; Simulation; Statistical Analysis; Test Bias; Test Items; Test Length; Turkey
Summary:	This simulation study compared the performance (Type I error and power) of the Mantel-Haenszel (MH), SIBTEST, and item response theory-likelihood ratio (IRT-LR) methods under certain conditions. The manipulated factors were sample size, ability differences between groups, test length, the percentage of items with differential item functioning (DIF), and the underlying model used to generate the data. Results suggest that SIBTEST had the highest Type I error in the detection of uniform DIF, but MH had the highest power under all conditions. In addition, the percentage of DIF and the underlying model appear to have influenced the Type I error rate of IRT-LR. The Type I error rate of SIBTEST was affected by ability differences between groups, test length, the percentage of DIF, the model, and the interactions ability differences × percentage of DIF, ability differences × test length, test length × percentage of DIF, and test length × model. For the MH procedure, the factors affecting the Type I error rate were sample size, test length, the percentage of DIF, ability differences × percentage of DIF, ability differences × model, and ability differences × percentage of DIF × model. No factor had a significant effect on the power of SIBTEST or MH, but the underlying model had a significant effect on the power of IRT-LR.
ISSN:	2148-7561; 1303-0485
DOI:	10.12738/estp.2014.6.2165
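
Illustration (not part of the record): the summary refers to Type I error and power of the MH procedure without showing the statistic itself. The following is a minimal sketch of the textbook Mantel-Haenszel common odds ratio and continuity-corrected chi-square for a single item, matching examinees on total score. It is not the authors' code; the function name mantel_haenszel_dif, its arguments, and the group coding (0 = reference, 1 = focal) are assumptions made for this example.

```python
import numpy as np

def mantel_haenszel_dif(item, total, group):
    """Hypothetical helper: MH odds ratio and chi-square for one item.

    item  : 0/1 responses to the studied item
    total : matching variable (e.g. total test score)
    group : 0 = reference group, 1 = focal group (assumed coding)
    """
    item, total, group = map(np.asarray, (item, total, group))
    num = den = 0.0                    # pieces of the common odds ratio
    a_sum = e_sum = v_sum = 0.0        # observed, expected, variance of A
    for k in np.unique(total):         # one 2x2 table per score level
        s = total == k
        a = float(np.sum((group[s] == 0) & (item[s] == 1)))  # reference, correct
        b = float(np.sum((group[s] == 0) & (item[s] == 0)))  # reference, incorrect
        c = float(np.sum((group[s] == 1) & (item[s] == 1)))  # focal, correct
        d = float(np.sum((group[s] == 1) & (item[s] == 0)))  # focal, incorrect
        n = a + b + c + d
        if n < 2 or (a + b) == 0 or (c + d) == 0:
            continue                   # stratum carries no comparison
        num += a * d / n
        den += b * c / n
        a_sum += a
        e_sum += (a + b) * (a + c) / n
        v_sum += (a + b) * (c + d) * (a + c) * (b + d) / (n ** 2 * (n - 1))
    alpha = num / den if den > 0 else float("nan")
    chi2 = (abs(a_sum - e_sum) - 0.5) ** 2 / v_sum if v_sum > 0 else float("nan")
    return alpha, chi2
```

In a simulation of the kind the summary describes, an item would typically be flagged when chi2 exceeds the 0.05 critical value of the chi-square distribution with one degree of freedom (about 3.84). The proportion of flagged DIF-free items across replications then estimates the Type I error rate, and the proportion of flagged DIF items estimates power.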