Calibration for Instrument Argumentation Skills on the Subject of Fluid Statics Using Item Response Theory

Bibliographic Details
Published in	Journal of Physics: Conference Series, Vol. 1842, no. 1, p. 012032
Main Authors	Affandy, Harry; Arya Nugraha, Dewanta; Nourma Pratiwi, Scundy; Cari, C.
Format	Journal Article
Language	English
Published	Bristol: IOP Publishing, 01.03.2021
Subjects	Adequacy; Calibration; Colleges & universities; Error analysis; Factor analysis; Goodness of fit; Physicists; Physics; Reliability analysis; Skills; Standard error; Test equipment
Summary:	This study aims to calibrate an instrument for measuring argumentation skills on the subject of Fluid Statics using item response theory (IRT). The instrument blueprint was developed from the aspects and sub-aspects of argumentation skills and comprises 20 items. Scoring follows the Partial Credit Model (PCM) with 4 categories. The subjects were 206 undergraduate students at two different universities. The instrument was validated by expert judgment, with a panel consisting of linguists, physicists, and educational measurement experts. Calibration of the argumentation skills instrument covers (1) item fit to the PCM through goodness-of-fit testing, (2) prerequisite (assumption) testing, (3) reliability, (4) difficulty level, and (5) the Standard Error of Measurement (SEM). The test is valid, with Aiken's V index values in the range 0.93 to 1. The IRT assumptions were checked using two analyses: the sample adequacy test and factor analysis. The chi-square value in the Bartlett test is 2202.67 with df = 190, and a p-value (Sig.) < 0.05 indicates that the sample used meets the unidimensionality requirement. The instrument meets the criteria for polytomous item scoring, based on (a) INFIT MNSQ values in the range 0.70 to 1.30 and (b) difficulty indices in the range -2.00 < b < 2.00. The reliability estimate for the test is 0.86, which places its reliability in the high category. The graphs of the information function and the SEM intersect at ability levels of -2.8 and +1.5, meaning that the argumentation skills test is suitable for test takers in the low to moderate ability range.
ISSN:	1742-6588, 1742-6596
DOI:	10.1088/1742-6596/1842/1/012032
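
Illustration: the Summary refers to PCM scoring in 4 categories and to the relationship between the test information function and the SEM. The following is a minimal Python sketch of those textbook relationships, not the authors' analysis. The function names and all threshold values are hypothetical, chosen only for illustration; the study's estimated item parameters are not given in this record. The sketch assumes the standard PCM with unit discrimination, where item information equals the variance of the item score, and SEM(theta) = 1 / sqrt(test information).

```python
import math


def pcm_probabilities(theta, thresholds):
    """Category probabilities for one item under the Partial Credit Model.

    theta      : ability of the test taker (logits)
    thresholds : step difficulties delta_1..delta_m; an item with
                 4 score categories (0..3) has 3 thresholds
    """
    # Exponent for category k is the sum of (theta - delta_j) for j <= k;
    # category 0 has an empty sum, i.e. 0.
    exponents = [0.0]
    for delta in thresholds:
        exponents.append(exponents[-1] + (theta - delta))
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(exponents)
    weights = [math.exp(e - peak) for e in exponents]
    total = sum(weights)
    return [w / total for w in weights]


def item_information(theta, thresholds):
    """PCM item information: the variance of the item score at theta."""
    probs = pcm_probabilities(theta, thresholds)
    mean = sum(k * p for k, p in enumerate(probs))
    return sum((k - mean) ** 2 * p for k, p in enumerate(probs))


def standard_error(theta, items):
    """SEM at theta: 1 / sqrt(sum of item informations)."""
    info = sum(item_information(theta, t) for t in items)
    return 1.0 / math.sqrt(info)


# Hypothetical test: 20 items, 3 thresholds each (4 categories).
# These values are made up for illustration only.
demo_items = [[-1.5 + 0.05 * i, -0.5 + 0.05 * i, 0.5 + 0.05 * i]
              for i in range(20)]

if __name__ == "__main__":
    for theta in (-3.0, -1.5, 0.0, 1.5, 3.0):
        print(f"theta={theta:+.1f}  SEM={standard_error(theta, demo_items):.3f}")
```

Because the SEM is the reciprocal square root of the information, it is smallest where the information function peaks. A test is therefore most precise for ability levels near its item thresholds, which is the reasoning behind reading a suitable ability range off the two curves.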