Construct validity of an objective assessment method for laparoscopic intracorporeal suturing and knot tying

The ideal objective assessment method for laparoscopic technical skills is difficult to achieve in the operating room. Recent “VR2OR” studies have used a blinded, 2-reviewer error-based video tape analysis for intraoperative performance assessment. This study examines the validity of this methodolog...

Full description

Saved in:
Bibliographic Details
Published inThe American journal of surgery Vol. 196; no. 1; pp. 74 - 80
Main Authors Van Sickle, Kent R., Baghai, Mercedeh, Huang, Ih-Ping, Goldenberg, Adam, Smith, C. Daniel, Ritter, E. Matt
Format Journal Article
LanguageEnglish
Published New York, NY Elsevier Inc 01.07.2008
Elsevier
Elsevier Limited
Subjects
Online AccessGet full text
ISSN0002-9610
1879-1883
1879-1883
DOI10.1016/j.amjsurg.2007.08.058

Cover

Loading…
More Information
Summary:The ideal objective assessment method for laparoscopic technical skills is difficult to achieve in the operating room. Recent “VR2OR” studies have used a blinded, 2-reviewer error-based video tape analysis for intraoperative performance assessment. This study examines the validity of this methodology applied to laparoscopic intracorporeal suturing and knot tying. Four groups of subjects—experts (EX), surgery residents trained to expert criterion levels using simulation (TR), surgery residents receiving no supplemental training (NR), and medical students receiving simulation-based training (MS)—performed the fundal suturing portion of a laparoscopic Nissen fundoplication and were video-recorded for analysis. Two separate groups of surgeon reviewers (K.V.S. + M.B.; I.-P.H. + A.G.) were trained to evaluate laparoscopic suturing and knot tying performance using specific metrics. Subjects' operative performance was assessed by reviewers blinded to their training status and scored using an error-based, step specific scoring system to an inter-rater agreement of 80% or greater. Three primary performance measures were assessed: time, errors, and needle manipulations and comparisons between groups were made using a 1-way analysis of variance (ANOVA) with post-test. A total of 40 fundal sutures (10 in each group) were scored by 2 separate rater groups with inter-rater agreement consistently greater than 80%. Inter-rater agreement was highest with the EX group (91%, range 76%–100%) and lowest with the NR group (85%, range 81%–98%). On average, the EX group significantly outperformed the other groups with regards to time ( P <.0001), errors ( P <.002), and needle manipulations ( P <.01). Performance of the TR group was comparable to the EX group with regards to errors and manipulations ( P = not significant [NS]), and outperformed the NR and MS groups with regards to time ( P <.05 and P <.001). Performance between the NR and MS groups were similar for all 3 measures. This assessment method demonstrates discriminative validity. Time appears to be the most sensitive indicator of skill level, as significant differences between EX, TR, and NR/MS groups were seen. The methodology is transferable across different reviewers and is acceptable for high-stakes assessment.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ObjectType-Article-2
ObjectType-Feature-1
content type line 23
ObjectType-Undefined-3
ISSN:0002-9610
1879-1883
1879-1883
DOI:10.1016/j.amjsurg.2007.08.058