The Surgical Safety Checklist and Teamwork Coaching Tools: a study of inter-rater reliability

Objective To assess the inter-rater reliability (IRR) of two novel observation tools for measuring surgical safety checklist performance and teamwork. Summary background Data surgical safety checklists can promote adherence to standards of care and improve teamwork in the operating room. Their use h...

Full description

Saved in:
Bibliographic Details
Published inBMJ quality & safety Vol. 23; no. 8; pp. 639 - 650
Main Authors Huang, Lyen C, Conley, Dante, Lipsitz, Stu, Wright, Christopher C, Diller, Thomas W, Edmondson, Lizabeth, Berry, William R, Singer, Sara J
Format Journal Article
LanguageEnglish
Published London BMJ Group 01.08.2014
BMJ Publishing Group LTD
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Objective To assess the inter-rater reliability (IRR) of two novel observation tools for measuring surgical safety checklist performance and teamwork. Summary background Data surgical safety checklists can promote adherence to standards of care and improve teamwork in the operating room. Their use has been associated with reductions in mortality and other postoperative complications. However, checklist effectiveness depends on how well they are performed. Methods Authors from the Safe Surgery 2015 initiative developed a pair of novel observation tools through literature review, expert consultation and end-user testing. In one South Carolina hospital participating in the initiative, two observers jointly attended 50 surgical cases and independently rated surgical teams using both tools. We used descriptive statistics to measure checklist performance and teamwork at the hospital. We assessed IRR by measuring percent agreement, Cohen's κ, and weighted κ scores. Results The overall percent agreement and κ between the two observers was 93% and 0.74 (95% CI 0.66 to 0.79), respectively, for the Checklist Coaching Tool and 86% and 0.84 (95% CI 0.77 to 0.90) for the Surgical Teamwork Tool. Percent agreement for individual sections of both tools was 79% or higher. Additionally, κ scores for six of eight sections on the Checklist Coaching Tool and for two of five domains on the Surgical Teamwork Tool achieved the desired 0.7 threshold. However, teamwork scores were high and variation was limited. There were no significant changes in the percent agreement or κ scores between the first 10 and last 10 cases observed. Conclusions Both tools demonstrated substantial IRR and required limited training to use. These instruments may be used to observe checklist performance and teamwork in the operating room. However, further refinement and calibration of observer expectations, particularly in rating teamwork, could improve the utility of the tools.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ObjectType-Article-2
ObjectType-Feature-1
ISSN:2044-5415
2044-5423
DOI:10.1136/bmjqs-2013-002446