Prospective analysis of inter-observer and intra-observer variability in multi ultrasound descriptor assessment of thyroid nodules

Bibliographic Details
Published in Journal of ultrasonography Vol. 19; no. 78; pp. 198-206
Main Authors Dobruch-Sobczak, Katarzyna, Migda, Bartosz, Krauze, Agnieszka, Mlosek, Krzysztof, Słapa, Rafał Z., Wareluk, Paweł, Bakuła-Zalewska, Elwira, Adamczewski, Zbigniew, Lewiński, Andrzej, Jakubowski, Wiesław, Dedecjus, Marek
Format Journal Article
Language English
Published Sciendo 01.11.2019
Exeley Inc
Summary: The aim of this study was to evaluate the inter- and intra-observer variability and accuracy of ultrasound assessment of thyroid nodules using a descriptive lexicon. A prospective study was performed on complete ultrasound examinations, including sonoelastography and color Doppler ultrasound, of 18 patients with 20 thyroid nodules. A total of 20 records of thyroid nodules from these techniques were duplicated, numbered, and randomly arranged. Five radiologists assessed the recordings independently. Cohen's kappa and Fleiss' kappa statistics were used to determine the degree of intra- and inter-observer agreement. Mean accuracy rates for all radiologists, for all ultrasound features, ranged from 82.7 to 87.8%. For B-mode and strain elastography, accuracies ranged from 65.0 to 100% and from 47.4 to 86.8%, respectively. Concerning intra-observer variability, three radiologists demonstrated almost perfect agreement (κ-values ranged from 0.81 to 0.86), and substantial agreement was noted for the two remaining radiologists. The κ-values for inter-observer agreement ranged from 0.61 for macrocalcifications (substantial agreement) to 0.33 for the Asteria four-point elastography scale criteria (fair agreement). The results suggest relatively good inter-observer and excellent intra-observer agreement in the assessment of thyroid nodules using ultrasound, and fair agreement in the case of strain elastography.
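The two agreement statistics named in the summary can be illustrated with a minimal Python sketch. It uses the textbook definitions of Cohen's kappa (two sets of ratings) and Fleiss' kappa (several raters per item); it is not the authors' analysis code. The function names are hypothetical, and the verbal labels assume the Landis and Koch scale, which matches the wording of the summary (0.33 fair, 0.61 substantial, 0.81 almost perfect) but is not confirmed by this record.

```python
from collections import Counter


def cohen_kappa(first, second):
    """Cohen's kappa for two equal-length sequences of categorical ratings,
    e.g. one reader's first and second reading of the same nodules."""
    if len(first) != len(second) or len(first) == 0:
        raise ValueError("need two non-empty rating sequences of equal length")
    n = len(first)
    # Observed agreement: share of items given the same label both times.
    p_o = sum(x == y for x, y in zip(first, second)) / n
    # Chance agreement from the marginal label frequencies.
    c1, c2 = Counter(first), Counter(second)
    p_e = sum(c1[label] * c2[label] for label in c1) / (n * n)
    if p_e == 1.0:
        return 1.0  # degenerate case: a single label used throughout
    return (p_o - p_e) / (1.0 - p_e)


def fleiss_kappa(counts):
    """Fleiss' kappa. counts[i][j] is the number of raters who put item i
    into category j; every row must sum to the same number of raters (>= 2)."""
    n_items = len(counts)
    n_raters = sum(counts[0])
    if n_raters < 2 or any(sum(row) != n_raters for row in counts):
        raise ValueError("each item needs the same number of raters (>= 2)")
    n_cats = len(counts[0])
    # Overall share of all assignments that fell into each category.
    p_j = [sum(row[j] for row in counts) / (n_items * n_raters)
           for j in range(n_cats)]
    # Per-item agreement among rater pairs, averaged over items.
    p_i = [(sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
           for row in counts]
    p_bar = sum(p_i) / n_items
    p_e = sum(p * p for p in p_j)
    if p_e == 1.0:
        return 1.0  # degenerate case: a single category used throughout
    return (p_bar - p_e) / (1.0 - p_e)


def agreement_label(kappa):
    """Verbal label on the Landis and Koch scale (an assumption here)."""
    if kappa < 0:
        return "poor"
    if kappa <= 0.20:
        return "slight"
    if kappa <= 0.40:
        return "fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"
```

With made-up toy ratings (not data from the study), cohen_kappa([1, 1, 0, 0], [1, 0, 0, 0]) compares two readings of four items, and fleiss_kappa([[5, 0], [0, 5]]) describes five raters who agree fully on two items. Passing a κ-value quoted in the summary to agreement_label reproduces the label given there.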
ISSN: 2084-8404, 2451-070X
DOI: 10.15557/jou.2019.0030