Test-retest reliability of freesurfer measurements within and between sites: Effects of visual approval process

In the last decade, many studies have used automated processes to analyze magnetic resonance imaging (MRI) data such as cortical thickness, which is one indicator of neuronal health. Due to the convenience of image processing software (e.g., FreeSurfer), standard practice is to rely on automated res...

Full description

Saved in:

Bibliographic Details
Published in	Human brain mapping Vol. 36; no. 9; pp. 3472 - 3485
Main Authors	Iscan, Zafer, Jin, Tony B., Kendrick, Alexandria, Szeglin, Bryan, Lu, Hanzhang, Trivedi, Madhukar, Fava, Maurizio, McGrath, Patrick J., Weissman, Myrna, Kurian, Benji T., Adams, Phillip, Weyandt, Sarah, Toups, Marisa, Carmody, Thomas, McInnis, Melvin, Cusin, Cristina, Cooper, Crystal, Oquendo, Maria A., Parsey, Ramin V., DeLorenzo, Christine
Format	Journal Article
Language	English
Published	United States Blackwell Publishing Ltd 01.09.2015 John Wiley & Sons, Inc John Wiley and Sons Inc
Subjects	Adolescent Adult Aged Cerebral Cortex - anatomy & histology cerebral cortical surface area cerebral cortical thickness cerebral cortical volume FreeSurfer Humans Image Processing, Computer-Assisted - methods Magnetic Resonance Imaging - methods Middle Aged multisite MRI Organ Size Pattern Recognition, Automated - methods Reproducibility of Results Software test-retest reliability Young Adult multisite MRI test-retest reliability FreeSurfer cerebral cortical volume cerebral cortical surface area cerebral cortical thickness
Online Access	Get full text

Cover

Loading…

More Information
Summary:	In the last decade, many studies have used automated processes to analyze magnetic resonance imaging (MRI) data such as cortical thickness, which is one indicator of neuronal health. Due to the convenience of image processing software (e.g., FreeSurfer), standard practice is to rely on automated results without performing visual inspection of intermediate processing. In this work, structural MRIs of 40 healthy controls who were scanned twice were used to determine the test–retest reliability of FreeSurfer‐derived cortical measures in four groups of subjects—those 25 that passed visual inspection (approved), those 15 that failed visual inspection (disapproved), a combined group, and a subset of 10 subjects (Travel) whose test and retest scans occurred at different sites. Test–retest correlation (TRC), intraclass correlation coefficient (ICC), and percent difference (PD) were used to measure the reliability in the Destrieux and Desikan–Killiany (DK) atlases. In the approved subjects, reliability of cortical thickness/surface area/volume (DK atlas only) were: TRC (0.82/0.88/0.88), ICC (0.81/0.87/0.88), PD (0.86/1.19/1.39), which represent a significant improvement over these measures when disapproved subjects are included. Travel subjects’ results show that cortical thickness reliability is more sensitive to site differences than the cortical surface area and volume. To determine the effect of visual inspection on sample size required for studies of MRI‐derived cortical thickness, the number of subjects required to show group differences was calculated. Significant differences observed across imaging sites, between visually approved/disapproved subjects, and across regions with different sizes suggest that these measures should be used with caution. Hum Brain Mapp 36:3472–3485, 2015. © 2015 Wiley Periodicals, Inc.
Bibliography:	ArticleID:HBM22856 ark:/67375/WNG-G7N1M5X3-Z National Institute of Mental Health (NIMH) - No. U01 MH092250 and K01MH091354 istex:C1DC3F38E43A189F2D7C2E1501872D45E1112CB0 ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Undefined-1 ObjectType-Feature-3 content type line 23 ObjectType-Article-1 ObjectType-Feature-2
ISSN:	1065-9471 1097-0193 1097-0193
DOI:	10.1002/hbm.22856