Quantitative evaluation of automated skull-stripping methods applied to contemporary and legacy images: Effects of diagnosis, bias correction, and slice location

Performance of automated methods to isolate brain from nonbrain tissues in magnetic resonance (MR) structural images may be influenced by MR signal inhomogeneities, type of MR image set, regional anatomy, and age and diagnosis of subjects studied. The present study compared the performance of four m...

Full description

Saved in:
Bibliographic Details
Published inHuman brain mapping Vol. 27; no. 2; pp. 99 - 113
Main Authors Fennema-Notestine, Christine, Ozyurt, I. Burak, Clark, Camellia P., Morris, Shaunna, Bischoff-Grethe, Amanda, Bondi, Mark W., Jernigan, Terry L., Fischl, Bruce, Segonne, Florent, Shattuck, David W., Leahy, Richard M., Rex, David E., Toga, Arthur W., Zou, Kelly H., Brown, Gregory G.
Format Journal Article
LanguageEnglish
Published Hoboken Wiley Subscription Services, Inc., A Wiley Company 01.02.2006
Wiley-Liss
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Performance of automated methods to isolate brain from nonbrain tissues in magnetic resonance (MR) structural images may be influenced by MR signal inhomogeneities, type of MR image set, regional anatomy, and age and diagnosis of subjects studied. The present study compared the performance of four methods: Brain Extraction Tool (BET; Smith [2002]: Hum Brain Mapp 17:143–155); 3dIntracranial (Ward [1999] Milwaukee: Biophysics Research Institute, Medical College of Wisconsin; in AFNI); a Hybrid Watershed algorithm (HWA, Segonne et al. [2004] Neuroimage 22:1060–1075; in FreeSurfer); and Brain Surface Extractor (BSE, Sandor and Leahy [1997] IEEE Trans Med Imag 16:41–54; Shattuck et al. [2001] Neuroimage 13:856–876) to manually stripped images. The methods were applied to uncorrected and bias‐corrected datasets; Legacy and Contemporary T1‐weighted image sets; and four diagnostic groups (depressed, Alzheimer's, young and elderly control). To provide a criterion for outcome assessment, two experts manually stripped six sagittal sections for each dataset in locations where brain and nonbrain tissue are difficult to distinguish. Methods were compared on Jaccard similarity coefficients, Hausdorff distances, and an Expectation‐Maximization algorithm. Methods tended to perform better on contemporary datasets; bias correction did not significantly improve method performance. Mesial sections were most difficult for all methods. Although AD image sets were most difficult to strip, HWA and BSE were more robust across diagnostic groups compared with 3dIntracranial and BET. With respect to specificity, BSE tended to perform best across all groups, whereas HWA was more sensitive than other methods. The results of this study may direct users towards a method appropriate to their T1‐weighted datasets and improve the efficiency of processing for large, multisite neuroimaging studies. Hum. Brain Mapping, 2005. © 2005 Wiley‐Liss, Inc.
Bibliography:National Center for Research Resources at the National Institutes of Health (NIH) - No. U24 RR021382; No. BIRN002; No. BIRN004; No. M01RR00827; No. P41-RR14075; No. R01 RR16594-01A1; No. P41-RR13642
National Institute for Biomedical Imaging and Bioengineering (NIBIB) at NIH - No. R01 EB002010
San Diego Alzheimer's Disease Research Center - No. P50AGO5131
Department of Veterans Affairs Medical Research Service - No. VA Merit Review and Research Enhancement award programs
HIV Neurobehavioral Research Center - No. MH45294
istex:AD915FDCFCEBECCBD59A2585AFFADA80479676F9
National Institute on Aging at NIH - No. R01 AG12674; No. AG04085
ArticleID:HBM20161
Mental Illness and Neuroscience Discovery (MIND) Institute
ark:/67375/WNG-88Z1D3BX-5
National Institute of Mental Health at NIH - No. 5K08MH01642; No. R01MH42575
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ObjectType-Article-2
ObjectType-Feature-1
ISSN:1065-9471
1097-0193
DOI:10.1002/hbm.20161