Site effects how-to and when: An overview of retrospective techniques to accommodate site effects in multi-site neuroimaging analyses
Site differences, or systematic differences in feature distributions across multiple data-acquisition sites, are a known source of heterogeneity that may adversely affect large-scale meta- and mega-analyses of independently collected neuroimaging data. They influence nearly all multi-site imaging mo...
Saved in:
Published in | Frontiers in neurology Vol. 13; p. 923988 |
---|---|
Main Authors | , , , , , , , , , |
Format | Journal Article |
Language | English |
Published |
Switzerland
Frontiers Media S.A
31.10.2022
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Site differences, or systematic differences in feature distributions across multiple data-acquisition sites, are a known source of heterogeneity that may adversely affect large-scale meta- and mega-analyses of independently collected neuroimaging data. They influence nearly all multi-site imaging modalities and biomarkers, and methods to compensate for them can improve reliability and generalizability in the analysis of genetics, omics, and clinical data. The origins of statistical site effects are complex and involve both
differences (scanner vendor, head coil, acquisition parameters, imaging processing) and differences in
(inclusion/exclusion criteria, sample size, ancestry) between sites. In an age of expanding international consortium research, there is a growing need to disentangle technical site effects from sample characteristics of interest. Numerous statistical and machine learning methods have been developed to control for, model, or attenuate site effects - yet to date, no comprehensive review has discussed the benefits and drawbacks of each for different use cases. Here, we provide an overview of the different existing statistical and machine learning methods developed to remove unwanted site effects from independently collected neuroimaging samples. We focus on linear mixed effect models, the ComBat technique and its variants, adjustments based on image quality metrics, normative modeling, and deep learning approaches such as generative adversarial networks. For each method, we outline the statistical foundation and summarize strengths and weaknesses, including their assumptions and conditions of use. We provide information on software availability and comment on the ease of use and the applicability of these methods to different types of data. We discuss validation and comparative reports, mention caveats and provide guidance on when to use each method, depending on context and specific research questions. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 ObjectType-Review-3 content type line 23 These authors share last authorship Reviewed by: Paul Gerson Unschuld, Université de Genève, Switzerland; Muhamed Barakovic, University of Basel, Switzerland Edited by: Maxime Descoteaux, Université de Sherbrooke, Canada This article was submitted to Applied Neuroimaging, a section of the journal Frontiers in Neurology |
ISSN: | 1664-2295 1664-2295 |
DOI: | 10.3389/fneur.2022.923988 |