Multi-site fMRI analysis using privacy-preserving federated learning and domain adaptation: ABIDE results

•A novel framework for multi-site fMRI analysis without data-sharing using privacy-preserving federated learning.•The first employment of domain adaptation techniques on federated learning formulation for medical image analysis.•Comparisons to baseline strategies and innovative model evaluation meth...

Full description

Saved in:
Bibliographic Details
Published inMedical image analysis Vol. 65; p. 101765
Main Authors Li, Xiaoxiao, Gu, Yufeng, Dvornek, Nicha, Staib, Lawrence H., Ventola, Pamela, Duncan, James S.
Format Journal Article
LanguageEnglish
Published Amsterdam Elsevier B.V 01.10.2020
Elsevier BV
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:•A novel framework for multi-site fMRI analysis without data-sharing using privacy-preserving federated learning.•The first employment of domain adaptation techniques on federated learning formulation for medical image analysis.•Comparisons to baseline strategies and innovative model evaluation methods from the biomarker interpretation perspective.•New insights into utilizing multi-site medical data to improve both tasks performance and replicable and informative biomarker detection.•Potential solution to training deep learning models on multiple small, heterogeneous, privacy-sensitive medical datasets. [Display omitted] Deep learning models have shown their advantage in many different tasks, including neuroimage analysis. However, to effectively train a high-quality deep learning model, the aggregation of a significant amount of patient information is required. The time and cost for acquisition and annotation in assembling, for example, large fMRI datasets make it difficult to acquire large numbers at a single site. However, due to the need to protect the privacy of patient data, it is hard to assemble a central database from multiple institutions. Federated learning allows for population-level models to be trained without centralizing entities’ data by transmitting the global model to local entities, training the model locally, and then averaging the gradients or weights in the global model. However, some studies suggest that private information can be recovered from the model gradients or weights. In this work, we address the problem of multi-site fMRI classification with a privacy-preserving strategy. To solve the problem, we propose a federated learning approach, where a decentralized iterative optimization algorithm is implemented and shared local model weights are altered by a randomization mechanism. Considering the systemic differences of fMRI distributions from different sites, we further propose two domain adaptation methods in this federated learning formulation. We investigate various practical aspects of federated model optimization and compare federated learning with alternative training strategies. Overall, our results demonstrate that it is promising to utilize multi-site data without data sharing to boost neuroimage analysis performance and find reliable disease-related biomarkers. Our proposed pipeline can be generalized to other privacy-sensitive medical data analysis problems. Our code is publicly available at: https://github.com/xxlya/Fed_ABIDE/.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
YG and ND had equal contribution.
This work was done at Yale University.
ISSN:1361-8415
1361-8423
DOI:10.1016/j.media.2020.101765