Multi-center machine learning in imaging psychiatry: A meta-model approach

One of the biggest problems in automated diagnosis of psychiatric disorders from medical images is the lack of sufficiently large samples for training. Sample size is especially important in the case of highly heterogeneous disorders such as schizophrenia, where machine learning models built on rela...

Full description

Saved in:

Bibliographic Details
Published in	NeuroImage (Orlando, Fla.) Vol. 155; pp. 10 - 24
Main Authors	Dluhoš, Petr, Schwarz, Daniel, Cahn, Wiepke, van Haren, Neeltje, Kahn, René, Španiel, Filip, Horáček, Jiří, Kašpárek, Tomáš, Schnack, Hugo
Format	Journal Article
Language	English
Published	United States Elsevier Inc 15.07.2017 Elsevier Limited
Subjects	Automation Classification combining models Computer applications Consortia Datasets as Topic Ethics Female first-episode schizophrenia Humans Learning Learning algorithms Machine learning Magnetic Resonance Imaging Male Medical imaging Mental disorders meta-model multi-center Multicenter Studies as Topic Neuroimaging - methods NMR Nuclear magnetic resonance Pattern Recognition, Automated - methods prediction Psychiatry Schizophrenia Schizophrenia - diagnostic imaging Studies Support Vector Machine support vector machines (SVM) support vector machines (SVM) multi-center meta-model Machine learning prediction combining models classification first-episode schizophrenia
Online Access	Get full text

Cover

Loading…

More Information
Summary:	One of the biggest problems in automated diagnosis of psychiatric disorders from medical images is the lack of sufficiently large samples for training. Sample size is especially important in the case of highly heterogeneous disorders such as schizophrenia, where machine learning models built on relatively low numbers of subjects may suffer from poor generalizability. Via multicenter studies and consortium initiatives researchers have tried to solve this problem by combining data sets from multiple sites. The necessary sharing of (raw) data is, however, often hindered by legal and ethical issues. Moreover, in the case of very large samples, the computational complexity might become too large. The solution to this problem could be distributed learning. In this paper we investigated the possibility to create a meta-model by combining support vector machines (SVM) classifiers trained on the local datasets, without the need for sharing medical images or any other personal data. Validation was done in a 4-center setup comprising of 480 first-episode schizophrenia patients and healthy controls in total. We built SVM models to separate patients from controls based on three different kinds of imaging features derived from structural MRI scans, and compared models built on the joint multicenter data to the meta-models. The results showed that the combined meta-model had high similarity to the model built on all data pooled together and comparable classification performance on all three imaging features. Both similarity and performance was superior to that of the local models. We conclude that combining models is thus a viable alternative that facilitates data sharing and creating bigger and more informative models. [Display omitted] •New method to learn from multi-center neuroimaging data is proposed: meta-modeling.•Locally trained prediction models instead of subject data are shared between sites.•This approach avoids legal and ethical issues and is computationally simpler.•The method is tested in a 4-center (480 images) first-episode schizophrenia study.•Meta-models and models trained on all data together have comparable performance.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23
ISSN:	1053-8119 1095-9572 1095-9572
DOI:	10.1016/j.neuroimage.2017.03.027