Bridging Big Data: Procedures for Combining Non-equivalent Cognitive Measures from the ENIGMA Consortium

Bibliographic Details
Published in bioRxiv
Main Authors Kennedy, Eamonn; Vadlamani, Shashank; Lindsey, Hannah M; Lei, Pui-Wa; Pugh, Mary Jo; Adamson, Maheen; Alda, Martin; Alonso-Lana, Silvia; Ambrogi, Sonia; Anderson, Tim J; Arango, Celso; Asarnow, Robert; Avram, Mihai; Ayesa-Arriola, Rosa; Babikian, Talin; Banaj, Nerisa; Bird, Laura J; Borgwardt, Stefan; Brodtmann, Amy; Brosch, Katharina; Caeyenberghs, Karen; Calhoun, Vince D; Chiaravalloti, Nancy D; Cifu, David X; Crespo-Facorro, Benedicto; Dalrymple-Alford, John C; Dams-O'Connor, Kristen; Dannlowski, Udo; Darby, David; Davenport, Nicholas; DeLuca, John; Diaz-Caneja, Covadonga M; Disner, Seth G; Dobryakova, Ekaterina; Ehrlich, Stefan; Esopenko, Carrie; Ferrarelli, Fabio; Frank, Lea E; Franz, Carol; Fuentes-Claramonte, Paola; Genova, Helen; Giza, Christopher C; Goltermann, Janik; Grotegerd, Dominik; Gruber, Marius; Gutierrez-Zotes, Alfonso; Ha, Minji; Haavik, Jan; Hinkin, Charles; Hoskinson, Kristen R; Hubl, Daniela; Irimia, Andrei; Jansen, Andreas; Kaess, Michael; Kang, Xiaojian; Kenney, Kimbra; Kerkova, Barbora; Khlif, Mohamed Salah; Kim, Minah; Kindler, Jochen; Kircher, Tilo; Knizkova, Karolina; Kolskar, Knut K; Krch, Denise; Kremen, William S; Kuhn, Taylor; Kumari, Veena; Kwon, Jun Soo; Langella, Roberto; Laskowitz, Sarah; Lee, Jungha; Lengenfelder, Jean; Liebel, Spencer W; Liou-Johnson, Victoria; Lippa, Sara M; Lovstad, Marianne; Lundervold, Astri J; Marotta, Cassandra; Marquardt, Craig A; Mattos, Paulo; Mayeli, Ahmad; McDonald, Carrie R; Meinert, Susanne; Melzer, Tracy R; Merchan-Naranjo, Jessica; Michel, Chantal; Morey, Rajendra A; Mwangi, Benson; Myall, Daniel J; Nenadi, Igor; Newsome, Mary R; Nunes, Abraham; O'Brien, Terence; Oertel, Viola; Ollinger, John; Olsen, Alexander; Ortiz Garcia de la Foz, Victor; Ozmen, Mustafa; Pardoe, Heath; Parent, Marise
Format Journal Article Paper
Language English
Published United States Cold Spring Harbor Laboratory Press 19.01.2023
Cold Spring Harbor Laboratory
Edition 1.2
ISSN 2692-8205
DOI 10.1101/2023.01.16.524331

Summary: Investigators in neuroscience have turned to Big Data to address replication and reliability issues by increasing sample sizes, statistical power, and representativeness of data. These efforts raise new questions about integrating data arising from distinct sources and instruments. We focus on the most frequently assessed cognitive domain - memory testing - and demonstrate a process for reliable data harmonization across three common measures. We aggregated global raw data from 53 studies totaling N = 10,505 individuals. A mega-analysis was conducted using empirical Bayes harmonization to remove site effects, followed by linear models adjusting for common covariates. A continuous item response theory (IRT) model estimated each individual's latent verbal learning ability while accounting for item difficulties. Harmonization significantly reduced inter-site variance while preserving covariate effects, and our conversion tool is freely available online. This demonstrates that large-scale data sharing and harmonization initiatives can address reproducibility and integration challenges across the behavioral sciences.
Competing Interest Statement: Dr. Arango has been a consultant to or has received honoraria or grants from Acadia, Angelini, Biogen, Boehringer, Gedeon Richter, Janssen Cilag, Lundbeck, Medscape, Menarini, Minerva, Otsuka, Pfizer, Roche, Sage, Servier, Shire, Schering Plough, Sumitomo Dainippon Pharma, Sunovion and Takeda. Dr. Brodtmann serves on the editorial boards of Neurology and International Journal of Stroke. Dr. Diaz-Caneja has received honoraria from Exeltis and Angelini. Dr. Giza: consultant for NBA, NFL, NHLPA, Los Angeles Lakers; Advisory Board: Highmark Interactive, Novartis, MLS, NBA, USSF; Medicolegal 1-2 cases annually. Dr. Soares: ALKERMES (Research Grant), ALLERGAN (Research Grant), ASOFARMA (Consultant), ATAI (Stock), BOEHRINGER Ingelheim (Consultant), COMPASS (Research Grant), JOHNSON & JOHNSON (Consultant), LIVANOVA (Consultant), PFIZER (Consultant), PULVINAR NEURO LLC (Consultant), RELMADA (Consultant), SANOFI (Consultant), SUNOVION (Consultant). Dr. Thompson received partial research support from Biogen, Inc., for research unrelated to this manuscript. Dr. Yatham has been on speaker or advisory boards for, or has received research grants from, Alkermes, Abbvie, Canadian Institutes of Health Research, Sumitomo Dainippon Pharma, GlaxoSmithKline, Intracellular Therapies, Merck, Sanofi, Seqirus, Servier, and Sunovion, over the past 3 years, all outside this work. The collection of this cohort was partially supported by an investigator-initiated research grant from Biogen (US). Biogen had no role in the analysis or writing of this manuscript. Eisai (JP) and Life Molecular Imaging for research unrelated to this manuscript. Dr. Wylie has received research support from the NJ Commission for brain injury research, from the Dept of Veterans' Affairs, from Biogen, from Bristol Myers Squibb, from Genentech, and has served on advisory boards for the CDMRP and the VA. All of these activities are unrelated to this research.
The views expressed in this article are those of the author(s) and do not reflect the official policy of the Department of Army/Navy/Air Force, Department of Defense, or U.S. Government.