Performing Group Difference Testing on Graph Structured Data From GANs: Analysis and Applications in Neuroimaging

Generative adversarial networks (GANs) have emerged as a powerful generative model in computer vision. Given their impressive abilities in generating highly realistic images, they are also being used in novel ways in applications in the life sciences. This raises an interesting question when GANs ar...

Full description

Saved in:

Bibliographic Details
Published in	IEEE transactions on pattern analysis and machine intelligence Vol. 44; no. 2; pp. 877 - 889
Main Authors	Dinh, Tuan Q., Xiong, Yunyang, Huang, Zhichun, Vo, Tien, Mishra, Akshay, Kim, Won Hwa, Ravi, Sathya N., Singh, Vikas
Format	Journal Article
Language	English
Published	United States IEEE 01.02.2022 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects	Algorithms Alzheimer's disease Brain - diagnostic imaging Computer vision Data analysis Diseases Empirical analysis Finite element method Gallium nitride Generative adversarial network Generative adversarial networks Graph theory Group dynamics Humans hypothesis testing Image Processing, Computer-Assisted - methods Machine learning Medical imaging Neural networks Neural Networks, Computer Neuroimaging non-euclidean Statistical analysis Statistical methods Structured data Three-dimensional displays Training Training data
Online Access	Get full text
ISSN	0162-8828 1939-3539 2160-9292 1939-3539
DOI	10.1109/TPAMI.2020.3013433

Cover

More Information
Summary:	Generative adversarial networks (GANs) have emerged as a powerful generative model in computer vision. Given their impressive abilities in generating highly realistic images, they are also being used in novel ways in applications in the life sciences. This raises an interesting question when GANs are used in scientific or biomedical studies. Consider the setting where we are restricted to only using the samples from a trained GAN for downstream group difference analysis (and do not have direct access to the real data). Will we obtain similar conclusions? In this work, we explore if "generated" data, i.e., sampled from such GANs can be used for performing statistical group difference tests in cases versus controls studies, common across many scientific disciplines. We provide a detailed analysis describing regimes where this may be feasible. We complement the technical results with an empirical study focused on the analysis of cortical thickness on brain mesh surfaces in an Alzheimer's disease dataset. To exploit the geometric nature of the data, we use simple ideas from spectral graph theory to show how adjustments to existing GANs can yield improvements. We also give a generalization error bound by extending recent results on Neural Network Distance. To our knowledge, our work offers the first analysis assessing whether the Null distribution in "healthy versus diseased subjects" type statistical testing using data generated from the GANs coincides with the one obtained from the same analysis with real data. The code is available at https://github.com/yyxiongzju/GLapGAN .
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 The superscript “★” next to Xiong and Huang indicates equal contribution.
ISSN:	0162-8828 1939-3539 2160-9292 1939-3539
DOI:	10.1109/TPAMI.2020.3013433