Performance-optimized hierarchical models predict neural responses in higher visual cortex

The ventral visual stream underlies key human visual object recognition abilities. However, neural encoding in the higher areas of the ventral stream remains poorly understood. Here, we describe a modeling approach that yields a quantitatively accurate model of inferior temporal (IT) cortex, the hig...

Full description

Saved in:

Bibliographic Details
Published in	Proceedings of the National Academy of Sciences - PNAS Vol. 111; no. 23; pp. 8619 - 8624
Main Authors	Yamins, Daniel L. K., Hong, Ha, Cadieu, Charles F., Solomon, Ethan A., Seibert, Darren, DiCarlo, James J.
Format	Journal Article
Language	English
Published	United States National Academy of Sciences 10.06.2014 National Acad Sciences
Subjects	Algorithms Animals Architectural models Architecture Biological Sciences Humans Information technology Macaca mulatta - physiology Modeling Models, Neurological Multilevel models Nerve Net - physiology Neural Networks, Computer Neurons Neurosciences Object recognition Parametric models Photic Stimulation - methods Physiology Predictive modeling Psychomotor Performance - physiology Recognition, Psychology - physiology Visual cortex Visual Cortex - physiology Visual Pathways - physiology Visual Perception - physiology computer vision array electrophysiology computational neuroscience
Online Access	Get full text

Cover

Loading…

More Information
Summary:	The ventral visual stream underlies key human visual object recognition abilities. However, neural encoding in the higher areas of the ventral stream remains poorly understood. Here, we describe a modeling approach that yields a quantitatively accurate model of inferior temporal (IT) cortex, the highest ventral cortical area. Using high-throughput computational techniques, we discovered that, within a class of biologically plausible hierarchical neural network models, there is a strong correlation between a model's categorization performance and its ability to predict individual IT neural unit response data. To pursue this idea, we then identified a high-performing neural network that matches human performance on a range of recognition tasks. Critically, even though we did not constrain this model to match neural data, its top output layer turns out to be highly predictive of IT spiking responses to complex naturalistic images at both the single site and population levels. Moreover, the model's intermediate layers are highly predictive of neural responses in the V4 cortex, a midlevel visual area that provides the dominant cortical input to IT. These results show that performance optimization—applied in a biologically appropriate model class— can be used to build quantitative predictive models of neural processing.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 1D.L.K.Y. and H.H. contributed equally to this work. Author contributions: D.L.K.Y., H.H., and J.J.D. designed research; D.L.K.Y., H.H., and E.A.S. performed research; D.L.K.Y. contributed new reagents/analytic tools; D.L.K.Y., H.H., C.F.C., and D.S. analyzed data; and D.L.K.Y., H.H., and J.J.D. wrote the paper. Edited by Terrence J. Sejnowski, Salk Institute for Biological Studies, La Jolla, CA, and approved April 8, 2014 (received for review March 3, 2014)
ISSN:	0027-8424 1091-6490
DOI:	10.1073/pnas.1403112111