Universality of deep convolutional neural networks

Deep learning has been widely applied and brought breakthroughs in speech recognition, computer vision, and many other domains. Deep neural network architectures and computational issues have been well studied in machine learning. But there lacks a theoretical foundation for understanding the approx...

Full description

Saved in:

Bibliographic Details
Published in	Applied and computational harmonic analysis Vol. 48; no. 2; pp. 787 - 794
Main Author	Zhou, Ding-Xuan
Format	Journal Article
Language	English
Published	Elsevier Inc 01.03.2020
Subjects	Approximation theory Convolutional neural network Deep learning Universality Deep learning Approximation theory Universality Convolutional neural network
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Deep learning has been widely applied and brought breakthroughs in speech recognition, computer vision, and many other domains. Deep neural network architectures and computational issues have been well studied in machine learning. But there lacks a theoretical foundation for understanding the approximation or generalization ability of deep learning methods generated by the network architectures such as deep convolutional neural networks. Here we show that a deep convolutional neural network (CNN) is universal, meaning that it can be used to approximate any continuous function to an arbitrary accuracy when the depth of the neural network is large enough. This answers an open question in learning theory. Our quantitative estimate, given tightly in terms of the number of free parameters to be computed, verifies the efficiency of deep CNNs in dealing with large dimensional data. Our study also demonstrates the role of convolutions in deep CNNs.
ISSN:	1063-5203 1096-603X
DOI:	10.1016/j.acha.2019.06.004