Absence of Barren Plateaus in Quantum Convolutional Neural Networks

Quantum neural networks (QNNs) have generated excitement around the possibility of efficiently analyzing quantum data. But this excitement has been tempered by the existence of exponentially vanishing gradients, known as barren plateau landscapes, for many QNN architectures. Recently, quantum convol...

Full description

Saved in:

Bibliographic Details
Published in	Physical review. X Vol. 11; no. 4; p. 041011
Main Authors	Pesah, Arthur, Cerezo, M., Wang, Samson, Volkoff, Tyler, Sornborger, Andrew T., Coles, Patrick J.
Format	Journal Article
Language	English
Published	College Park American Physical Society 01.10.2021
Subjects	Artificial neural networks Cognitive tasks Computer simulation Cost function Landscape architecture Machine learning Neural networks Optimization Quantum computers Qubits (quantum computing)
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Quantum neural networks (QNNs) have generated excitement around the possibility of efficiently analyzing quantum data. But this excitement has been tempered by the existence of exponentially vanishing gradients, known as barren plateau landscapes, for many QNN architectures. Recently, quantum convolutional neural networks (QCNNs) have been proposed, involving a sequence of convolutional and pooling layers that reduce the number of qubits while preserving information about relevant data features. In this work, we rigorously analyze the gradient scaling for the parameters in the QCNN architecture. We find that the variance of the gradient vanishes no faster than polynomially, implying that QCNNs do not exhibit barren plateaus. This result provides an analytical guarantee for the trainability of randomly initialized QCNNs, which highlights QCNNs as being trainable under random initialization unlike many other QNN architectures. To derive our results, we introduce a novel graph-based method to analyze expectation values over Haar-distributed unitaries, which will likely be useful in other contexts. Finally, we perform numerical simulations to verify our analytical results.
Bibliography:	USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) 20200677PRD1; 20190065DR
ISSN:	2160-3308 2160-3308
DOI:	10.1103/PhysRevX.11.041011