PDE-Based Group Equivariant Convolutional Neural Networks

Bibliographic Details
Published in	Journal of Mathematical Imaging and Vision, Vol. 65, no. 1, pp. 209-239
Main Authors	Smets, Bart M. N.; Portegies, Jim; Bekkers, Erik J.; Duits, Remco
Format	Journal Article
Language	English
Published	New York: Springer US, 01.01.2023; Springer Nature B.V.
Subjects	Applications of Mathematics; Approximation; Artificial neural networks; Computer Science; Image Processing and Computer Vision; Kernels; Mathematical analysis; Mathematical Methods in Physics; Morphology; Neural networks; Signal, Image and Speech Processing; Solvers; PDE; Deep learning; Group equivariance; Morphological scale-space
Summary:	We present a PDE-based framework that generalizes Group equivariant Convolutional Neural Networks (G-CNNs). In this framework, a network layer is seen as a set of PDE-solvers where geometrically meaningful PDE-coefficients become the layer’s trainable weights. Formulating our PDEs on homogeneous spaces allows these networks to be designed with built-in symmetries such as rotation in addition to the standard translation equivariance of CNNs. Having all the desired symmetries included in the design obviates the need to include them by means of costly techniques such as data augmentation. We will discuss our PDE-based G-CNNs (PDE-G-CNNs) in a general homogeneous space setting while also going into the specifics of our primary case of interest: roto-translation equivariance. We solve the PDE of interest by a combination of linear group convolutions and nonlinear morphological group convolutions with analytic kernel approximations that we underpin with formal theorems. Our kernel approximations allow for fast GPU-implementation of the PDE-solvers; we release our implementation with this article in the form of the LieTorch extension to PyTorch, available at https://gitlab.com/bsmetsjr/lietorch. Just like for linear convolution, a morphological convolution is specified by a kernel that we train in our PDE-G-CNNs. In PDE-G-CNNs, we do not use non-linearities such as max/min-pooling and ReLUs as they are already subsumed by morphological convolutions. We present a set of experiments to demonstrate the strength of the proposed PDE-G-CNNs in increasing the performance of deep learning-based imaging applications with far fewer parameters than traditional CNNs.
ISSN:	0924-9907, 1573-7683
DOI:	10.1007/s10851-022-01114-x
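
Illustration (not part of the catalogued record): the summary states that max/min-pooling is subsumed by morphological convolution. The sketch below may help make that concrete. It assumes the simplest possible setting, a 1D signal on the translation group, rather than the roto-translation case the article treats, and it uses the max-plus (dilation) form, whose sign convention may differ from the article's. The function name `morphological_dilation_1d` is hypothetical and is not taken from LieTorch.

```python
import numpy as np

def morphological_dilation_1d(f, k):
    """Max-plus convolution on a 1D grid (illustrative sketch only).

    Computes (f (+) k)(x) = max_y [ f(x - y) + k(y) ], where the kernel k
    has odd length and its middle entry corresponds to offset y = 0.
    Samples outside the signal are treated as -inf so they never win the max.
    """
    f = np.asarray(f, dtype=float)
    k = np.asarray(k, dtype=float)
    if len(k) % 2 != 1:
        raise ValueError("kernel length must be odd")
    r = len(k) // 2
    padded = np.pad(f, r, constant_values=-np.inf)
    # windows[x, j] holds f(x + j - r); pairing it with the flipped kernel
    # gives f(x - y) + k(y) for every offset y in [-r, r].
    windows = np.lib.stride_tricks.sliding_window_view(padded, len(k))
    return (windows + k[::-1]).max(axis=1)

signal = np.array([1.0, 5.0, 2.0, 0.0, 3.0])

# A flat (all-zero) kernel reduces the operation to a stride-1 max-pool
# over a window of 3 samples: the result is [5, 5, 5, 3, 3].
flat = morphological_dilation_1d(signal, np.zeros(3))

# A non-flat kernel penalises neighbours, so the maximum is "softened"
# by distance; in a PDE-G-CNN such kernel values would be trained.
shaped = morphological_dilation_1d(signal, np.array([-1.0, 0.0, -1.0]))
```

With the flat kernel the operation is exactly a sliding maximum, which is the sense in which pooling is a special case of a morphological convolution; a trainable kernel generalises it.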