Enabling Equivariance for Arbitrary Lie Groups

Bibliographic Details
Published in	Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) pp. 8173 - 8182
Main Authors	MacDonald, Lachlan E., Ramasinghe, Sameera, Lucey, Simon
Format	Conference Proceeding
Language	English
Published	IEEE 01.06.2022
Subjects	Benchmark testing; Computer vision; Computer vision theory; Deep learning architectures and techniques; Explainable computer vision; Degradation; Mathematical models; Pattern recognition; Perturbation methods; Robustness
ISSN	1063-6919
DOI	10.1109/CVPR52688.2022.00801

Summary:	Although provably robust to translational perturbations, convolutional neural networks (CNNs) are known to suffer from extreme performance degradation when presented at test time with more general geometric transformations of inputs. Recently, this limitation has motivated a shift in focus from CNNs to Capsule Networks (CapsNets). However, CapsNets suffer from admitting relatively few theoretical guarantees of invariance. We introduce a rigorous mathematical framework to permit invariance to any Lie group of warps, exclusively using convolutions (over Lie groups), without the need for capsules. Previous work on group convolutions has been hampered by strong assumptions about the group, which preclude the application of such techniques to common warps in computer vision such as affine and homographic. Our framework enables the implementation of group convolutions over any finite-dimensional Lie group. We empirically validate our approach on the benchmark affine-invariant classification task, where we achieve ~30% improvement in accuracy against conventional CNNs while outperforming most CapsNets. As a further illustration of the generality of our framework, we train a homography-convolutional model which achieves superior robustness on a homography-perturbed dataset, where CapsNet results degrade.