Explainable equivariant neural networks for particle physics: PELICAN

A bstract PELICAN is a novel permutation equivariant and Lorentz invariant or covariant aggregator network designed to overcome common limitations found in architectures applied to particle physics problems. Compared to many approaches that use non-specialized architectures that neglect underlying p...

Full description

Saved in:

Bibliographic Details
Published in	The journal of high energy physics Vol. 2024; no. 3; pp. 113 - 66
Main Authors	Bogatskiy, Alexander, Hoffman, Timothy, Miller, David W., Offermann, Jan T., Liu, Xiaoyang
Format	Journal Article
Language	English
Published	Berlin/Heidelberg Springer Berlin Heidelberg 20.03.2024 Springer Nature B.V SpringerOpen
Subjects	Algorithms Classical and Quantum Gravitation Complexity Elementary Particles Jets and Jet Substructure Machine learning Neural networks Particle physics Permutations Physics Physics and Astronomy Quantum Field Theories Quantum Field Theory Quantum Physics Quarks Regular Article - Theoretical Physics Relativity Theory Specific QCD Phenomenology String Theory Symmetry Top Quark Top Quark Specific QCD Phenomenology Jets and Jet Substructure
Online Access	Get full text

Cover

Loading…

More Information
Summary:	A bstract PELICAN is a novel permutation equivariant and Lorentz invariant or covariant aggregator network designed to overcome common limitations found in architectures applied to particle physics problems. Compared to many approaches that use non-specialized architectures that neglect underlying physics principles and require very large numbers of parameters, PELICAN employs a fundamentally symmetry group-based architecture that demonstrates benefits in terms of reduced complexity, increased interpretability, and raw performance. We present a comprehensive study of the PELICAN algorithm architecture in the context of both tagging (classification) and reconstructing (regression) Lorentz-boosted top quarks, including the difficult task of specifically identifying and measuring the W -boson inside the dense environment of the Lorentz-boosted top-quark hadronic final state. We also extend the application of PELICAN to the tasks of identifying quark-initiated vs. gluon-initiated jets, and a multi-class identification across five separate target categories of jets. When tested on the standard task of Lorentz-boosted top-quark tagging, PELICAN outperforms existing competitors with much lower model complexity and high sample efficiency. On the less common and more complex task of 4-momentum regression, PELICAN also outperforms hand-crafted, non-machine learning algorithms. We discuss the implications of symmetry-restricted architectures for the wider field of machine learning for physics.
ISSN:	1029-8479 1029-8479
DOI:	10.1007/JHEP03(2024)113