Towards an automated data cleaning with deep learning in CRESST

The CRESST experiment employs cryogenic calorimeters for the sensitive measurement of nuclear recoils induced by dark matter particles. The recorded signals need to undergo a careful cleaning process to avoid wrongly reconstructed recoil energies caused by pile-up and read-out artefacts. We frame th...

Full description

Saved in:

Bibliographic Details
Published in	arXiv.org
Main Authors	Angloher, G, Banik, S, Bartolot, D, Benato, G, Bento, A, Bertolini, A, Breier, R, Bucci, C, Burkhart, J, Canonica, L, D'Addabbo, A, S Di Lorenzo, Einfalt, L, Erb, A, Feilitzsch, F v, N Ferreiro Iachellini, Fichtinger, S, Fuchs, D, Fuss, A, Garai, A, Ghete, V M, Gerster, S, Gorla, P, Guillaumon, P V, Gupta, S, Hauff, D, Ješkovský, M, Jochum, J, Kaznacheeva, M, Kinast, A, Kluck, H, Kraus, H, Lackner, M, Langenkämper, A, Mancuso, M, Marini, L, Meyer, L, Mokina, V, Nilima, A, Olmi, M, Ortmann, T, Pagliarone, C, Pattavina, L, Petricca, F, Potzel, W, Povinec, P, Pröbst, F, Pucci, F, Reindl, F, Rizvanovic, D, Rothe, J, Schäffner, K, Schieck, J, Schmiedmayer, D, Schönert, S, Schwertner, C, Stahlberg, M, Stodolsky, L, Strandhagen, C, Strauss, R, Usherov, I, Wagner, F, Willers, M, Zema, V, Waltenberger, W
Format	Paper Journal Article
Language	English
Published	Ithaca Cornell University Library, arXiv.org 07.01.2023
Subjects	Automation Classifiers Cleaning Computer architecture Dark matter Machine learning Neural networks Physics - Instrumentation and Detectors Physics - Instrumentation and Methods for Astrophysics Selectivity Signal processing
Online Access	Get full text

Cover

Loading…

More Information
Summary:	The CRESST experiment employs cryogenic calorimeters for the sensitive measurement of nuclear recoils induced by dark matter particles. The recorded signals need to undergo a careful cleaning process to avoid wrongly reconstructed recoil energies caused by pile-up and read-out artefacts. We frame this process as a time series classification task and propose to automate it with neural networks. With a data set of over one million labeled records from 68 detectors, recorded between 2013 and 2019 by CRESST, we test the capability of four commonly used neural network architectures to learn the data cleaning task. Our best performing model achieves a balanced accuracy of 0.932 on our test set. We show on an exemplary detector that about half of the wrongly predicted events are in fact wrongly labeled events, and a large share of the remaining ones have a context-dependent ground truth. We furthermore evaluate the recall and selectivity of our classifiers with simulated data. The results confirm that the trained classifiers are well suited for the data cleaning task.
ISSN:	2331-8422
DOI:	10.48550/arxiv.2211.00564