No (good) loss no gain: systematic evaluation of loss functions in deep learning-based side-channel analysis

Bibliographic Details
Published in: Journal of Cryptographic Engineering, Vol. 13, No. 3, pp. 311–324
Main Authors: Kerkhof, Maikel; Wu, Lichao; Perin, Guilherme; Picek, Stjepan
Format: Journal Article
Language: English
Published: Berlin/Heidelberg: Springer Berlin Heidelberg (Springer Nature B.V.), 01.09.2023
Summary: Deep learning is a powerful direction for profiling side-channel analysis as it can break targets protected with countermeasures even with a relatively small number of attack traces. Still, it is necessary to conduct hyperparameter tuning to reach strong attack performance, which can be far from trivial. Besides many options stemming from the machine learning domain, recent years also brought neural network elements specially designed for side-channel analysis. The loss function, which calculates the error or loss between the actual and desired output, is one of the most important neural network elements. The resulting loss values guide the weight updates associated with the connections between the neurons or filters of the deep learning neural network. Unfortunately, despite being a highly relevant hyperparameter, there are no systematic comparisons among different loss functions regarding their effectiveness in side-channel attacks. This work provides a detailed study of the efficiency of different loss functions in the SCA context. We evaluate five loss functions commonly used in machine learning and three loss functions specifically designed for SCA. Our results show that an SCA-specific loss function (called CER) performs very well and outperforms other loss functions in most evaluated settings. Still, categorical cross-entropy represents a good option, especially considering the variety of neural network architectures.
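For readers unfamiliar with the baseline the abstract mentions, categorical cross-entropy penalizes a classifier by the negative log-probability it assigns to the true class (in SCA, typically an intermediate value such as a key-byte-dependent S-box output). The sketch below is purely illustrative and is not code from the paper; the 4-class distribution is an invented example.

```python
import math

def categorical_cross_entropy(predicted, target_class):
    """Categorical cross-entropy for a single trace: -log of the
    probability the network assigns to the true class.
    `predicted` is a probability distribution over the classes."""
    eps = 1e-12  # guard against log(0) for a zero-probability class
    return -math.log(predicted[target_class] + eps)

# Hypothetical 4-class softmax output for one attack trace.
probs = [0.10, 0.70, 0.15, 0.05]

loss_correct = categorical_cross_entropy(probs, 1)  # true class has p=0.70
loss_wrong = categorical_cross_entropy(probs, 3)    # true class has p=0.05

# A confident, correct prediction yields a lower loss,
# so gradient descent pushes probability mass toward the true class.
assert loss_correct < loss_wrong
```

In a real profiling attack the per-trace losses are averaged over a batch and backpropagated to update the network weights, which is the role the abstract describes for the loss function.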
ISSN: 2190-8508, 2190-8516
DOI: 10.1007/s13389-023-00320-6