Sharpness-Aware Minimization Revisited: Weighted Sharpness as a Regularization Term
Main Authors | |
---|---|
Format | Journal Article |
Language | English |
Published | 25.05.2023 |
Summary: | The generalization of Deep Neural Networks (DNNs) is known to be closely related to the flatness of minima, which has led to the development of Sharpness-Aware Minimization (SAM) for seeking flatter minima and better generalization. In this paper, we revisit the loss of SAM and propose a more general method, called WSAM, by incorporating sharpness as a regularization term. We prove its generalization bound through the combination of PAC and Bayes-PAC techniques, and evaluate its performance on various public datasets. The results demonstrate that WSAM achieves improved generalization, or is at least highly competitive, compared to the vanilla optimizer, SAM, and its variants. The code is available at https://github.com/intelligent-machine-learning/dlrover/tree/master/atorch/atorch/optimizers. |
---|---|
DOI: | 10.48550/arxiv.2305.15817 |
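
The summary describes WSAM as incorporating sharpness as a regularization term on top of the vanilla loss. Below is a minimal PyTorch-style sketch of one natural reading of that idea: minimizing L(w) + gamma * [max_{||eps|| <= rho} L(w + eps) - L(w)], with the inner maximization approximated by a single SAM-style ascent step. The weighting form, the hyperparameter names `rho` and `gamma`, and the function `wsam_style_step` are illustrative assumptions, not the authors' implementation; their actual optimizer lives in the repository linked in the summary.

```python
import torch

def wsam_style_step(model, loss_fn, data, target, base_opt, rho=0.05, gamma=0.9):
    """One step minimizing L(w) + gamma * [L(w + eps) - L(w)] (illustrative sketch)."""
    # First pass: gradient of the vanilla loss at the current weights w.
    base_opt.zero_grad()
    loss_fn(model(data), target).backward()
    clean_grads = [p.grad.detach().clone() if p.grad is not None else None
                   for p in model.parameters()]

    # SAM-style ascent step: eps = rho * g / ||g||, applied in place to the weights.
    grad_norm = torch.norm(torch.stack(
        [g.norm(p=2) for g in clean_grads if g is not None]), p=2) + 1e-12
    perturbations = []
    with torch.no_grad():
        for p, g in zip(model.parameters(), clean_grads):
            eps = rho * g / grad_norm if g is not None else None
            if eps is not None:
                p.add_(eps)
            perturbations.append(eps)

    # Second pass: gradient of the perturbed loss at w + eps.
    base_opt.zero_grad()
    loss_fn(model(data), target).backward()

    # Undo the perturbation and blend the two gradients:
    # grad[(1 - gamma) * L + gamma * L_perturbed] = (1 - gamma) * g_clean + gamma * g_perturbed.
    with torch.no_grad():
        for p, g, eps in zip(model.parameters(), clean_grads, perturbations):
            if eps is not None:
                p.sub_(eps)
            if p.grad is not None and g is not None:
                p.grad.mul_(gamma).add_(g, alpha=1.0 - gamma)
    base_opt.step()
```

In this sketch, `gamma = 1` reduces to the standard SAM update (only the perturbed-loss gradient is used), while `gamma = 0` recovers the vanilla optimizer step; intermediate values weight the sharpness term against the original loss.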