Sharpness-Aware Minimization Revisited: Weighted Sharpness as a Regularization Term

Deep Neural Networks (DNNs) generalization is known to be closely related to the flatness of minima, leading to the development of Sharpness-Aware Minimization (SAM) for seeking flatter minima and better generalization. In this paper, we revisit the loss of SAM and propose a more general method, cal...

Full description

Saved in:

Bibliographic Details
Main Authors	Yue, Yun, Jiang, Jiadi, Ye, Zhiling, Gao, Ning, Liu, Yongchao, Zhang, Ke
Format	Journal Article
Language	English
Published	25.05.2023
Subjects	Computer Science - Learning
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Deep Neural Networks (DNNs) generalization is known to be closely related to the flatness of minima, leading to the development of Sharpness-Aware Minimization (SAM) for seeking flatter minima and better generalization. In this paper, we revisit the loss of SAM and propose a more general method, called WSAM, by incorporating sharpness as a regularization term. We prove its generalization bound through the combination of PAC and Bayes-PAC techniques, and evaluate its performance on various public datasets. The results demonstrate that WSAM achieves improved generalization, or is at least highly competitive, compared to the vanilla optimizer, SAM and its variants. The code is available at https://github.com/intelligent-machine-learning/dlrover/tree/master/atorch/atorch/optimizers.
DOI:	10.48550/arxiv.2305.15817