Sharpness-Aware Minimization Revisited: Weighted Sharpness as a Regularization Term



Authors: Yun Yue and 5 other authors


Abstract: The generalization of deep neural networks (DNNs) is known to be closely related to the flatness of minima, which led to the development of Sharpness-Aware Minimization (SAM) for seeking flatter minima and better generalization. In this paper, we revisit the loss of SAM and propose a more general method, called WSAM, that incorporates sharpness as a regularization term. We prove its generalization bound through a combination of PAC and Bayes-PAC techniques and evaluate its performance on various public datasets. The results demonstrate that WSAM achieves improved generalization, or is at least highly competitive, compared to the vanilla optimizer, SAM, and its variants. The code is available at this https URL.
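To make the idea of "sharpness as a regularization term" concrete, here is a minimal sketch on a toy quadratic loss. It is not the authors' implementation. It assumes that sharpness is measured as the loss at a SAM-style perturbed point minus the loss at the current point, and that the perturbation is the usual first-order step of radius rho along the normalized gradient. The names `loss`, `grad`, `weighted_sharpness_loss`, `rho`, and `lam` are hypothetical, and the weight `lam` is an illustrative parameterization that may differ from the one used in the paper.

```python
import numpy as np

# Arbitrary positive-definite matrix defining a toy quadratic loss (illustrative only).
A = np.array([[3.0, 0.0], [0.0, 0.5]])

def loss(w):
    """Toy loss L(w) = 0.5 * w^T A w."""
    return 0.5 * w @ A @ w

def grad(w):
    """Gradient of the toy loss."""
    return A @ w

def weighted_sharpness_loss(w, rho=0.05, lam=1.0):
    """Return (L(w) + lam * sharpness, sharpness).

    Assumption: sharpness = L(w + eps) - L(w), where eps is the first-order
    SAM-style perturbation rho * g / ||g||. `lam` is a hypothetical weight.
    """
    g = grad(w)
    norm = np.linalg.norm(g)
    # Guard against a zero gradient, where no ascent direction is defined.
    eps = rho * g / norm if norm > 0 else np.zeros_like(w)
    base = loss(w)
    sharpness = loss(w + eps) - base
    return base + lam * sharpness, sharpness

w = np.array([1.0, -2.0])
total, sharpness = weighted_sharpness_loss(w, rho=0.05, lam=2.0)
print(total, sharpness)
```

Under these assumptions, `lam=0` gives back the plain loss and `lam=1` gives back the perturbed loss L(w + eps) that SAM minimizes, so the weight interpolates and extrapolates between the two.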

Submission history

From: Yun Yue
[v1] Thu, 25 May 2023 08:00:34 UTC (568 KB)
[v2] Fri, 9 Jun 2023 07:58:13 UTC (569 KB)
[v3] Thu, 5 Dec 2024 07:31:10 UTC (569 KB)




By stp2y
