View a PDF of the paper titled On Newton’s Method to Unlearn Neural Networks, by Nhung Bui and 4 other authors
Abstract:With the widespread applications of neural networks (NNs) trained on personal data, machine unlearning has become increasingly important for enabling individuals to exercise their personal data ownership, particularly the “right to be forgotten” from trained NNs. Since retraining is computationally expensive, we seek approximate unlearning algorithms for NNs that return identical models to the retrained oracle. While Newton’s method has been successfully used to approximately unlearn linear models, we observe that adapting it for NN is challenging due to degenerate Hessians that make computing Newton’s update impossible. Additionally, we show that when coupled with popular techniques to resolve the degeneracy, Newton’s method often incurs offensively large norm updates and empirically degrades model performance post-unlearning. To address these challenges, we propose CureNewton’s method, a principle approach that leverages cubic regularization to handle the Hessian degeneracy effectively. The added regularizer eliminates the need for manual finetuning and affords a natural interpretation within the unlearning context. Experiments across different models and datasets show that our method can achieve competitive unlearning performance to the state-of-the-art algorithm in practical unlearning settings, while being theoretically justified and efficient in running time.
Submission history
From: Xinyang Lu [view email]
[v1]
Thu, 20 Jun 2024 17:12:20 UTC (529 KB)
[v2]
Tue, 27 Aug 2024 17:19:20 UTC (4,764 KB)
Source link
lol