Effective Layer Pruning Through Similarity Metric Perspective

stp2yNovember 5, 20240 Comments

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning

[Submitted on 27 May 2024 (v1), last revised 4 Nov 2024 (this version, v2)]

View a PDF of the paper titled Effective Layer Pruning Through Similarity Metric Perspective, by Ian Pons and 2 other authors

View PDF
HTML (experimental)

Abstract:Deep neural networks have been the predominant paradigm in machine learning for solving cognitive tasks. Such models, however, are restricted by a high computational overhead, limiting their applicability and hindering advancements in the field. Extensive research demonstrated that pruning structures from these models is a straightforward approach to reducing network complexity. In this direction, most efforts focus on removing weights or filters. Studies have also been devoted to layer pruning as it promotes superior computational gains. However, layer pruning often hurts the network predictive ability (i.e., accuracy) at high compression rates. This work introduces an effective layer-pruning strategy that meets all underlying properties pursued by pruning methods. Our method estimates the relative importance of a layer using the Centered Kernel Alignment (CKA) metric, employed to measure the similarity between the representations of the unpruned model and a candidate layer for pruning. We confirm the effectiveness of our method on standard architectures and benchmarks, in which it outperforms existing layer-pruning strategies and other state-of-the-art pruning techniques. Particularly, we remove more than 75% of computation while improving predictive ability. At higher compression regimes, our method exhibits negligible accuracy drop, while other methods notably deteriorate model accuracy. Apart from these benefits, our pruned models exhibit robustness to adversarial and out-of-distribution samples.

Submission history

From: Ian Pons [view email]
[v1]
Mon, 27 May 2024 11:54:51 UTC (1,358 KB)
[v2]
Mon, 4 Nov 2024 18:39:10 UTC (1,358 KB)

Source link
lol

By stp2y