EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation

[Submitted on 28 Oct 2024 (v1), last revised 21 Nov 2024 (this version, v2)]

Authors: Shih-Yang Liu, Huck Yang, Chien-Yi Wang, Nai Chit Fung, Hongxu Yin, Charbel Sakr, Saurav Muralidharan, Kwang-Ting Cheng, Jan Kautz, Yu-Chiang Frank Wang, Pavlo Molchanov, Min-Hung Chen

Abstract: In this work, we re-formulate the model compression problem into the customized compensation problem: Given a compressed model, we aim to introduce residual low-rank paths to compensate for compression errors under customized…