ShifCon: Enhancing Non-Dominant Language Capabilities with a Shift-based Contrastive Framework

By Hengyuan Zhang and 7 other authors


Abstract: Although fine-tuning Large Language Models (LLMs) with multilingual data can rapidly enhance their multilingual capabilities, these models still exhibit a performance gap between the dominant language (e.g., English) and non-dominant ones due to the imbalance of training data across languages. To further enhance the performance of non-dominant languages, we propose ShifCon, a Shift-based Contrastive framework that aligns the internal forward process of other languages toward that of the dominant one. Specifically, it shifts the representations of non-dominant languages into the dominant language subspace, allowing them to access the relatively rich information encoded in the model parameters. The enriched representations are then shifted back into their original language subspace before generation. Moreover, we introduce a subspace distance metric to pinpoint the optimal layer area for shifting representations and employ multilingual contrastive learning to further enhance the alignment of representations within this area. Experiments demonstrate that our ShifCon framework significantly enhances the performance of non-dominant languages, particularly low-resource ones. Further analysis offers extra insights to verify the effectiveness of ShifCon and propel future research.
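The shift-then-shift-back idea and the contrastive objective can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not say how the shift is computed, so the mean-vector translation below is an assumption, as are the standard InfoNCE loss, the helper names (mean_vector, shift, info_nce), and the two-dimensional toy vectors.

```python
import math

def mean_vector(rows):
    """Column-wise mean of a list of equal-length vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]

def shift(vec, from_mean, to_mean):
    """Translate vec from one language's mean to another's.
    ASSUMPTION: a plain mean-difference shift, chosen for illustration."""
    return [v - f + t for v, f, t in zip(vec, from_mean, to_mean)]

def cosine(a, b):
    """Cosine similarity of two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def info_nce(anchor, positive, negatives, temperature=0.1):
    """Standard InfoNCE loss for one anchor (assumed contrastive objective)."""
    logits = [cosine(anchor, positive) / temperature]
    logits += [cosine(anchor, neg) / temperature for neg in negatives]
    peak = max(logits)  # subtract the max for numerical stability
    log_sum = peak + math.log(sum(math.exp(l - peak) for l in logits))
    return log_sum - logits[0]

# Hypothetical hidden states for a dominant and a non-dominant language.
dominant = [[1.0, 0.0], [0.8, 0.2]]
non_dominant = [[0.0, 1.0], [0.2, 0.8]]
mu_dom = mean_vector(dominant)
mu_non = mean_vector(non_dominant)

# Shift into the dominant subspace, then back before generation.
shifted = [shift(h, mu_non, mu_dom) for h in non_dominant]
restored = [shift(h, mu_dom, mu_non) for h in shifted]

# Contrastive loss is low for an aligned pair and high for a mismatched one.
loss_aligned = info_nce([1.0, 0.0], [1.0, 0.0], [[0.0, 1.0]])
loss_mismatched = info_nce([1.0, 0.0], [0.0, 1.0], [[1.0, 0.0]])
```

In this toy setup the shifted vectors share the dominant language's mean, and shifting back recovers the original vectors exactly, because both steps are pure translations. A real model would apply such operations to hidden states in selected layers, which this sketch does not attempt.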

Submission history

From: Hengyuan Zhang
[v1] Fri, 25 Oct 2024 10:28:59 UTC (2,374 KB)
[v2] Wed, 6 Nov 2024 11:49:10 UTC (2,374 KB)
[v3] Wed, 27 Nov 2024 08:17:09 UTC (2,374 KB)




By stp2y
