View a PDF of the paper titled Tougher Text, Smarter Models: Raising the Bar for Adversarial Defence Benchmarks, by Yang Wang and 1 other authors
Abstract:Recent advancements in natural language processing have highlighted the vulnerability of deep learning models to adversarial attacks. While various defence mechanisms have been proposed, there is a lack of comprehensive benchmarks that evaluate these defences across diverse datasets, models, and tasks. In this work, we address this gap by presenting an extensive benchmark for textual adversarial defence that significantly expands upon previous work. Our benchmark incorporates a wide range of datasets, evaluates state-of-the-art defence mechanisms, and extends the assessment to include critical tasks such as single-sentence classification, similarity and paraphrase identification, natural language inference, and commonsense reasoning. This work not only serves as a valuable resource for researchers and practitioners in the field of adversarial robustness but also identifies key areas for future research in textual adversarial defence. By establishing a new standard for benchmarking in this domain, we aim to accelerate progress towards more robust and reliable natural language processing systems.
Submission history
From: Yang Wang [view email]
[v1]
Sun, 5 Jan 2025 20:39:52 UTC (91 KB)
[v2]
Wed, 8 Jan 2025 14:53:41 UTC (91 KB)
Source link
lol