Provably effective detection of effective data poisoning attacks

stp2y, January 23, 2025


[Submitted on 21 Jan 2025]

Authors: Jonathan Gallagher and 3 other authors


Abstract: This paper establishes a mathematically precise definition of a dataset poisoning attack and proves that the very act of effectively poisoning a dataset ensures that the attack can be effectively detected. On top of a mathematical guarantee that dataset poisoning is identifiable by a new statistical test that we call the Conformal Separability Test, we provide experimental evidence that we can adequately detect poisoning attempts in the real world.

Submission history

From: Yasaman Esfandiari
[v1] Tue, 21 Jan 2025 00:07:55 UTC (23,253 KB)

