Impact of Inaccurate Contamination Ratio on Robust Unsupervised Anomaly Detection

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning


[Submitted on 14 Aug 2024]

View a PDF of the paper titled Impact of Inaccurate Contamination Ratio on Robust Unsupervised Anomaly Detection, by Jordan F. Masakuna and 4 other authors

View PDF
HTML (experimental)

Abstract:Training data sets intended for unsupervised anomaly detection, typically presumed to be anomaly-free, often contain anomalies (or contamination), a challenge that significantly undermines model performance. Most robust unsupervised anomaly detection models rely on contamination ratio information to tackle contamination. However, in reality, contamination ratio may be inaccurate. We investigate on the impact of inaccurate contamination ratio information in robust unsupervised anomaly detection. We verify whether they are resilient to misinformed contamination ratios. Our investigation on 6 benchmark data sets reveals that such models are not adversely affected by exposure to misinformation. In fact, they can exhibit improved performance when provided with such inaccurate contamination ratios.

Submission history

From: Jordan Felicien Masakuna [view email]
[v1]
Wed, 14 Aug 2024 08:49:41 UTC (1,026 KB)



Source link
lol

By stp2y

Leave a Reply

Your email address will not be published. Required fields are marked *

No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.