Selfish Evolution: Making Discoveries in Extreme Label Noise with the Help of Overfitting Dynamics

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning


[Submitted on 26 Nov 2024]

View a PDF of the paper titled Selfish Evolution: Making Discoveries in Extreme Label Noise with the Help of Overfitting Dynamics, by Nima Sedaghat and 4 other authors

View PDF
HTML (experimental)

Abstract:Motivated by the scarcity of proper labels in an astrophysical application, we have developed a novel technique, called Selfish Evolution, which allows for the detection and correction of corrupted labels in a weakly supervised fashion. Unlike methods based on early stopping, we let the model train on the noisy dataset. Only then do we intervene and allow the model to overfit to individual samples. The “evolution” of the model during this process reveals patterns with enough information about the noisiness of the label, as well as its correct version. We train a secondary network on these spatiotemporal “evolution cubes” to correct potentially corrupted labels. We incorporate the technique in a closed-loop fashion, allowing for automatic convergence towards a mostly clean dataset, without presumptions about the state of the network in which we intervene. We evaluate on the main task of the Supernova-hunting dataset but also demonstrate efficiency on the more standard MNIST dataset.

Submission history

From: Nima Sedaghat Alvar [view email]
[v1]
Tue, 26 Nov 2024 18:51:43 UTC (1,656 KB)



Source link
lol

By stp2y

Leave a Reply

Your email address will not be published. Required fields are marked *

No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.