Optimal Transport for Domain Adaptation through Gaussian Mixture Models

stp2yJanuary 23, 20250 Comments

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning

[Submitted on 18 Mar 2024 (v1), last revised 22 Jan 2025 (this version, v2)]

View a PDF of the paper titled Optimal Transport for Domain Adaptation through Gaussian Mixture Models, by Eduardo Fernandes Montesuma and 2 other authors

Abstract:Machine learning systems operate under the assumption that training and test data are sampled from a fixed probability distribution. However, this assumptions is rarely verified in practice, as the conditions upon which data was acquired are likely to change. In this context, the adaptation of the unsupervised domain requires minimal access to the data of the new conditions for learning models robust to changes in the data distribution. Optimal transport is a theoretically grounded tool for analyzing changes in distribution, especially as it allows the mapping between domains. However, these methods are usually computationally expensive as their complexity scales cubically with the number of samples. In this work, we explore optimal transport between Gaussian Mixture Models (GMMs), which is conveniently written in terms of the components of source and target GMMs. We experiment with 9 benchmarks, with a total of $85$ adaptation tasks, showing that our methods are more efficient than previous shallow domain adaptation methods, and they scale well with number of samples $n$ and dimensions $d$.

Submission history

From: Eduardo Fernandes Montesuma [view email]
[v1]
Mon, 18 Mar 2024 09:32:33 UTC (4,641 KB)
[v2]
Wed, 22 Jan 2025 12:47:49 UTC (9,058 KB)

Source link
lol

By stp2y