Approximating Langevin Monte Carlo with ResNet-like Neural Network architectures

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning


View a PDF of the paper titled Approximating Langevin Monte Carlo with ResNet-like Neural Network architectures, by Charles Miranda and 3 other authors

View PDF

Abstract:We sample from a given target distribution by constructing a neural network which maps samples from a simple reference, e.g. the standard normal distribution, to samples from the target. To that end, we propose using a neural network architecture inspired by the Langevin Monte Carlo (LMC) algorithm. Based on LMC perturbation results, we show approximation rates of the proposed architecture for smooth, log-concave target distributions measured in the Wasserstein-$2$ distance. The analysis heavily relies on the notion of sub-Gaussianity of the intermediate measures of the perturbed LMC process. In particular, we derive bounds on the growth of the intermediate variance proxies under different assumptions on the perturbations. Moreover, we propose an architecture similar to deep residual neural networks and derive expressivity results for approximating the sample to target distribution map.

Submission history

From: Janina Enrica Schütte [view email]
[v1]
Mon, 6 Nov 2023 16:31:09 UTC (350 KB)
[v2]
Mon, 22 Jan 2024 13:40:16 UTC (699 KB)
[v3]
Tue, 10 Dec 2024 10:55:44 UTC (1,090 KB)



Source link
lol

By stp2y

Leave a Reply

Your email address will not be published. Required fields are marked *

No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.