Nonparametric Classification on Low Dimensional Manifolds using Overparameterized Convolutional Residual Networks

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning


View a PDF of the paper titled Nonparametric Classification on Low Dimensional Manifolds using Overparameterized Convolutional Residual Networks, by Zixuan Zhang and 6 other authors

View PDF
HTML (experimental)

Abstract:Convolutional residual neural networks (ConvResNets), though overparameterized, can achieve remarkable prediction performance in practice, which cannot be well explained by conventional wisdom. To bridge this gap, we study the performance of ConvResNeXts, which cover ConvResNets as a special case, trained with weight decay from the perspective of nonparametric classification. Our analysis allows for infinitely many building blocks in ConvResNeXts, and shows that weight decay implicitly enforces sparsity on these blocks. Specifically, we consider a smooth target function supported on a low-dimensional manifold, then prove that ConvResNeXts can adapt to the function smoothness and low-dimensional structures and efficiently learn the function without suffering from the curse of dimensionality. Our findings partially justify the advantage of overparameterized ConvResNeXts over conventional machine learning models.

Submission history

From: Zixuan Zhang [view email]
[v1]
Tue, 4 Jul 2023 11:08:03 UTC (184 KB)
[v2]
Sun, 18 Feb 2024 03:29:20 UTC (1,689 KB)
[v3]
Tue, 10 Dec 2024 06:15:04 UTC (1,566 KB)



Source link
lol

By stp2y

Leave a Reply

Your email address will not be published. Required fields are marked *

No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.