A Generalization Bound for a Family of Implicit Networks

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning


View a PDF of the paper titled A Generalization Bound for a Family of Implicit Networks, by Samy Wu Fung and Benjamin Berkels

View PDF
HTML (experimental)

Abstract:Implicit networks are a class of neural networks whose outputs are defined by the fixed point of a parameterized operator. They have enjoyed success in many applications including natural language processing, image processing, and numerous other applications. While they have found abundant empirical success, theoretical work on its generalization is still under-explored. In this work, we consider a large family of implicit networks defined parameterized contractive fixed point operators. We show a generalization bound for this class based on a covering number argument for the Rademacher complexity of these architectures.

Submission history

From: Samy Wu Fung [view email]
[v1]
Wed, 9 Oct 2024 20:44:15 UTC (127 KB)
[v2]
Fri, 11 Oct 2024 02:55:20 UTC (127 KB)
[v3]
Sun, 26 Jan 2025 00:17:48 UTC (141 KB)



Source link
lol

By stp2y

Leave a Reply

Your email address will not be published. Required fields are marked *

No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.