A Generalization Bound for a Family of Implicit Networks

stp2yJanuary 29, 20250 Comments

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning

[Submitted on 9 Oct 2024 (v1), last revised 26 Jan 2025 (this version, v3)]

View a PDF of the paper titled A Generalization Bound for a Family of Implicit Networks, by Samy Wu Fung and Benjamin Berkels

View PDF
HTML (experimental)

Abstract:Implicit networks are a class of neural networks whose outputs are defined by the fixed point of a parameterized operator. They have enjoyed success in many applications including natural language processing, image processing, and numerous other applications. While they have found abundant empirical success, theoretical work on its generalization is still under-explored. In this work, we consider a large family of implicit networks defined parameterized contractive fixed point operators. We show a generalization bound for this class based on a covering number argument for the Rademacher complexity of these architectures.

Submission history

From: Samy Wu Fung [view email]
[v1]
Wed, 9 Oct 2024 20:44:15 UTC (127 KB)
[v2]
Fri, 11 Oct 2024 02:55:20 UTC (127 KB)
[v3]
Sun, 26 Jan 2025 00:17:48 UTC (141 KB)

Source link
lol

By stp2y