View a PDF of the paper titled A Generalization Bound for a Family of Implicit Networks, by Samy Wu Fung and Benjamin Berkels
Abstract:Implicit networks are a class of neural networks whose outputs are defined by the fixed point of a parameterized operator. They have enjoyed success in many applications including natural language processing, image processing, and numerous other applications. While they have found abundant empirical success, theoretical work on its generalization is still under-explored. In this work, we consider a large family of implicit networks defined parameterized contractive fixed point operators. We show a generalization bound for this class based on a covering number argument for the Rademacher complexity of these architectures.
Submission history
From: Samy Wu Fung [view email]
[v1]
Wed, 9 Oct 2024 20:44:15 UTC (127 KB)
[v2]
Fri, 11 Oct 2024 02:55:20 UTC (127 KB)
[v3]
Sun, 26 Jan 2025 00:17:48 UTC (141 KB)
Source link
lol