The Numerical Stability of Hyperbolic Representation Learning

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning


View a PDF of the paper titled The Numerical Stability of Hyperbolic Representation Learning, by Gal Mishne and 3 other authors

View PDF
HTML (experimental)

Abstract:Given the exponential growth of the volume of the ball w.r.t. its radius, the hyperbolic space is capable of embedding trees with arbitrarily small distortion and hence has received wide attention for representing hierarchical datasets. However, this exponential growth property comes at a price of numerical instability such that training hyperbolic learning models will sometimes lead to catastrophic NaN problems, encountering unrepresentable values in floating point arithmetic. In this work, we carefully analyze the limitation of two popular models for the hyperbolic space, namely, the Poincaré ball and the Lorentz model. We first show that, under the 64 bit arithmetic system, the Poincaré ball has a relatively larger capacity than the Lorentz model for correctly representing points. Then, we theoretically validate the superiority of the Lorentz model over the Poincaré ball from the perspective of optimization. Given the numerical limitations of both models, we identify one Euclidean parametrization of the hyperbolic space which can alleviate these limitations. We further extend this Euclidean parametrization to hyperbolic hyperplanes and exhibits its ability in improving the performance of hyperbolic SVM.

Submission history

From: Zhengchao Wan [view email]
[v1]
Mon, 31 Oct 2022 22:51:59 UTC (3,494 KB)
[v2]
Tue, 6 Jun 2023 23:22:11 UTC (4,848 KB)
[v3]
Wed, 28 Jun 2023 02:54:30 UTC (6,454 KB)
[v4]
Tue, 24 Dec 2024 04:28:34 UTC (6,142 KB)



Source link
lol

By stp2y

Leave a Reply

Your email address will not be published. Required fields are marked *

No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.