View a PDF of the paper titled Estimating Body and Hand Motion in an Ego-sensed World, by Brent Yi and 8 other authors
Abstract:We present EgoAllo, a system for human motion estimation from a head-mounted device. Using only egocentric SLAM poses and images, EgoAllo guides sampling from a conditional diffusion model to estimate 3D body pose, height, and hand parameters that capture a device wearer’s actions in the allocentric coordinate frame of the scene. To achieve this, our key insight is in representation: we propose spatial and temporal invariance criteria for improving model performance, from which we derive a head motion conditioning parameterization that improves estimation by up to 18%. We also show how the bodies estimated by our system can improve hand estimation: the resulting kinematic and temporal constraints can reduce world-frame errors in single-frame estimates by 40%. Project page: this https URL
Submission history
From: Brent Yi [view email]
[v1]
Fri, 4 Oct 2024 17:59:57 UTC (10,364 KB)
[v2]
Thu, 17 Oct 2024 20:51:19 UTC (20,276 KB)
[v3]
Tue, 17 Dec 2024 18:39:00 UTC (18,751 KB)
Source link
lol