Capturing the Unseen: Vision-Free Facial Motion Capture Using Inertial Measurement Units

[Submitted on 3 Feb 2024 (v1), last revised 19 Sep 2024 (this version, v4)]

Authors: Youjia Wang, Yiwen Wu, Hengan Zhou, Hongyang Lin, Xingyue Peng, Jingyan Zhang, Yingsheng Zhu, Yingwenqi Jiang, Yatu Zhang, Lan Xu, Jingya Wang, Jingyi Yu

Abstract: We present Capturing the Unseen (CAPUS), a novel facial motion capture (MoCap) technique that operates without visual signals. CAPUS leverages miniaturized Inertial Measurement Units (IMUs) as a new sensing modality for facial motion capture. While IMUs have become essential in full-body MoCap for their portability and independence from environmental conditions, their application in facial MoCap remains underexplored. We address this by customizing micro-IMUs, small enough to be placed on the face, and strategically positioning them in alignment with key facial muscles to capture expression dynamics. CAPUS introduces the first facial IMU dataset, encompassing both IMU and visual signals from participants engaged in diverse activities such as multilingual speech, facial expressions, and emotionally intoned auditions. We train a Transformer Diffusion-based neural network to infer Blendshape parameters directly from IMU data. Our experimental results demonstrate that CAPUS reliably captures facial motion in conditions where visual-based methods struggle, including facial occlusions, rapid movements, and low-light environments. Additionally, by eliminating the need for visual inputs, CAPUS offers enhanced privacy protection, making it a robust solution for vision-free facial MoCap.

Submission history

From: Yiwen Wu
[v1] Sat, 3 Feb 2024 14:27:18 UTC (42,175 KB)
[v2] Wed, 29 May 2024 09:47:25 UTC (21,069 KB)
[v3] Wed, 12 Jun 2024 12:06:01 UTC (21,069 KB)
[v4] Thu, 19 Sep 2024 06:34:12 UTC (14,782 KB)

By stp2y
