MM-Conv: A Multi-modal Conversational Dataset for Virtual Humans

arXiv:2410.00253v1 Announce Type: new
Abstract: In this paper, we present a novel dataset captured using a VR headset to record conversations between participants within a physics simulator (AI2-THOR). Our primary objective is to extend the field of co-speech gesture generation by incorporating rich contextual information within referential settings. Participants engaged in various conversational scenarios, all based on referential communication tasks. The dataset provides a rich set of multimodal recordings such as motion capture, speech, gaze, and scene graphs. This comprehensive dataset aims to enhance the understanding and development of gesture generation models in 3D scenes by providing diverse and contextually rich data.

Source link
lol

MM-Conv: A Multi-modal Conversational Dataset for Virtual Humans

By stp2y

Leave a Reply Cancel reply