The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning


View a PDF of the paper titled The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives, by Samee Arif and 5 other authors

View PDF
HTML (experimental)

Abstract:This paper introduces the concept of an education tool that utilizes Generative Artificial Intelligence (GenAI) to enhance storytelling for children. The system combines GenAI-driven narrative co-creation, text-to-speech conversion, and text-to-video generation to produce an engaging experience for learners. We describe the co-creation process, the adaptation of narratives into spoken words using text-to-speech models, and the transformation of these narratives into contextually relevant visuals through text-to-video technology. Our evaluation covers the linguistics of the generated stories, the text-to-speech conversion quality, and the accuracy of the generated visuals.

Submission history

From: Samee Arif [view email]
[v1]
Tue, 17 Sep 2024 15:10:23 UTC (2,834 KB)
[v2]
Wed, 18 Sep 2024 09:38:22 UTC (2,834 KB)
[v3]
Thu, 19 Sep 2024 09:50:58 UTC (2,834 KB)



Source link
lol

By stp2y

Leave a Reply

Your email address will not be published. Required fields are marked *

No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.