T2VSafetyBench: Evaluating the Safety of Text-to-Video Generative Models

stp2y, September 11, 2024

[Submitted on 8 Jul 2024 (v1), last revised 8 Sep 2024 (this version, v3)]

Authors: Yibo Miao and 5 other authors

Abstract: The recent development of Sora leads to a new era in text-to-video (T2V) generation. Along with this comes the rising concern about its security risks. The generated videos may contain illegal or unethical content, and there is a lack of comprehensive quantitative understanding of their safety, posing a challenge to their reliability and practical deployment. Previous evaluations primarily focus on the quality of video generation. While some evaluations of text-to-image models have considered safety, they cover fewer aspects and do not address the unique temporal risk inherent in video generation. To bridge this research gap, we introduce T2VSafetyBench, a new benchmark designed for conducting safety-critical assessments of text-to-video models. We define 12 critical aspects of video generation safety and construct a malicious prompt dataset including real-world prompts, LLM-generated prompts and jailbreak attack-based prompts. Based on our evaluation results, we draw several important findings, including: 1) no single model excels in all aspects, with different models showing various strengths; 2) the correlation between GPT-4 assessments and manual reviews is generally high; 3) there is a trade-off between the usability and safety of text-to-video generative models. This indicates that as the field of video generation rapidly advances, safety risks are set to surge, highlighting the urgency of prioritizing video safety. We hope that T2VSafetyBench can provide insights for better understanding the safety of video generation in the era of generative AI.

Submission history

From: Yibo Miao
[v1] Mon, 8 Jul 2024 14:04:58 UTC (17,749 KB)
[v2] Sun, 1 Sep 2024 15:13:52 UTC (17,758 KB)
[v3] Sun, 8 Sep 2024 16:19:53 UTC (17,758 KB)
