RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Images

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning


View a PDF of the paper titled RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Images, by Benzhi Wang and 6 other authors

View PDF
HTML (experimental)

Abstract:In recent years, diffusion models have revolutionized visual generation, outperforming traditional frameworks like Generative Adversarial Networks (GANs). However, generating images of humans with realistic semantic parts, such as hands and faces, remains a significant challenge due to their intricate structural complexity. To address this issue, we propose a novel post-processing solution named RealisHuman. The RealisHuman framework operates in two stages. First, it generates realistic human parts, such as hands or faces, using the original malformed parts as references, ensuring consistent details with the original image. Second, it seamlessly integrates the rectified human parts back into their corresponding positions by repainting the surrounding areas to ensure smooth and realistic blending. The RealisHuman framework significantly enhances the realism of human generation, as demonstrated by notable improvements in both qualitative and quantitative metrics. Code is available at this https URL.

Submission history

From: Benzhi Wang [view email]
[v1]
Thu, 5 Sep 2024 16:02:11 UTC (5,455 KB)
[v2]
Wed, 13 Nov 2024 01:45:31 UTC (5,455 KB)



Source link
lol

By stp2y

Leave a Reply

Your email address will not be published. Required fields are marked *

No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.