Image Super-Resolution with Text Prompt Diffusion

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning


View a PDF of the paper titled Image Super-Resolution with Text Prompt Diffusion, by Zheng Chen and 6 other authors

View PDF
HTML (experimental)

Abstract:Image super-resolution (SR) methods typically model degradation to improve reconstruction accuracy in complex and unknown degradation scenarios. However, extracting degradation information from low-resolution images is challenging, which limits the model performance. To boost image SR performance, one feasible approach is to introduce additional priors. Inspired by advancements in multi-modal methods and text prompt image processing, we introduce text prompts to image SR to provide degradation priors. Specifically, we first design a text-image generation pipeline to integrate text into the SR dataset through the text degradation representation and degradation model. The text representation applies a discretization manner based on the binning method to describe the degradation abstractly. This method maintains the flexibility of the text and is user-friendly. Meanwhile, we propose the PromptSR to realize the text prompt SR. The PromptSR utilizes the pre-trained language model (e.g., T5 or CLIP) to enhance restoration. We train the PromptSR on the generated text-image dataset. Extensive experiments indicate that introducing text prompts into SR, yields excellent results on both synthetic and real-world images. Code is available at: this https URL.

Submission history

From: Zheng Chen [view email]
[v1]
Fri, 24 Nov 2023 05:11:35 UTC (11,546 KB)
[v2]
Tue, 12 Mar 2024 12:14:51 UTC (17,868 KB)
[v3]
Tue, 8 Oct 2024 10:30:00 UTC (15,795 KB)
[v4]
Thu, 10 Oct 2024 05:47:46 UTC (15,794 KB)



Source link
lol

By stp2y

Leave a Reply

Your email address will not be published. Required fields are marked *

No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.