View a PDF of the paper titled TextureDiffusion: Target Prompt Disentangled Editing for Various Texture Transfer, by Zihan Su and 2 other authors
Abstract:Recently, text-guided image editing has achieved significant success. However, existing methods can only apply simple textures like wood or gold when changing the texture of an object. Complex textures such as cloud or fire pose a challenge. This limitation stems from that the target prompt needs to contain both the input image content and <texture>, restricting the texture representation. In this paper, we propose TextureDiffusion, a tuning-free image editing method applied to various texture transfer. Initially, the target prompt is directly set to “<texture>”, making the texture disentangled from the input image content to enhance texture representation. Subsequently, query features in self-attention and features in residual blocks are utilized to preserve the structure of the input image. Finally, to maintain the background, we introduce an edit localization technique which blends the self-attention results and the intermediate latents. Comprehensive experiments demonstrate that TextureDiffusion can harmoniously transfer various textures with excellent structure and background preservation. Code is publicly available at this https URL
Submission history
From: Zihan Su [view email]
[v1]
Sun, 15 Sep 2024 04:34:38 UTC (7,055 KB)
[v2]
Tue, 14 Jan 2025 09:44:01 UTC (5,384 KB)
Source link
lol