SurgicaL-CD: Generating Surgical Images via Unpaired Image Translation with Latent Consistency Diffusion Models

stp2yOctober 14, 20240 Comments

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning

[Submitted on 19 Aug 2024 (v1), last revised 11 Oct 2024 (this version, v3)]

View a PDF of the paper titled SurgicaL-CD: Generating Surgical Images via Unpaired Image Translation with Latent Consistency Diffusion Models, by Danush Kumar Venkatesh and 3 other authors

Abstract:Computer-assisted surgery (CAS) systems are designed to assist surgeons during procedures, thereby reducing complications and enhancing patient care. Training machine learning models for these systems requires a large corpus of annotated datasets, which is challenging to obtain in the surgical domain due to patient privacy concerns and the significant labeling effort required from doctors. Previous methods have explored unpaired image translation using generative models to create realistic surgical images from simulations. However, these approaches have struggled to produce high-quality, diverse surgical images. In this work, we introduce emph{SurgicaL-CD}, a consistency-distilled diffusion method to generate realistic surgical images with only a few sampling steps without paired data. We evaluate our approach on three datasets, assessing the generated images in terms of quality and utility as downstream training datasets. Our results demonstrate that our method outperforms GANs and diffusion-based approaches. Our code is available at this https URL.

Submission history

From: Danush Kumar Venkatesh [view email]
[v1]
Mon, 19 Aug 2024 09:19:25 UTC (20,365 KB)
[v2]
Fri, 23 Aug 2024 13:01:11 UTC (20,365 KB)
[v3]
Fri, 11 Oct 2024 07:46:11 UTC (6,271 KB)

Source link
lol

By stp2y