CLIBD: Bridging Vision and Genomics for Biodiversity Monitoring at Scale

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning


View a PDF of the paper titled CLIBD: Bridging Vision and Genomics for Biodiversity Monitoring at Scale, by ZeMing Gong and 6 other authors

View PDF
HTML (experimental)

Abstract:Measuring biodiversity is crucial for understanding ecosystem health. While prior works have developed machine learning models for taxonomic classification of photographic images and DNA separately, in this work, we introduce a multimodal approach combining both, using CLIP-style contrastive learning to align images, barcode DNA, and text-based representations of taxonomic labels in a unified embedding space. This allows for accurate classification of both known and unknown insect species without task-specific fine-tuning, leveraging contrastive learning for the first time to fuse DNA and image data. Our method surpasses previous single-modality approaches in accuracy by over 8% on zero-shot learning tasks, showcasing its effectiveness in biodiversity studies.

Submission history

From: ZeMing Gong [view email]
[v1]
Mon, 27 May 2024 17:57:48 UTC (5,008 KB)
[v2]
Thu, 31 Oct 2024 20:07:53 UTC (41,938 KB)



Source link
lol

By stp2y

Leave a Reply

Your email address will not be published. Required fields are marked *

No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.