Investigating Robustness of Open-Vocabulary Foundation Object Detectors under Distribution Shifts

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning


[Submitted on 1 Apr 2024]

View a PDF of the paper titled Investigating Robustness of Open-Vocabulary Foundation Object Detectors under Distribution Shifts, by Prakash Chandra Chhipa and 4 other authors

View PDF

Abstract:The challenge of Out-Of-Distribution (OOD) robustness remains a critical hurdle towards deploying deep vision models. Open-vocabulary object detection extends the capabilities of traditional object detection frameworks to recognize and classify objects beyond predefined categories. Investigating OOD robustness in open-vocabulary object detection is essential to increase the trustworthiness of these models. This study presents a comprehensive robustness comparison of zero-shot capabilities of three recent open-vocabulary foundation object detection models, namely OWL-ViT, YOLO World, and Grounding DINO. Experiments carried out on the COCO-O and COCO-C benchmarks encompassing distribution shifts highlight the challenges of the models’ robustness. Source code shall be made available to the research community on GitHub.

Submission history

From: Prakash Chandra Chhipa [view email]
[v1]
Mon, 1 Apr 2024 14:18:15 UTC (3,296 KB)



Source link
lol

By stp2y

Leave a Reply

Your email address will not be published. Required fields are marked *

No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.