LSVOS Challenge 3rd Place Report: SAM2 and Cutie based VOS

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning


View a PDF of the paper titled LSVOS Challenge 3rd Place Report: SAM2 and Cutie based VOS, by Xinyu Liu and 4 other authors

View PDF
HTML (experimental)

Abstract:Video Object Segmentation (VOS) presents several challenges, including object occlusion and fragmentation, the dis-appearance and re-appearance of objects, and tracking specific objects within crowded scenes. In this work, we combine the strengths of the state-of-the-art (SOTA) models SAM2 and Cutie to address these challenges. Additionally, we explore the impact of various hyperparameters on video instance segmentation performance. Our approach achieves a J&F score of 0.7952 in the testing phase of LSVOS challenge VOS track, ranking third overall.

Submission history

From: Xinyu Liu [view email]
[v1]
Tue, 20 Aug 2024 00:45:13 UTC (128 KB)
[v2]
Wed, 21 Aug 2024 00:39:38 UTC (128 KB)



Source link
lol

By stp2y

Leave a Reply

Your email address will not be published. Required fields are marked *

No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.