View a PDF of the paper titled Temporally Propagated Masks and Bounding Boxes: Combining the Best of Both Worlds for Multi-Object Tracking, by Tomasz Stanczyk and 1 other authors
Abstract:Multi-object tracking (MOT) involves identifying and consistently tracking objects across video sequences. Traditional tracking-by-detection methods, while effective, often require extensive tuning and lack generalizability. On the other hand, segmentation mask-based methods are more generic but struggle with tracking management, making them unsuitable for MOT. We propose a novel approach, McByte, which incorporates a temporally propagated segmentation mask as a strong association cue within a tracking-by-detection framework. By combining bounding box and propagated mask information, McByte enhances robustness and generalizability without per-sequence tuning. Evaluated on four benchmark datasets – DanceTrack, MOT17, SoccerNet-tracking 2022, and KITTI-tracking – McByte demonstrates performance gain in all cases examined. At the same time, it outperforms existing mask-based methods. Implementation code will be provided upon acceptance.
Submission history
From: Tomasz Stanczyk [view email]
[v1]
Sat, 21 Sep 2024 18:52:07 UTC (676 KB)
[v2]
Thu, 26 Sep 2024 08:13:43 UTC (2,623 KB)
[v3]
Fri, 22 Nov 2024 21:32:53 UTC (3,644 KB)
Source link
lol