UnsupTrack: Simple Unsupervised Multi-Object Tracking


Video not available.

Rendering of new sequences is currently deactivated due to heavy load.

Rendering of new sequences is currently deactivated due to heavy load.

Rendering of new sequences is currently deactivated due to heavy load.

Rendering of new sequences is currently deactivated due to heavy load.

Benchmark:

MOT17 | MOT16 | MOT20 |

Short name:

UnsupTrack

Detector:

Public

Description:

Multi-object tracking has seen a lot of progress recently, albeit with substantial annotation costs for developing better and larger labeled datasets. In this work, we remove the need for annotated datasets by proposing an unsupervised re-identification network, thus sidestepping the labeling costs entirely, required for training. Given unlabeled videos, our proposed method (SimpleReID) first generates tracking labels using SORT and trains a ReID network to predict the generated labels using crossentropy loss. We demonstrate that SimpleReID performs substantially better than simpler alternatives, and we recover the full performance of its supervised counterpart consistently across diverse tracking frameworks. The observations are unusual because unsupervised ReID is not expected to excel in crowded scenarios with occlusions, and drastic viewpoint changes. By incorporating our unsupervised SimpleReID with CenterTrack trained on augmented still images, we establish a new state-of-the-art performance on popular datasets like MOT16/17 without using tracking supervision, beating current best (CenterTrack) by 0.2-0.3 MOTA and 4.4-4.8 IDF1 scores. We further provide evidence for limited scope for improvement in IDF1 scores beyond our unsupervised ReID in the studied settings. Our investigation suggests reconsideration towards more sophisticated, supervised, end-to-end trackers by showing promise in simpler unsupervised alternatives.

Reference:

S. Karthik, A. Prabhu, V. Gandhi. Simple Unsupervised Multi-Object Tracking. In Arxiv, 2020.

Last submitted:

June 19, 2020 (4 years ago)

Published:

June 12, 2020 at 11:37:09 CET

Submissions:

2

Project page / code:

n/a

Open source:

No

Hardware:

GTX 1080Ti

Runtime:

1.3 Hz

Benchmark performance:

Sequence MOTA IDF1 HOTA MT ML FP FN Rcll Prcn AssA DetA AssRe AssPr DetRe DetPr LocA FAF ID Sw. Frag
MOT2053.650.641.7376 (30.3)311 (25.0)6,439231,29855.397.840.243.343.275.945.179.882.61.42,178 (39.4)4,335 (78.4)

Detailed performance:

Sequence MOTA IDF1 HOTA MT ML FP FN Rcll Prcn AssA DetA AssRe AssPr DetRe DetPr LocA FAF ID Sw. Frag
MOT20-0473.262.351.1318562,56769,90574.598.844.658.647.977.161.381.283.11.29181,652
MOT20-0631.532.025.8241241,79688,45533.496.126.225.727.871.926.576.480.91.87321,546
MOT20-0749.847.138.2232514016,31050.799.236.940.039.775.941.481.083.00.2173357
MOT20-0824.029.024.1111061,93656,62826.991.528.520.630.871.821.472.780.22.4355780

Raw data:


UnsupTrack