MOT Challenge

Video not available.

Rendering of new sequences is currently deactivated due to heavy load.

Rendering of new sequences is currently deactivated due to heavy load.

Rendering of new sequences is currently deactivated due to heavy load.

Rendering of new sequences is currently deactivated due to heavy load.

Benchmark:

MOT17 | MOT16 | MOT20 |

Short name:

UnsupTrack

Detector:

Public

Description:

Multi-object tracking has seen a lot of progress recently, albeit with substantial annotation costs for developing better and larger labeled datasets. In this work, we remove the need for annotated datasets by proposing an unsupervised re-identification network, thus sidestepping the labeling costs entirely, required for training. Given unlabeled videos, our proposed method (SimpleReID) first generates tracking labels using SORT and trains a ReID network to predict the generated labels using crossentropy loss. We demonstrate that SimpleReID performs substantially better than simpler alternatives, and we recover the full performance of its supervised counterpart consistently across diverse tracking frameworks. The observations are unusual because unsupervised ReID is not expected to excel in crowded scenarios with occlusions, and drastic viewpoint changes. By incorporating our unsupervised SimpleReID with CenterTrack trained on augmented still images, we establish a new state-of-the-art performance on popular datasets like MOT16/17 without using tracking supervision, beating current best (CenterTrack) by 0.2-0.3 MOTA and 4.4-4.8 IDF1 scores. We further provide evidence for limited scope for improvement in IDF1 scores beyond our unsupervised ReID in the studied settings. Our investigation suggests reconsideration towards more sophisticated, supervised, end-to-end trackers by showing promise in simpler unsupervised alternatives.

Reference:

S. Karthik, A. Prabhu, V. Gandhi. Simple Unsupervised Multi-Object Tracking. In Arxiv, 2020.

Last submitted:

June 19, 2020 (4 years ago)

Published:

June 12, 2020 at 11:37:09 CET

Submissions:

Project page / code:

n/a

Open source:

Hardware:

GTX 1080Ti

Runtime:

1.3 Hz

Benchmark performance:

Sequence	MOTA	IDF1	HOTA	MT	ML	FP	FN	Rcll	Prcn	AssA	DetA	AssRe	AssPr	DetRe	DetPr	LocA	FAF	ID Sw.	Frag
MOT20	53.6	50.6	41.7	376 (30.3)	311 (25.0)	6,439	231,298	55.3	97.8	40.2	43.3	43.2	75.9	45.1	79.8	82.6	1.4	2,178 (39.4)	4,335 (78.4)

Detailed performance:

Sequence	MOTA	IDF1	HOTA	MT	ML	FP	FN	Rcll	Prcn	AssA	DetA	AssRe	AssPr	DetRe	DetPr	LocA	FAF	ID Sw.	Frag
MOT20-04	73.2	62.3	51.1	318	56	2,567	69,905	74.5	98.8	44.6	58.6	47.9	77.1	61.3	81.2	83.1	1.2	918	1,652
MOT20-06	31.5	32.0	25.8	24	124	1,796	88,455	33.4	96.1	26.2	25.7	27.8	71.9	26.5	76.4	80.9	1.8	732	1,546
MOT20-07	49.8	47.1	38.2	23	25	140	16,310	50.7	99.2	36.9	40.0	39.7	75.9	41.4	81.0	83.0	0.2	173	357
MOT20-08	24.0	29.0	24.1	11	106	1,936	56,628	26.9	91.5	28.5	20.6	30.8	71.8	21.4	72.7	80.2	2.4	355	780

Raw data:

download

SUT

UnsupTrack

Fair

UnsupTrack: Simple Unsupervised Multi-Object Tracking