MOT Challenge

Video not available.

Rendering of new sequences is currently deactivated due to heavy load.

Rendering of new sequences is currently deactivated due to heavy load.

Rendering of new sequences is currently deactivated due to heavy load.

Rendering of new sequences is currently deactivated due to heavy load.

Rendering of new sequences is currently deactivated due to heavy load.

Rendering of new sequences is currently deactivated due to heavy load.

Rendering of new sequences is currently deactivated due to heavy load.

Benchmark:

MOT17 | MOT16 | MOT20 |

Short name:

UnsupTrack

Detector:

Public

Description:

Multi-object tracking has seen a lot of progress recently, albeit with substantial annotation costs for developing better and larger labeled datasets. In this work, we remove the need for annotated datasets by proposing an unsupervised re-identification network, thus sidestepping the labeling costs entirely, required for training. Given unlabeled videos, our proposed method (SimpleReID) first generates tracking labels using SORT and trains a ReID network to predict the generated labels using crossentropy loss. We demonstrate that SimpleReID performs substantially better than simpler alternatives, and we recover the full performance of its supervised counterpart consistently across diverse tracking frameworks. The observations are unusual because unsupervised ReID is not expected to excel in crowded scenarios with occlusions, and drastic viewpoint changes. By incorporating our unsupervised SimpleReID with CenterTrack trained on augmented still images, we establish a new state-of-the-art performance on popular datasets like MOT16/17 without using tracking supervision, beating current best (CenterTrack) by 0.2-0.3 MOTA and 4.4-4.8 IDF1 scores. We further provide evidence for limited scope for improvement in IDF1 scores beyond our unsupervised ReID in the studied settings. Our investigation suggests reconsideration towards more sophisticated, supervised, end-to-end trackers by showing promise in simpler unsupervised alternatives.

Reference:

S. Karthik, A. Prabhu, V. Gandhi. Simple Unsupervised Multi-Object Tracking. In Arxiv, 2020.

Last submitted:

May 18, 2020 (4 years ago)

Published:

May 18, 2020 at 16:09:12 CET

Submissions:

Project page / code:

n/a

Open source:

Hardware:

GTX 1080Ti

Runtime:

1.9 Hz

Benchmark performance:

Sequence	MOTA	IDF1	HOTA	MT	ML	FP	FN	Rcll	Prcn	AssA	DetA	AssRe	AssPr	DetRe	DetPr	LocA	FAF	ID Sw.	Frag
MOT16	62.4	58.5	47.0	205 (27.0)	242 (31.9)	5,909	61,981	66.0	95.3	44.8	49.8	53.8	64.1	53.0	76.5	81.2	1.0	588 (8.9)	1,361 (20.6)

Detailed performance:

Sequence	MOTA	IDF1	HOTA	MT	ML	FP	FN	Rcll	Prcn	AssA	DetA	AssRe	AssPr	DetRe	DetPr	LocA	FAF	ID Sw.	Frag
MOT16-01	41.5	48.2	39.3	7	8	529	3,190	50.1	85.8	43.1	36.3	47.2	68.1	40.0	68.5	79.2	1.2	24	43
MOT16-03	78.7	68.6	54.7	90	11	1,549	20,557	80.3	98.2	49.1	61.3	56.8	67.4	64.5	78.8	81.5	1.0	178	471
MOT16-06	52.7	41.1	36.2	57	74	715	4,649	59.7	90.6	30.2	44.0	56.8	39.4	47.8	72.6	80.4	0.6	95	205
MOT16-07	45.3	45.6	35.0	12	9	1,568	7,259	55.5	85.3	32.4	38.7	39.1	56.5	43.4	66.7	79.1	3.1	105	231
MOT16-08	32.8	36.2	31.9	10	23	602	10,553	36.9	91.1	34.1	30.2	41.0	62.8	31.7	78.2	84.3	1.0	89	147
MOT16-12	48.4	46.0	40.7	17	39	287	3,969	52.2	93.8	40.9	40.6	60.9	52.7	43.3	77.9	84.0	0.3	24	48
MOT16-14	32.2	40.3	29.3	12	78	659	11,804	36.1	91.0	33.2	26.1	37.6	60.9	27.5	69.3	77.8	0.9	73	216

Raw data:

download

GLMBFairMOT

UnsupTrack

AFN

UnsupTrack: Simple Unsupervised Multi-Object Tracking