MOT17

MOT17 Challenge. All MOT16 sequences are used with a new, more accurate ground truth. Each sequences is provided with 3 sets of detections: DPM, Faster-RCNN, and SDP.

Jump to download

Training Set

SampleName FPS Resolution Length Tracks Boxes DensityDescriptionSourceRef.
1MOT17-05-DPM14640x480837 (01:00)13369178.3Street scene from a moving platformlink[1]
2MOT17-05-FRCNN14640x480837 (01:00)13369178.3Street scene from a moving platformlink[1]
3MOT17-05-SDP14640x480837 (01:00)13369178.3Street scene from a moving platformlink[1]
4MOT17-13-DPM251920x1080750 (00:30)1101164215.5Filmed from a bus on a busy intersectionlink[2]
5MOT17-13-FRCNN251920x1080750 (00:30)1101164215.5Filmed from a bus on a busy intersectionlink[2]
6MOT17-13-SDP251920x1080750 (00:30)1101164215.5Filmed from a bus on a busy intersectionlink[2]
7MOT17-02-DPM301920x1080600 (00:20)621858131.0People walking around a large square.link[3]
8MOT17-04-DPM301920x10801050 (00:35)834755745.3Pedestrian street at night, elevated viewpointlink[2]
9MOT17-09-DPM301920x1080525 (00:18)26532510.1A pedestrian street scene filmed from a low angle.link[3]
10MOT17-10-DPM301920x1080654 (00:22)571283919.6A pedestrian scene filmed at night by a moving cameralink[3]
11MOT17-11-DPM301920x1080900 (00:30)75943610.5Forward moving camera in a busy shopping malllink[2]
12MOT17-02-FRCNN301920x1080600 (00:20)621858131.0People walking around a large square.link[3]
13MOT17-04-FRCNN301920x10801050 (00:35)834755745.3Pedestrian street at night, elevated viewpointlink[2]
14MOT17-09-FRCNN301920x1080525 (00:18)26532510.1A pedestrian street scene filmed from a low angle.link[3]
15MOT17-10-FRCNN301920x1080654 (00:22)571283919.6A pedestrian scene filmed at night by a moving cameralink[3]
16MOT17-11-FRCNN301920x1080900 (00:30)75943610.5Forward moving camera in a busy shopping malllink[2]
17MOT17-02-SDP301920x1080600 (00:20)621858131.0People walking around a large square.link[3]
18MOT17-04-SDP301920x10801050 (00:35)834755745.3Pedestrian street at night, elevated viewpointlink[2]
19MOT17-09-SDP301920x1080525 (00:18)26532510.1A pedestrian street scene filmed from a low angle.link[3]
20MOT17-10-SDP301920x1080654 (00:22)571283919.6A pedestrian scene filmed at night by a moving cameralink[3]
21MOT17-11-SDP301920x1080900 (00:30)75943610.5Forward moving camera in a busy shopping malllink[2]
Total 15948 frm.
(645 s.)
1638 336891 21.1

Test Set

SampleName FPS Resolution Length Tracks Boxes DensityDescriptionSourceRef.
1MOT17-06-DPM14640x4801194 (01:25)222117849.9Street scene from a moving platformlink[1]
2MOT17-06-FRCNN14640x4801194 (01:25)222117849.9Street scene from a moving platformlink[1]
3MOT17-06-SDP14640x4801194 (01:25)222117849.9Street scene from a moving platformlink[1]
4MOT17-14-DPM251920x1080750 (00:30)1641848324.6Filmed from a bus on a busy intersectionlink[2]
5MOT17-14-FRCNN251920x1080750 (00:30)1641848324.6Filmed from a bus on a busy intersectionlink[2]
6MOT17-14-SDP251920x1080750 (00:30)1641848324.6Filmed from a bus on a busy intersectionlink[2]
7MOT17-01-DPM301920x1080450 (00:15)24645014.3People walking around a large square.link[3]
8MOT17-03-DPM301920x10801500 (00:50)14810467569.8Pedestrian street at night, elevated viewpointlink[2]
9MOT17-07-DPM301920x1080500 (00:17)601689333.8A busy pedestrian street filmed at eye level by a moving cameralink[3]
10MOT17-08-DPM301920x1080625 (00:21)762112433.8A crowded pedestrian street, stationary cameralink[3]
11MOT17-12-DPM301920x1080900 (00:30)9186679.6Forward moving camera in a busy shopping malllink[2]
12MOT17-01-FRCNN301920x1080450 (00:15)24645014.3People walking around a large square.link[3]
13MOT17-03-FRCNN301920x10801500 (00:50)14810467569.8Pedestrian street at night, elevated viewpointlink[2]
14MOT17-07-FRCNN301920x1080500 (00:17)601689333.8A busy pedestrian street filmed at eye level by a moving cameralink[3]
15MOT17-08-FRCNN301920x1080625 (00:21)762112433.8A crowded pedestrian street, stationary cameralink[3]
16MOT17-12-FRCNN301920x1080900 (00:30)9186679.6Forward moving camera in a busy shopping malllink[2]
17MOT17-01-SDP301920x1080450 (00:15)24645014.3People walking around a large square.link[3]
18MOT17-03-SDP301920x10801500 (00:50)14810467569.8Pedestrian street at night, elevated viewpointlink[2]
19MOT17-07-SDP301920x1080500 (00:17)601689333.8A busy pedestrian street filmed at eye level by a moving cameralink[3]
20MOT17-08-SDP301920x1080625 (00:21)762112433.8A crowded pedestrian street, stationary cameralink[3]
21MOT17-12-SDP301920x1080900 (00:30)9186679.6Forward moving camera in a busy shopping malllink[2]
Total 17757 frm.
(744 s.)
2355 564228 31.8


Download

Get all data (5.5 GB)
Get detections and labels only (9.7 MB)
Get development kit (0.5 MB)

Note that the data contains the same set of sequences (frames) as MOT16 three times. For convenience you may download the entire data which will extract in correct folder structure. Alternatively, you may re-use the MOT16 sequences (frames) locally. Important: Both the ground truth and the detection set is new for MOT17!

References:


[1] Ess, A., Leibe, B. & Gool, L.V. Depth and Appearance for Mobile Scene Analysis. In Proceedings of the Eleventh IEEE International Conference on Computer Vision, 2007.
[2] Milan, A., Leal-Taixé, L., Reid, I., Roth, S. & Schindler, K. MOT16: A Benchmark for Multi-Object Tracking. arXiv:1603.00831 [cs], 2016., (arXiv: 1603.00831).
[3] Leal-Taixé, L., Milan, A., Reid, I., Roth, S. & Schindler, K. MOTChallenge 2015: Towards a Benchmark for Multi-Target Tracking. arXiv:1504.01942 [cs], 2015., (arXiv: 1504.01942).