This benchmark contains video sequences in unconstrained environments filmed with both static and moving cameras. Tracking and evaluation are done in image coordinates.
Sample | Name | FPS | Resolution | Length | Tracks | Boxes | Density | Description | Source | Ref. |
Venice-2 | 30 | 1920x1080 | 600 (00:20) | 26 | 7141 | 11.9 | People walking around a large square. | link | [1] | |
KITTI-17 | 10 | 1224x370 | 145 (00:15) | 9 | 683 | 4.7 | Walking pedestrians on a sunny day, static camera | link | [2] | |
KITTI-13 | 10 | 1242x375 | 340 (00:34) | 42 | 762 | 2.2 | Busy urban environment filmed from a moving car | link | [2] | |
ADL-Rundle-8 | 30 | 1920x1080 | 654 (00:22) | 28 | 6783 | 10.4 | A pedestrian scene filmed at night by a moving camera | link | [1] | |
ADL-Rundle-6 | 30 | 1920x1080 | 525 (00:18) | 24 | 5009 | 9.5 | A pedestrian street scene filmed from a low angle. | link | [1] | |
ETH-Pedcross2 | 14 | 640x480 | 837 (01:00) | 133 | 6263 | 7.5 | Street scene from a moving platform | link | [3] | |
ETH-Sunnyday | 14 | 640x480 | 354 (00:25) | 30 | 1858 | 5.2 | Street scene on a sunny day from a moving platform | link | [3] | |
ETH-Bahnhof | 14 | 640x480 | 1000 (01:11) | 171 | 5415 | 5.4 | Street scene from a moving platform | link | [3] | |
PETS09-S2L1 | 7 | 768x576 | 795 (01:54) | 19 | 4476 | 5.6 | A widely used sequence showing up to 8 walking pedestrians, partly in unusual patterns. | link | [4] | |
TUD-Campus | 25 | 640x480 | 71 (00:03) | 8 | 359 | 5.1 | A short sequence with side-view pedestrians | link | [5] | |
TUD-Stadtmitte | 25 | 640x480 | 179 (00:07) | 10 | 1156 | 6.5 | A static camera at about 2 meters height shows walking people on the street. | link | [6] | |
Total | 5500 frm. (389 s.) | 500 | 39905 | 7.3 |
Sample | Name | FPS | Resolution | Length | Tracks | Boxes | Density | Description | Source | Ref. |
Venice-1 | 30 | 1920x1080 | 450 (00:15) | 17 | 4563 | 10.1 | People walking around a large square. | link | [1] | |
KITTI-19 | 10 | 1238x374 | 1059 (01:46) | 62 | 5343 | 5.0 | A street scene from a moving vehicle | link | [2] | |
KITTI-16 | 10 | 1224x370 | 209 (00:21) | 17 | 1701 | 8.1 | Pedestrians crossing a street filmed from a car | link | [2] | |
ADL-Rundle-3 | 30 | 1920x1080 | 625 (00:21) | 44 | 10166 | 16.3 | A crowded pedestrian street, stationary camera | link | [1] | |
ADL-Rundle-1 | 30 | 1920x1080 | 500 (00:17) | 32 | 9306 | 18.6 | A busy pedestrian street filmed at eye level by a moving camera | link | [1] | |
AVG-TownCentre | 2.5 | 1920x1080 | 450 (03:45) | 226 | 7148 | 15.9 | A pedestrian street filmed from an elevated point | link | [7] | |
ETH-Crossing | 14 | 640x480 | 219 (00:16) | 26 | 1003 | 4.6 | Street scene from a moving platform | link | [3] | |
ETH-Linthescher | 14 | 640x480 | 1194 (01:25) | 197 | 8930 | 7.5 | Street scene from a moving platform | link | [3] | |
ETH-Jelmoli | 14 | 640x480 | 440 (00:31) | 45 | 2537 | 5.8 | Street scene from a moving platform | link | [3] | |
PETS09-S2L2 | 7 | 768x576 | 436 (01:02) | 42 | 9641 | 22.1 | A crowded scene shown from an elevated viewpoint. | link | [4] | |
TUD-Crossing | 25 | 640x480 | 201 (00:08) | 13 | 1102 | 5.5 | A road crossing from a side view | link | [5] | |
Total | 5783 frm. (607 s.) | 721 | 61440 | 10.6 |
[1] | MOTChallenge 2015: Towards a Benchmark for Multi-Target Tracking. arXiv:1504.01942 [cs], 2015., (arXiv: 1504.01942). |
[2] | Are we ready for Autonomous Driving? The KITTI Vision Benchmark Suite. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2012. |
[3] | Depth and Appearance for Mobile Scene Analysis. In Proceedings of the Eleventh IEEE International Conference on Computer Vision, 2007. |
[4] | PETS2009: Dataset and challenge. In 11th IEEE International Workshop on Performance Evaluation of Tracking and Surveillance (PETS), 2009. |
[5] | People-Tracking-by-Detection and People-Detection-by-Tracking. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2008. |
[6] | Monocular 3D Pose Estimation and Tracking by Detection. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010. |
[7] | Guiding Visual Surveillance by Tracking Human Attention. In Proceedings of the British Machine Vision Conference, 2009. |