Benchmark:
MOT17 | MOT16 | MOT15 | MOT20 | MOTSynth-MOT-CVPR22 |
Short name:
Tracktor++v2
Detector:
Public
Description:
The problem of tracking multiple objects in a video sequence poses several challenging tasks. For tracking-by- detection these include object re-identification, motion prediction and dealing with occlusions. We present a tracker that accomplishes tracking without specifically targeting any of these tasks, in particular, we perform no training or optimization on tracking data. To this end, we exploit the bounding box regression of an object detector to predict the position of an object in the next frame, thereby converting a detector into a Tracktor. We demonstrate the extensibility of our Tracktor and provide a new state-of-the-art on three multi-object tracking benchmarks by extending it with a straightforward re-identification and camera motion compensation. This benchmark submission presents the results of our extended Tracktor++ multi-object tracker.
Reference:
P. Bergmann, T. Meinhardt, L. Leal-Taixé. Tracking without bells and whistles. In ICCV, 2019.
Last submitted:
March 17, 2020 (4 years ago)
Published:
March 17, 2020 at 17:34:00 CET
Submissions:
1
Project page / code:
Open source:
Yes
Hardware:
Titan X
Runtime:
1.4 Hz
Benchmark performance:
Sequence | MOTA | IDF1 | HOTA | MT | ML | FP | FN | Rcll | Prcn | AssA | DetA | AssRe | AssPr | DetRe | DetPr | LocA | FAF | ID Sw. | Frag |
MOT15 | 46.6 | 47.6 | 37.6 | 131 (18.2) | 201 (27.9) | 4,624 | 26,896 | 56.2 | 88.2 | 35.9 | 40.1 | 39.7 | 72.5 | 44.3 | 69.5 | 79.9 | 0.8 | 1,290 (22.9) | 1,702 (30.3) |
Detailed performance:
Sequence | MOTA | IDF1 | HOTA | MT | ML | FP | FN | Rcll | Prcn | AssA | DetA | AssRe | AssPr | DetRe | DetPr | LocA | FAF | ID Sw. | Frag |
ADL-Rundle-1 | 38.3 | 50.7 | 39.3 | 10 | 4 | 2,089 | 3,595 | 61.4 | 73.2 | 40.5 | 38.4 | 44.1 | 72.1 | 47.8 | 57.0 | 77.9 | 4.2 | 56 | 171 |
ADL-Rundle-3 | 47.0 | 46.2 | 38.0 | 9 | 8 | 516 | 4,804 | 52.7 | 91.2 | 35.1 | 41.6 | 37.2 | 82.7 | 44.7 | 77.4 | 84.4 | 0.8 | 67 | 82 |
AVG-TownCentre | 43.0 | 38.8 | 30.2 | 39 | 42 | 308 | 3,061 | 57.2 | 93.0 | 23.7 | 39.6 | 28.3 | 58.0 | 42.4 | 68.9 | 76.4 | 0.7 | 705 | 602 |
ETH-Crossing | 43.8 | 55.4 | 42.3 | 2 | 10 | 23 | 531 | 47.1 | 95.4 | 46.0 | 38.9 | 48.5 | 81.7 | 40.6 | 82.0 | 85.5 | 0.1 | 10 | 19 |
ETH-Jelmoli | 57.6 | 67.7 | 51.1 | 15 | 13 | 278 | 778 | 69.3 | 86.4 | 52.9 | 49.5 | 59.5 | 75.2 | 56.8 | 70.7 | 82.8 | 0.6 | 19 | 42 |
ETH-Linthescher | 49.9 | 56.1 | 44.5 | 31 | 98 | 182 | 4,244 | 52.5 | 96.3 | 47.9 | 41.4 | 52.6 | 77.2 | 43.3 | 79.5 | 82.9 | 0.2 | 44 | 95 |
KITTI-16 | 51.8 | 62.2 | 39.8 | 1 | 1 | 79 | 720 | 57.7 | 92.5 | 40.8 | 38.8 | 43.0 | 68.9 | 41.5 | 66.7 | 74.9 | 0.4 | 21 | 56 |
KITTI-19 | 47.5 | 55.8 | 38.9 | 9 | 17 | 400 | 2,336 | 56.3 | 88.3 | 39.4 | 38.7 | 44.1 | 65.4 | 42.3 | 66.3 | 76.5 | 0.4 | 68 | 148 |
PETS09-S2L2 | 46.5 | 31.1 | 24.2 | 2 | 5 | 285 | 4,617 | 52.1 | 94.6 | 15.6 | 38.0 | 16.3 | 66.9 | 40.2 | 73.1 | 78.8 | 0.7 | 256 | 428 |
TUD-Crossing | 77.8 | 56.9 | 41.9 | 8 | 0 | 12 | 216 | 80.4 | 98.7 | 31.6 | 56.0 | 44.0 | 44.5 | 59.5 | 73.0 | 77.8 | 0.1 | 17 | 23 |
Venice-1 | 45.8 | 42.9 | 35.4 | 5 | 3 | 452 | 1,994 | 56.3 | 85.0 | 33.1 | 38.1 | 37.8 | 66.4 | 43.2 | 65.2 | 79.3 | 1.0 | 27 | 36 |
Raw data: