MOT17-03-SDP
Benchmark:
MOT17 |
Short name:
TADN
Detector:
Public
Description:
Data association is a crucial component for any multiple object tracking (MOT) method that follows the tracking-by-detection paradigm. To generate complete trajectories such methods employ a data association process to establish assignments between detections and existing targets during each timestep. Recent data association approaches try to solve a multi-dimensional linear assignment task or a network flow minimization problem or either tackle it via multiple hypotheses tracking. However, during inference an optimization step that computes optimal assignments is required for every sequence frame adding significant computational complexity in any given solution. To this end, in the context of this work we introduce Transformer-based Assignment Decision Network (TADN) that tackles data association without the need of any explicit optimization during inference. In particular, TADN can directly infer assignment pairs between detections and active targets in a single forward pass of the network. We have integrated TADN in a rather simple MOT framework, we designed a novel training strategy for efficient end-to-end training and demonstrate the high potential of our approach for online visual tracking-by-detection MOT on two popular benchmarks, i.e. MOT17 and UA-DETRAC. Our proposed approach outperforms the state-of-the-art in most evaluation metrics despite its simple nature as a tracker which lacks significant auxiliary components such as occlusion handling or re-identification.
Reference:
A. Psalta, V. Tsironis, K. Karantzalos. Transformer-based assignment decision network for multiple object tracking. In , 2022.
Last submitted:
June 23, 2022 (2 years ago)
Published:
October 13, 2022 at 10:18:31 CET
Submissions:
2
Project page / code:
Open source:
Yes
Hardware:
GPU 2080ti
Runtime:
360.9 Hz
Benchmark performance:
Sequence | MOTA | IDF1 | HOTA | MT | ML | FP | FN | Rcll | Prcn | AssA | DetA | AssRe | AssPr | DetRe | DetPr | LocA | FAF | ID Sw. | Frag |
MOT17 | 54.6 | 49.0 | 39.7 | 528 (22.4) | 711 (30.2) | 36,285 | 214,857 | 61.9 | 90.6 | 35.1 | 45.3 | 43.1 | 59.3 | 49.4 | 72.2 | 80.1 | 2.0 | 4,869 (0.0) | 7,821 (0.0) |
Detailed performance:
Sequence | MOTA | IDF1 | HOTA | MT | ML | FP | FN | Rcll | Prcn | AssA | DetA | AssRe | AssPr | DetRe | DetPr | LocA | FAF | ID Sw. | Frag |
MOT17-01-DPM | 38.3 | 36.8 | 30.2 | 7 | 4 | 1,196 | 2,684 | 58.4 | 75.9 | 24.9 | 37.2 | 29.3 | 64.7 | 44.8 | 58.2 | 77.1 | 2.7 | 102 | 130 |
MOT17-01-FRCNN | 38.3 | 36.8 | 30.2 | 7 | 4 | 1,196 | 2,684 | 58.4 | 75.9 | 24.9 | 37.2 | 29.3 | 64.7 | 44.8 | 58.2 | 77.1 | 2.7 | 102 | 130 |
MOT17-01-SDP | 38.3 | 36.8 | 30.2 | 7 | 4 | 1,196 | 2,684 | 58.4 | 75.9 | 24.9 | 37.2 | 29.3 | 64.7 | 44.8 | 58.2 | 77.1 | 2.7 | 102 | 130 |
MOT17-03-DPM | 71.3 | 56.8 | 45.7 | 64 | 13 | 2,731 | 26,868 | 74.3 | 96.6 | 37.6 | 55.9 | 45.7 | 59.3 | 59.3 | 77.1 | 80.8 | 1.8 | 482 | 959 |
MOT17-03-FRCNN | 71.3 | 56.8 | 45.7 | 64 | 13 | 2,731 | 26,868 | 74.3 | 96.6 | 37.6 | 55.9 | 45.7 | 59.3 | 59.3 | 77.1 | 80.8 | 1.8 | 482 | 959 |
MOT17-03-SDP | 71.3 | 56.8 | 45.7 | 64 | 13 | 2,731 | 26,868 | 74.3 | 96.6 | 37.6 | 55.9 | 45.7 | 59.3 | 59.3 | 77.1 | 80.8 | 1.8 | 482 | 959 |
MOT17-06-DPM | 47.5 | 50.0 | 38.9 | 56 | 68 | 1,396 | 4,594 | 61.0 | 83.7 | 35.6 | 42.7 | 50.2 | 54.9 | 48.6 | 66.7 | 79.4 | 1.2 | 195 | 281 |
MOT17-06-FRCNN | 47.5 | 50.0 | 38.9 | 56 | 68 | 1,396 | 4,594 | 61.0 | 83.7 | 35.6 | 42.7 | 50.2 | 54.9 | 48.6 | 66.7 | 79.4 | 1.2 | 195 | 281 |
MOT17-06-SDP | 47.5 | 50.0 | 38.9 | 56 | 68 | 1,396 | 4,594 | 61.0 | 83.7 | 35.6 | 42.7 | 50.2 | 54.9 | 48.6 | 66.7 | 79.4 | 1.2 | 195 | 281 |
MOT17-07-DPM | 41.8 | 40.8 | 32.8 | 8 | 16 | 1,501 | 8,124 | 51.9 | 85.4 | 29.1 | 37.3 | 34.0 | 61.3 | 41.2 | 67.8 | 78.7 | 3.0 | 209 | 366 |
MOT17-07-FRCNN | 41.8 | 40.8 | 32.8 | 8 | 16 | 1,501 | 8,124 | 51.9 | 85.4 | 29.1 | 37.3 | 34.0 | 61.3 | 41.2 | 67.8 | 78.7 | 3.0 | 209 | 366 |
MOT17-07-SDP | 41.8 | 40.8 | 32.8 | 8 | 16 | 1,501 | 8,124 | 51.9 | 85.4 | 29.1 | 37.3 | 34.0 | 61.3 | 41.2 | 67.8 | 78.7 | 3.0 | 209 | 366 |
MOT17-08-DPM | 25.1 | 27.7 | 26.2 | 9 | 34 | 1,124 | 14,484 | 31.4 | 85.5 | 28.3 | 24.4 | 34.1 | 62.2 | 26.0 | 70.7 | 80.7 | 1.8 | 214 | 258 |
MOT17-08-FRCNN | 25.1 | 27.7 | 26.2 | 9 | 34 | 1,124 | 14,484 | 31.4 | 85.5 | 28.3 | 24.4 | 34.1 | 62.2 | 26.0 | 70.7 | 80.7 | 1.8 | 214 | 258 |
MOT17-08-SDP | 25.1 | 27.7 | 26.2 | 9 | 34 | 1,124 | 14,484 | 31.4 | 85.5 | 28.3 | 24.4 | 34.1 | 62.2 | 26.0 | 70.7 | 80.7 | 1.8 | 214 | 258 |
MOT17-12-DPM | 34.0 | 46.3 | 38.4 | 20 | 38 | 1,410 | 4,225 | 51.3 | 75.9 | 41.7 | 35.7 | 51.8 | 59.4 | 42.1 | 62.3 | 80.9 | 1.6 | 85 | 113 |
MOT17-12-FRCNN | 34.0 | 46.3 | 38.4 | 20 | 38 | 1,410 | 4,225 | 51.3 | 75.9 | 41.7 | 35.7 | 51.8 | 59.4 | 42.1 | 62.3 | 80.9 | 1.6 | 85 | 113 |
MOT17-12-SDP | 34.0 | 46.3 | 38.4 | 20 | 38 | 1,410 | 4,225 | 51.3 | 75.9 | 41.7 | 35.7 | 51.8 | 59.4 | 42.1 | 62.3 | 80.9 | 1.6 | 85 | 113 |
MOT17-14-DPM | 25.8 | 33.2 | 24.9 | 12 | 64 | 2,737 | 10,640 | 42.4 | 74.1 | 22.2 | 28.5 | 28.6 | 54.6 | 32.8 | 57.4 | 76.0 | 3.6 | 336 | 500 |
MOT17-14-FRCNN | 25.8 | 33.2 | 24.9 | 12 | 64 | 2,737 | 10,640 | 42.4 | 74.1 | 22.2 | 28.5 | 28.6 | 54.6 | 32.8 | 57.4 | 76.0 | 3.6 | 336 | 500 |
MOT17-14-SDP | 25.8 | 33.2 | 24.9 | 12 | 64 | 2,737 | 10,640 | 42.4 | 74.1 | 22.2 | 28.5 | 28.6 | 54.6 | 32.8 | 57.4 | 76.0 | 3.6 | 336 | 500 |
Raw data: