Benchmark:
MOT15 |
Short name:
RNN_LSTM
Detector:
Public
Description:
We present a novel approach to online multi-target tracking based on
recurrent neural networks (RNNs). Tracking multiple objects in
real-world scenes involves many challenges, including a) an a-priori
unknown and time-varying number of targets, b) a continuous state
estimation of all present targets, and c) a discrete combinatorial
problem of data association. Most previous methods involve complex
models that require tedious tuning of parameters. Here, we propose for
the first time, a full end-to-end learning approach for online
multi-target tracking based on deep learning. Existing deep learning
methods are not designed for the above challenges and cannot be
trivially applied to the task. Our solution addresses all of the above
points in a principled way. Experiments on both synthetic and real data
show competitive results obtained at 300 Hz on a standard CPU, and pave
the way towards future research in this direction.
Reference:
A. Milan, S. Rezatofighi, A. Dick, I. Reid, K. Schindler. Online Multi-Target Tracking using Recurrent Neural Networks. In AAAI, 2017.
Last submitted:
March 15, 2016 (8 years ago)
Published:
March 15, 2016 at 08:24:33 CET
Submissions:
2
Project page / code:
Open source:
Yes
Hardware:
3 GHz, 1 CPU
Runtime:
165.2 Hz
Benchmark performance:
Sequence | MOTA | IDF1 | HOTA | MT | ML | FP | FN | Rcll | Prcn | AssA | DetA | AssRe | AssPr | DetRe | DetPr | LocA | FAF | ID Sw. | Frag |
MOT15 | 19.0 | 17.1 | 15.8 | 40 (5.5) | 329 (45.6) | 11,578 | 36,706 | 40.3 | 68.1 | 9.7 | 26.5 | 23.6 | 14.7 | 31.2 | 52.8 | 74.3 | 2.0 | 1,490 (37.0) | 2,081 (51.7) |
Detailed performance:
Sequence | MOTA | IDF1 | HOTA | MT | ML | FP | FN | Rcll | Prcn | AssA | DetA | AssRe | AssPr | DetRe | DetPr | LocA | FAF | ID Sw. | Frag |
ADL-Rundle-1 | 0.0 | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | 0 |
ADL-Rundle-3 | 0.0 | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | 0 |
AVG-TownCentre | 0.0 | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | 0 |
ETH-Crossing | 0.0 | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | 0 |
ETH-Jelmoli | 0.0 | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | 0 |
ETH-Linthescher | 0.0 | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | 0 |
KITTI-16 | 0.0 | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | 0 |
KITTI-19 | 0.0 | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | 0 |
PETS09-S2L2 | 0.0 | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | 0 |
TUD-Crossing | 0.0 | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | 0 |
Venice-1 | 0.0 | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | 0 |
Raw data: