Graphs offer a natural way to formulate Multiple Object Tracking (MOT) within the tracking-by-detection paradigm. However, they also introduce a major challenge for learning methods, as defining a model that can operate on such a structured domain is not trivial. As a consequence, most learning-based work has been devoted to learning better features for MOT, and then using these with well-established optimization frameworks. In this work, we exploit the classical network flow formulation of MOT to define a fully differentiable framework based on Message Passing Networks (MPNs). By operating directly on the graph domain, our method can reason globally over an entire set of detections and predict final solutions. Hence, we show that learning in MOT does not need to be restricted to feature extraction, but it can also be applied to the data association step. We show a significant improvement in both MOTA and IDF1 on three publicly available benchmarks.
G. Brasó, L. Leal-Taixé. Learning a Neural Solver for Multiple Object Tracking. In CVPR, 2020.
October 28, 2020 (10 months ago)
November 01, 2020 at 20:36:48 CET
Project page / code:
NVIDIA Quadro P5000
|MOT20||57.6||59.1||46.8||474 (38.2)||279 (22.5)||16,953||201,384||61.1||94.9||47.3||46.6||52.7||70.0||49.5||76.9||81.6||3.8||1,210 (19.8)||1,420 (23.2)|