Benchmark:
MOT20 |
Short name:
LITE
Detector:
Public
Description:
The Lightweight Integrated Tracking-Feature Extraction (LITE)
paradigm is introduced as a novel multi-object tracking (MOT) approach. It enhances ReID-based trackers by eliminating inference, preprocessing, post-processing, and ReID model training costs. LITE uses
real-time appearance features without compromising speed. By integrating appearance feature extraction directly into the tracking pipeline using
standard CNN-based detectors such as YOLOv8m, LITE demonstrates
significant performance improvements. The simplest implementation of
LITE on top of classic DeepSORT achieves a HOTA score of 43.03% at
28.3 FPS on the MOT17 benchmark, making it twice as fast as DeepSORT on MOT17 and four times faster on the more crowded MOT20
dataset, while maintaining similar accuracy. Additionally, a new evaluation framework for tracking-by-detection approaches reveals that conventional trackers like DeepSORT remain competitive with modern stateof-the-art trackers when evaluated under fair conditions. The code will
be available post-publication at https://github.com/Jumabek/LITE
Reference:
J. Alikhanov, D. Obidov, H. Kim. LITE: A Paradigm Shift in Multi-Object Tracking with Efficient ReID Feature Integration. In , 2024.
Last submitted:
December 27, 2024 (24 days ago)
Published:
December 18, 2024 at 02:10:56 CET
Submissions:
4
Project page / code:
Open source:
Yes
Hardware:
NVIDIA GeForce RTX 3090
Runtime:
149.3 Hz
Benchmark performance:
Sequence | MOTA | IDF1 | HOTA | MT | ML | FP | FN | Rcll | Prcn | AssA | DetA | AssRe | AssPr | DetRe | DetPr | LocA | FAF | ID Sw. | Frag |
MOT20 | 43.8 | 39.7 | 32.9 | 233 (18.8) | 298 (24.0) | 34,230 | 251,079 | 51.5 | 88.6 | 28.8 | 37.9 | 36.4 | 46.8 | 41.4 | 71.3 | 80.6 | 7.6 | 5,694 (0.0) | 13,502 (0.0) |
Detailed performance:
Sequence | MOTA | IDF1 | HOTA | MT | ML | FP | FN | Rcll | Prcn | AssA | DetA | AssRe | AssPr | DetRe | DetPr | LocA | FAF | ID Sw. | Frag |
MOT20-04 | 62.2 | 49.2 | 40.1 | 168 | 74 | 4,613 | 96,665 | 64.7 | 97.5 | 32.1 | 50.3 | 40.3 | 50.5 | 52.9 | 79.7 | 82.9 | 2.2 | 2,389 | 7,335 |
MOT20-06 | 23.1 | 26.9 | 21.8 | 26 | 111 | 15,356 | 84,770 | 36.1 | 75.8 | 19.3 | 25.0 | 24.9 | 35.7 | 28.0 | 58.7 | 75.7 | 15.2 | 1,972 | 3,514 |
MOT20-07 | 49.0 | 45.2 | 35.3 | 28 | 21 | 1,345 | 15,186 | 54.1 | 93.0 | 32.1 | 39.3 | 40.2 | 50.5 | 42.3 | 72.7 | 80.3 | 2.3 | 337 | 678 |
MOT20-08 | 11.8 | 21.1 | 18.0 | 11 | 92 | 12,916 | 54,458 | 29.7 | 64.1 | 16.5 | 20.0 | 21.6 | 32.5 | 23.1 | 49.8 | 74.0 | 16.0 | 996 | 1,975 |
Raw data: