Benchmark:
MOT20
Short name:
LITE
Detector:
Public
Description:
The Lightweight Integrated Tracking-Feature Extraction (LITE) paradigm is introduced as a novel multi-object tracking (MOT) approach. It enhances ReID-based trackers by eliminating inference, preprocessing, post-processing, and ReID model training costs. LITE uses real-time appearance features without compromising speed. By integrating appearance feature extraction directly into the tracking pipeline using standard CNN-based detectors such as YOLOv8m, LITE demonstrates significant performance improvements. The simplest implementation of LITE on top of classic DeepSORT achieves a HOTA score of 43.03% at 28.3 FPS on the MOT17 benchmark, making it twice as fast as DeepSORT on MOT17 and four times as fast on the more crowded MOT20 dataset, while maintaining similar accuracy. Additionally, a new evaluation framework for tracking-by-detection approaches reveals that conventional trackers like DeepSORT remain competitive with modern state-of-the-art trackers when evaluated under fair conditions. The code will be available post-publication at https://github.com/Jumabek/LITE
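The idea of reusing detector features as appearance descriptors can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the feature-map layout (a list of channels, each a 2D grid), and the fixed stride are all assumptions made for illustration. It average-pools the feature-map cells covered by a detection box and L2-normalizes the result, so that two detections can be compared by cosine distance without running a separate ReID network.

```python
from math import sqrt

def roi_embedding(feature_map, box, stride):
    """Average-pool a detector feature map inside one bounding box.

    feature_map: list of channels, each a 2D list [rows][cols] (hypothetical layout).
    box: (x1, y1, x2, y2) in image pixels.
    stride: assumed downsampling factor between image and feature map.
    """
    rows, cols = len(feature_map[0]), len(feature_map[0][0])
    x1, y1, x2, y2 = box
    # Map the pixel box onto feature-map cells, keeping at least one cell.
    c1 = min(max(int(x1 // stride), 0), cols - 1)
    r1 = min(max(int(y1 // stride), 0), rows - 1)
    c2 = min(max(int(-(-x2 // stride)), c1 + 1), cols)  # ceiling division
    r2 = min(max(int(-(-y2 // stride)), r1 + 1), rows)
    n = (r2 - r1) * (c2 - c1)
    # One pooled value per channel.
    pooled = [
        sum(ch[r][c] for r in range(r1, r2) for c in range(c1, c2)) / n
        for ch in feature_map
    ]
    # L2-normalize so a dot product equals cosine similarity.
    norm = sqrt(sum(v * v for v in pooled)) or 1.0
    return [v / norm for v in pooled]

def cosine_distance(a, b):
    """Cosine distance between two L2-normalized embeddings."""
    return 1.0 - sum(x * y for x, y in zip(a, b))
```

In a DeepSORT-style tracker, a distance of this kind would feed the appearance term of the association cost; which detector layer to pool from is a design choice this sketch leaves open.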
Reference:
J. Alikhanov, D. Obidov, H. Kim. LITE: A Paradigm Shift in Multi-Object Tracking with Efficient ReID Feature Integration. In , 2024.
Last submitted:
December 18, 2024
Published:
December 18, 2024 at 02:10:56 CET
Submissions:
2
Project page / code:
Open source:
Yes
Hardware:
NVIDIA GeForce RTX 3090
Runtime:
149.3 Hz
Benchmark performance:
Sequence | MOTA | IDF1 | HOTA | MT | ML | FP | FN | Rcll | Prcn | AssA | DetA | AssRe | AssPr | DetRe | DetPr | LocA | FAF | ID Sw. | Frag |
MOT20 | 72.7 | 70.3 | 57.8 | 774 (62.3) | 151 (12.2) | 21,763 | 117,814 | 77.2 | 94.8 | 55.5 | 60.4 | 64.3 | 69.8 | 65.1 | 80.0 | 84.6 | 4.9 | 1,688 (0.0) | 4,307 (0.0) |
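The columns of the overall row are not independent, and a short calculation shows how they relate. The sketch below uses only values from the row above plus the standard CLEAR-MOT definitions (MOTA = 1 - (FN + FP + IDSW) / GT, recall = TP / GT, precision = TP / (TP + FP)). The ground-truth box count is not listed on this page, so it is estimated from FN and the rounded recall and should be read as approximate. The HOTA line is only an approximation as well: HOTA averages sqrt(DetA * AssA) over localization thresholds, so taking the square root of the already averaged DetA and AssA will not reproduce the reported value exactly.

```python
from math import sqrt

# Values copied from the overall MOT20 row.
fp, fn, idsw = 21_763, 117_814, 1_688
recall = 0.772            # Rcll, rounded on the page
det_a, ass_a = 60.4, 55.5

# Ground-truth boxes implied by FN and recall (estimate, not a listed value).
gt_boxes = fn / (1.0 - recall)
tp = gt_boxes - fn

mota = 1.0 - (fn + fp + idsw) / gt_boxes
precision = tp / (tp + fp)
hota_approx = sqrt(det_a * ass_a)  # approximation, see note above

print(f"estimated GT boxes: {gt_boxes:,.0f}")
print(f"MOTA      ~ {mota * 100:.1f} (reported 72.7)")
print(f"Precision ~ {precision * 100:.1f} (reported 94.8)")
print(f"HOTA      ~ {hota_approx:.1f} (reported 57.8)")
```

The MOTA and precision values recovered this way agree with the reported ones to one decimal place, which indicates the row is internally consistent; false negatives make up most of the MOTA error term.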
Detailed performance:
Sequence | MOTA | IDF1 | HOTA | MT | ML | FP | FN | Rcll | Prcn | AssA | DetA | AssRe | AssPr | DetRe | DetPr | LocA | FAF | ID Sw. | Frag |
MOT20-04 | 84.8 | 78.4 | 64.4 | 528 | 17 | 12,383 | 28,589 | 89.6 | 95.2 | 59.6 | 69.7 | 69.2 | 71.7 | 75.6 | 80.3 | 84.5 | 6.0 | 639 | 2,200 |
MOT20-06 | 59.8 | 59.9 | 48.7 | 114 | 69 | 4,311 | 48,484 | 63.5 | 95.1 | 47.3 | 50.3 | 54.2 | 65.4 | 53.6 | 80.3 | 84.8 | 4.3 | 600 | 1,219 |
MOT20-07 | 79.7 | 70.9 | 60.0 | 87 | 1 | 2,169 | 4,406 | 86.7 | 93.0 | 54.8 | 66.5 | 63.4 | 68.3 | 73.9 | 79.3 | 85.6 | 3.7 | 156 | 264 |
MOT20-08 | 49.0 | 53.5 | 44.3 | 45 | 64 | 2,900 | 36,335 | 53.1 | 93.4 | 47.5 | 41.5 | 55.6 | 66.7 | 44.3 | 77.9 | 84.0 | 3.6 | 293 | 624 |
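The count-based columns in the per-sequence table should add up to the overall MOT20 row, while the percentage metrics are weighted by how many ground-truth boxes each sequence contains. The sketch below checks the sums and estimates each sequence's share of ground-truth boxes from FN and the rounded recall; those shares are estimates derived from this page, not official dataset statistics.

```python
# Count-based columns copied from the per-sequence table.
rows = {
    "MOT20-04": dict(mt=528, ml=17, fp=12_383, fn=28_589, idsw=639, frag=2_200, rcll=89.6),
    "MOT20-06": dict(mt=114, ml=69, fp=4_311, fn=48_484, idsw=600, frag=1_219, rcll=63.5),
    "MOT20-07": dict(mt=87, ml=1, fp=2_169, fn=4_406, idsw=156, frag=264, rcll=86.7),
    "MOT20-08": dict(mt=45, ml=64, fp=2_900, fn=36_335, idsw=293, frag=624, rcll=53.1),
}
# Same columns from the overall MOT20 row.
overall = dict(mt=774, ml=151, fp=21_763, fn=117_814, idsw=1_688, frag=4_307)

# Counts are additive across sequences.
totals = {key: sum(r[key] for r in rows.values()) for key in overall}
print("sums match overall row:", totals == overall)

# Estimated ground-truth boxes per sequence: GT = FN / (1 - recall).
gt_est = {name: r["fn"] / (1.0 - r["rcll"] / 100.0) for name, r in rows.items()}
gt_total = sum(gt_est.values())
for name, gt in gt_est.items():
    print(f"{name}: ~{gt:,.0f} boxes ({gt / gt_total:.0%} of total)")
```

By this estimate MOT20-04 holds roughly half of all ground-truth boxes, which is why the overall MOTA of 72.7 sits well above the scores for MOT20-06 and MOT20-08.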