Online Multiple Pedestrian Tracking with Deep Temporal Appearance Matching Association


Short name:

DD_TAMA19

Benchmark:

Description:

In online multiple pedestrian tracking, it is important to construct a reliable cost matrix for assigning observations to tracks. Each element of the cost matrix is computed with a similarity measure. Many previous works have proposed their own similarity measures, combining a geometric model (e.g. bounding box coordinates) with an appearance model. The appearance model carries higher-dimensional information than the geometric model, and the recent success of deep learning has made it practical to handle such high-dimensional appearance information. Among deep networks, a Siamese network trained with triplet loss is a popular appearance feature extractor. Because a Siamese network extracts features from each input independently, tracks can be modeled adaptively (e.g. by linear update). However, it is not well suited to the multi-object setting, which requires comparison with other inputs. In this paper we propose a novel track appearance model based on a joint inference network to address this issue. The proposed method allows the comparison of two inputs to be used for adaptive appearance modeling. It helps disambiguate target-observation matching and consolidate identity consistency. Extensive experimental results support the effectiveness of our method. Our tracker placed third on MOTChallenge19, held at the 4th BMTT workshop.
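To make the cost-matrix idea in the description concrete, the following is a minimal sketch of the Siamese-style baseline it mentions, not the proposed joint inference network. It assumes appearance features are plain vectors, uses cosine distance as the cost, solves the assignment by exhaustive search (practical only for a handful of tracks), and applies a linear update to matched tracks. All function names, the toy feature vectors, and the `alpha` value are hypothetical choices made for illustration.

```python
from itertools import permutations
from math import sqrt


def cosine_similarity(a, b):
    """Cosine similarity between two feature vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def build_cost_matrix(track_feats, obs_feats):
    """cost[i][j] = 1 - similarity(track i, observation j)."""
    return [[1.0 - cosine_similarity(t, o) for o in obs_feats] for t in track_feats]


def assign(cost):
    """Exhaustive minimum-cost assignment.

    Assumes the number of tracks does not exceed the number of observations;
    a real tracker would use a polynomial-time solver instead.
    """
    n_tracks, n_obs = len(cost), len(cost[0])
    best = min(
        permutations(range(n_obs), n_tracks),
        key=lambda p: sum(cost[i][j] for i, j in enumerate(p)),
    )
    return list(enumerate(best))


def linear_update(track_feat, obs_feat, alpha=0.9):
    """Blend the matched observation into the track feature (alpha is an assumed value)."""
    return [alpha * t + (1 - alpha) * o for t, o in zip(track_feat, obs_feat)]


# Toy data: two tracks, three observations (hypothetical 2-D features).
tracks = [[1.0, 0.0], [0.0, 1.0]]
observations = [[0.1, 0.9], [0.9, 0.1], [0.7, 0.7]]

cost = build_cost_matrix(tracks, observations)
matches = assign(cost)  # list of (track index, observation index) pairs
tracks = [linear_update(tracks[i], observations[j]) for i, j in matches]
```

Here track 0 is matched to observation 1 and track 1 to observation 0, leaving observation 2 unassigned. The linear update is possible only because each feature is extracted independently of the others, which is the property of Siamese networks that the description contrasts with joint inference.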

Hardware:

3.0 GHz, 1 Core, TITAN X

Detector:

Public

Processing:

Online

Last submitted:

June 13, 2019

Published:

June 16, 2019 at 00:00:00 CET

Submissions:

1

Open source:

No

Project page / code:

n/a

Reference:

Y. Yoon, D. Kim, K. Yoon, Y. Song, M. Jeon. Online Multiple Pedestrian Tracking using Deep Temporal Appearance Matching Association. In arXiv:1907.00831, 2019.

Benchmark performance:

MOTA  MOTP  FAF  MT      ML      FP      FN       ID Sw.  Frag   Specifications            Detector
47.6  77.6  8.5  27.2 %  23.6 %  38,194  252,934  2,437   3,887  3.0 GHz, 1 Core, TITAN X  Public

IDF1  ID Precision  ID Recall
48.7  63.8          39.4
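IDF1 is the harmonic mean of ID Precision and ID Recall, so the three reported identity scores can be checked against each other. The short sketch below does this with the rounded percentages from this page; the function name is a hypothetical helper, and because the inputs are already rounded, agreement is only expected to the reported precision.

```python
def idf1(id_precision, id_recall):
    """Harmonic mean of ID precision and ID recall (both in percent)."""
    return 2 * id_precision * id_recall / (id_precision + id_recall)


value = idf1(63.8, 39.4)
print(round(value, 1))  # prints 48.7
```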

Detailed performance:

Sequence   MOTA  IDF1  MOTP  FAF   GT   MT      ML      FP      FN       ID Sw  Frag
CVPR19-04  64.0  59.2  80.1  4.0   692  38.7 %  13.0 %  8,225   105,499  900    1,325
CVPR19-06  26.1  32.8  71.7  15.2  263  9.9 %   38.0 %  15,279  80,830   814    1,420
CVPR19-07  53.9  49.6  75.7  3.2   111  34.2 %  13.5 %  1,865   13,161   237    288
CVPR19-08  13.8  26.0  69.9  15.9  190  5.3 %   48.4 %  12,825  53,444   486    854
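The count-based columns (FP, FN, ID Sw, Frag) are additive across sequences, so the per-sequence rows should sum to the overall benchmark row. The sketch below is a simple consistency check using the values listed on this page; the dictionary layout and key names are illustrative choices, not part of any benchmark tooling.

```python
# Per-sequence counts as listed in the detailed performance table.
per_sequence = {
    "CVPR19-04": {"FP": 8225, "FN": 105499, "IDSw": 900, "Frag": 1325},
    "CVPR19-06": {"FP": 15279, "FN": 80830, "IDSw": 814, "Frag": 1420},
    "CVPR19-07": {"FP": 1865, "FN": 13161, "IDSw": 237, "Frag": 288},
    "CVPR19-08": {"FP": 12825, "FN": 53444, "IDSw": 486, "Frag": 854},
}

# Overall counts as listed in the benchmark performance row.
reported = {"FP": 38194, "FN": 252934, "IDSw": 2437, "Frag": 3887}

# Sum each column over all sequences and compare with the reported totals.
totals = {key: sum(row[key] for row in per_sequence.values()) for key in reported}
mismatches = {key: (totals[key], reported[key]) for key in reported if totals[key] != reported[key]}
print(mismatches or "per-sequence counts match the overall row")
```

Ratio metrics such as MOTA, MOTP, and IDF1 are not additive and cannot be checked this way without the per-sequence ground-truth box counts, which this page does not list.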

Raw data:

n/a

