SRK_ODESA: SRK_ODESA

CVPR19-04


Short name:

SRK_ODESA

Description:

Team:
Dmytro Mykheievskyi, Dmytro Borysenko and Viktor Porokhonskyy
Samsung Research&Development Institute Ukraine (SRK)

This solution is merely an adaptation of Faster R-CNN to CVPR'19 Tracking dataset.

Detector Design:
Faster R-CNN with ResNet-101 backbone was taken the starting point. On top of that FPN and 3-stage refining Cascade were added. Anchor settings: two aspect ratios and five scales were configured.

Trainset:
CVPR'19 and MOT16 datasets were included into the trainset. Only pedestrian instances considered by the evaluator (i.e. consider flag is being equal to 1) and visibility above 0.25 were employed during the training stage.

Image preprocessing:
The input images were resized such that the shorter side was set to 1080 pixels. The initial image aspect ratio was preserved.

Training protocol:
Synchronized SGD was employed to train the model on 8 GPUs (NVidia P100). Each mini-batch included single image per GPU. The weight decay and momentum values were set to 0.001 and 0.9, respectively. Because considered scenes were of high density (246 pedestrian per frame), RoIs and NMS top-k counts per image were increased during training and testing stages.

Reference:

D. Borysenko, D. Mykheievskyi, V. Porokhonskyy. ODESA: Object Descriptor that is Smooth Appearance-wise for object tracking tasks. In (to be submitted to ECCV'20), .

Last submitted:

June 03, 2019 (1 year ago)

Published:

June 12, 2019 at 12:47:35 CET

Submissions:

1

Project page / code:

n/a

Open source:

No

Hardware:

CPU: 3GHz, 1 core; GPU: 1.5Ghz

Runtime:

3.0 Hz

Benchmark performance:

Sequence AP MODA MODP FAF TP FP FN Recall Precision F1
CVPR 2019 Detection Challenge0.8175.079.412.3340,61255,25139,93089.586.087.7

Detailed performance:

Sequence AP MODA MODP FAF TP FP FN Recall Precision F1
CVPR19-040.9089.281.66.6253,58713,67615,36094.394.994.6
CVPR19-060.6832.571.025.746,22325,85716,50273.764.168.6
CVPR19-070.9185.981.31.614,9529411,36691.694.192.8
CVPR19-080.7034.071.418.325,85014,7776,70279.463.670.6

Raw data: