SRK_ODESA


Short name:

SRK_ODESA

Benchmark:

Description:

Team:
Dmytro Mykheievskyi, Dmytro Borysenko and Viktor Porokhonskyy
Samsung Research&Development Institute Ukraine (SRK)

This solution is merely an adaptation of Faster R-CNN to the CVPR'19 Tracking dataset.

Detector Design:
Faster R-CNN with a ResNet-101 backbone was taken as the starting point. An FPN and a 3-stage refining Cascade were added on top of it. The anchors were configured with two aspect ratios and five scales.
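To make the anchor configuration concrete, here is a minimal sketch of how anchor shapes can be enumerated from a set of scales and aspect ratios. The description states only the counts (two aspect ratios, five scales), so the numeric values below are hypothetical placeholders, and the function name `anchor_shapes` is an illustrative helper rather than part of the submitted detector.

```python
import math


def anchor_shapes(scales, aspect_ratios):
    """Return one (width, height) pair per scale/aspect-ratio combination.

    Each anchor has area scale**2; the aspect ratio is height / width.
    """
    shapes = []
    for s in scales:
        for r in aspect_ratios:
            w = s / math.sqrt(r)
            h = s * math.sqrt(r)
            shapes.append((w, h))
    return shapes


# Hypothetical values: the source gives only "five scales" and "two aspect ratios".
SCALES = [32, 64, 128, 256, 512]
ASPECT_RATIOS = [1.0, 2.0]

shapes = anchor_shapes(SCALES, ASPECT_RATIOS)
for w, h in shapes:
    print(f"{w:7.1f} x {h:7.1f}")
```

With an FPN, the scales are often spread across pyramid levels (one scale per level), in which case each location on a level would carry only the two aspect-ratio variants. Whether the submission did this is not stated.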

Trainset:
The CVPR'19 and MOT16 datasets were included in the trainset. Only pedestrian instances that are considered by the evaluator (i.e. whose consider flag equals 1) and whose visibility is above 0.25 were used during the training stage.
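The filtering rule above can be sketched as follows. This assumes the comma-separated MOTChallenge ground-truth layout (frame, id, left, top, width, height, consider flag, class, visibility) with class 1 denoting pedestrians; the names `GtBox`, `parse_gt_line` and `select_training_boxes` are illustrative and do not come from the authors' code.

```python
from typing import Iterable, List, NamedTuple


class GtBox(NamedTuple):
    frame: int
    track_id: int
    left: float
    top: float
    width: float
    height: float
    consider: int      # 1 if the evaluator takes the instance into account
    class_id: int      # assumed: 1 means pedestrian
    visibility: float  # fraction of the box that is visible, in [0, 1]


PEDESTRIAN_CLASS = 1   # assumption about the class encoding
MIN_VISIBILITY = 0.25  # threshold stated in the description


def parse_gt_line(line: str) -> GtBox:
    """Parse one comma-separated ground-truth line into a GtBox."""
    f = line.strip().split(",")
    return GtBox(int(f[0]), int(f[1]), float(f[2]), float(f[3]),
                 float(f[4]), float(f[5]), int(f[6]), int(f[7]), float(f[8]))


def select_training_boxes(lines: Iterable[str]) -> List[GtBox]:
    """Keep pedestrians with consider == 1 and visibility strictly above 0.25."""
    kept = []
    for line in lines:
        if not line.strip():
            continue
        box = parse_gt_line(line)
        if (box.class_id == PEDESTRIAN_CLASS
                and box.consider == 1
                and box.visibility > MIN_VISIBILITY):
            kept.append(box)
    return kept
```

For example, passing the lines of a sequence's ground-truth file to `select_training_boxes` would return only the boxes eligible for training under this rule.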

Image preprocessing:
The input images were resized so that the shorter side measured 1080 pixels, with the original aspect ratio preserved.
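The resizing rule amounts to a single scale factor applied to both sides. A minimal sketch, where `resized_shape` is a hypothetical helper and rounding to the nearest pixel is an assumption (the description does not say how fractional sizes were handled):

```python
def resized_shape(width: int, height: int, short_side: int = 1080):
    """Scale (width, height) so the shorter side equals short_side.

    The same factor is applied to both sides, so the aspect ratio is
    preserved up to rounding.
    """
    scale = short_side / min(width, height)
    return round(width * scale), round(height * scale)


print(resized_shape(640, 480))    # (1440, 1080)
print(resized_shape(1920, 1080))  # (1920, 1080)
```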

Training protocol:
Synchronized SGD was employed to train the model on 8 GPUs (NVIDIA P100). Each mini-batch included a single image per GPU. The weight decay and momentum values were set to 0.001 and 0.9, respectively. Because the scenes considered were of high density (246 pedestrians per frame), the RoI and NMS top-k counts per image were increased during both the training and testing stages.
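As an illustration of the update rule, the sketch below averages per-GPU gradients and then applies one SGD step with momentum and weight decay. It follows one common convention (weight decay added to the gradient before the momentum update) and is not taken from the authors' training code. The learning rate is a hypothetical value, since the description does not state it.

```python
MOMENTUM = 0.9        # from the description
WEIGHT_DECAY = 0.001  # from the description
NUM_GPUS = 8          # from the description
IMAGES_PER_GPU = 1    # from the description
LEARNING_RATE = 0.01  # hypothetical: not given in the description


def average_gradients(per_gpu_grads):
    """Synchronized SGD: average the gradients computed on each GPU."""
    n = len(per_gpu_grads)
    return [sum(column) / n for column in zip(*per_gpu_grads)]


def sgd_step(weights, grads, velocity, lr=LEARNING_RATE,
             momentum=MOMENTUM, weight_decay=WEIGHT_DECAY):
    """One SGD update with momentum and L2 weight decay.

    Returns the new weights and the new velocity buffers.
    """
    new_weights, new_velocity = [], []
    for w, g, v in zip(weights, grads, velocity):
        g = g + weight_decay * w  # L2 penalty folded into the gradient
        v = momentum * v + g      # momentum accumulation
        new_weights.append(w - lr * v)
        new_velocity.append(v)
    return new_weights, new_velocity


# Effective mini-batch size across all GPUs.
effective_batch = NUM_GPUS * IMAGES_PER_GPU
```

With one image per GPU, every update is therefore based on 8 images in total.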

Hardware:

CPU: 3GHz, 1 core; GPU: 1.5GHz

Detector:

Public

Last submitted:

June 03, 2019

Published:

June 12, 2019 at 12:47:35 CET

Submissions:

1

Open source:

No

Project page / code:

n/a

Reference:

D. Borysenko, D. Mykheievskyi, V. Porokhonskyy. ODESA: Object Descriptor that is Smooth Appearance-wise for object tracking tasks. To be submitted to ECCV'20.

Benchmark performance:

AP      Specifications
0.81    CPU: 3GHz, 1 core; GPU: 1.5GHz

Detailed performance:

Sequence     AP
CVPR19-04    0.9049
CVPR19-06    0.6795
CVPR19-07    0.9051
CVPR19-08    0.6998

Raw data:

n/a

