TAO-VOS is an extension of the TAO Benchmark, where we added segmentation mask annotations. TAO-VOS contains 626 high resolution videos, captured in diverse environments, which are half a minute long on average and cover a large variety of categories. The validation set of 126 sequences was annotated with masks fully manually. The training set of 500 sequences was annotated semi-automatically at high quality level with minor errors in the masks (for details see ). As for the original TAO Benchmark, the videos are annotated at 1 FPS, while the raw videos have 30 FPS. Here we provide the masks of the annotated frames together with the corresponding images. If you want to have the images of the intermediate frames (full 30 FPS), please download them from the original TAO Benchmark.
|TAO_VOS_val||30||1280x720||0 (00:00)||835||14987||0.0||Validation set, for training trackers||link|||
|TAO_VOS_train||30||1280x720||0 (00:00)||2833||59104||0.0||Training set, for training trackers||link|||
|Total||0 frm. |
|||Reducing the Annotation Effort for Video Object Segmentation Datasets. In WACV, 2021.|