TAO VOS Benchmark

TAO-VOS is an extension of the TAO Benchmark, where we added segmentation mask annotations. TAO-VOS contains 626 high resolution videos, captured in diverse environments, which are half a minute long on average and cover a large variety of categories. The validation set of 126 sequences was annotated with masks fully manually. The training set of 500 sequences was annotated semi-automatically at high quality level with minor errors in the masks (for details see [1]). As for the original TAO Benchmark, the videos are annotated at 1 FPS, while the raw videos have 30 FPS. Here we provide the masks of the annotated frames together with the corresponding images. If you want to have the images of the intermediate frames (full 30 FPS), please download them from the original TAO Benchmark.

Training Set

Sample	Name	FPS	Resolution	Length	Tracks	Boxes	Density	Description	Source	Ref.
	TAO_VOS_val	30	1280x720	0 (00:00)	835	14987	0.0	Validation set, for training trackers	link	[1]
	TAO_VOS_train	30	1280x720	0 (00:00)	2833	59104	0.0	Training set, for training trackers	link	[1]
	Total			0 frm. (0 s.)	3668	74091	inf

Test Set

Sample	Name	FPS	Resolution	Length	Tracks	Boxes	Density	Description	Source	Ref.
	Total			0 frm. (0 s.)			nan

Download

Get all data (2.4GB)
Get files (no img) only (113MB)
Development Kit

TAO VOS Benchmark

Training Set

Test Set

Download

References: