d3s_repro

This project was developed as part of the ML Reproducibility Challenge 2020 and Spring 2021 and aims to reproduce the results of the paper D3S - A Discriminative Single Shot Segmentation Tracker. The paper describes a new neural network architecture, D3S, for both visual object tracking and video object segmentation. The original implementation can be found here.

Scope of Reproducibility

Due to limited time, our reproducibility study covers only the training and evaluation of D3S on visual object tracking tasks.

Methodology

Our work is based on the code provided by the authors of the original paper. We reorganized the training code and partially re-implemented it, so our version contains only the code that is strictly needed (the original code also contains experiments not presented in the paper). For model evaluation, we follow the authors of the original paper and use the pytracking framework.

We used an NVIDIA Tesla V100 GPU with CUDA 9.2 and PyTorch 1.7.1 for model training and validation. To train the model, download the YouTube-VOS 2018 dataset into the DATA/ folder and run:

python -m experiments.run_training
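
Before starting a long training run, it can help to confirm that the dataset is actually in place. The following is a minimal sketch and not part of the repository: the helper name `dataset_ready` is hypothetical, and it only checks that the DATA/ folder exists and is not empty, because the expected layout inside that folder is not described here.

```python
from pathlib import Path


def dataset_ready(data_dir: str = "DATA") -> bool:
    """Return True if data_dir exists and contains at least one entry.

    Hypothetical helper: it does not validate the YouTube-VOS layout.
    """
    path = Path(data_dir)
    return path.is_dir() and any(path.iterdir())


if __name__ == "__main__":
    if dataset_ready():
        print("DATA/ looks populated; run: python -m experiments.run_training")
    else:
        print("DATA/ is missing or empty; download the dataset first.")
```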

To evaluate the model, download the vot2016, vot2018, GOT10-k and TrackingNet datasets, set the correct paths in pytracking/evaluation/local.py and run the command for the corresponding dataset:

vot2016 dataset:

python pytracking/run_tracker.py segm default_params --dataset vot16

vot2018 dataset:

python pytracking/run_tracker.py segm default_params --dataset vot18

GOT10-k dataset:

python pytracking/run_tracker.py segm default_params --dataset gotv

TrackingNet dataset:

python pytracking/run_tracker.py segm default_params --dataset tn
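
The four evaluation commands above differ only in the value passed to `--dataset`, so they can be run one after another from a small driver script. This is a sketch under that assumption, not a script shipped with the repository: the names `DATASETS`, `build_command` and `run_all` are hypothetical, and the script defaults to a dry run that only prints the commands.

```python
import subprocess
import sys
from typing import List

# Dataset identifiers exactly as used in the commands above.
DATASETS = ["vot16", "vot18", "gotv", "tn"]


def build_command(dataset: str) -> List[str]:
    """Build the argument list for one evaluation run."""
    return [
        sys.executable, "pytracking/run_tracker.py",
        "segm", "default_params", "--dataset", dataset,
    ]


def run_all(dry_run: bool = True) -> None:
    """Print each command and, unless dry_run is set, execute it."""
    for dataset in DATASETS:
        cmd = build_command(dataset)
        print(" ".join(cmd))
        if not dry_run:
            # check=True raises on a non-zero exit code, so the loop
            # stops at the first failing run.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    run_all(dry_run=True)
```

Calling `run_all(dry_run=False)` from the repository root would execute the runs sequentially; the dataset paths still have to be set in pytracking/evaluation/local.py beforehand.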

The tools provided with the datasets were used to calculate the metrics.

Results

The table below compares our results with those reported in the original paper:

Dataset	Metric	Our result	Original result
vot 2016	EAO	0.494	0.493
	Acc.	0.67	0.66
	Rob.	0.131	0.131
vot 2018	EAO	0.487	0.489
	Acc.	0.63	0.64
	Rob.	0.153	0.150
GOT10-k	AO	60	59.7
	SR0.75	47.3	46.2
	SR0.5	68.6	67.6
TrackingNet	AUC	72.8	72.8
	Prec.	66.5	66.4
	Prec.N	76.8	76.8
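
To judge how closely the reproduced numbers match, the relative difference of each metric can be computed from the table. The sketch below is illustrative only: the names `RESULTS`, `relative_diff_percent` and `largest_deviation` are hypothetical, and it assumes that our GOT10-k AO is taken as 60 on the percent scale (equivalently 0.60 on a 0 to 1 scale) so that it is comparable with the original 59.7.

```python
# (dataset, metric, our result, original result), copied from the table above.
# Assumption: our GOT10-k AO is written on the percent scale (60.0).
RESULTS = [
    ("vot2016", "EAO", 0.494, 0.493),
    ("vot2016", "Acc.", 0.67, 0.66),
    ("vot2016", "Rob.", 0.131, 0.131),
    ("vot2018", "EAO", 0.487, 0.489),
    ("vot2018", "Acc.", 0.63, 0.64),
    ("vot2018", "Rob.", 0.153, 0.150),
    ("GOT10-k", "AO", 60.0, 59.7),
    ("GOT10-k", "SR0.75", 47.3, 46.2),
    ("GOT10-k", "SR0.5", 68.6, 67.6),
    ("TrackingNet", "AUC", 72.8, 72.8),
    ("TrackingNet", "Prec.", 66.5, 66.4),
    ("TrackingNet", "Prec.N", 76.8, 76.8),
]


def relative_diff_percent(ours: float, original: float) -> float:
    """Signed difference of ours from original, in percent of original."""
    return (ours - original) / original * 100.0


def largest_deviation(rows):
    """Return the row with the largest absolute relative difference."""
    return max(rows, key=lambda r: abs(relative_diff_percent(r[2], r[3])))


if __name__ == "__main__":
    for dataset, metric, ours, original in RESULTS:
        diff = relative_diff_percent(ours, original)
        print(f"{dataset:12s} {metric:7s} {diff:+.2f}%")
```

With the values above, every metric stays within 3% of the original result, and the largest relative difference is on GOT10-k SR0.75.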

Training takes 16 hours. Evaluation speed is listed in the table below:

Dataset	vot2016	vot2018	GOT10-k	TrackingNet
Our FPS	22	21	16	23

The original paper reports an evaluation speed of 25 FPS.
