This is the official code for the NeurIPS 2021 Machine Learning for Autonomous Driving Workshop paper, "Self-supervised Pretraining for Scene Change Detection" by Vijaya Raghavan Thiruvengadathan Ramkumar, Prashant Bhat, Elahe Arani, and Bahram Zonooz, in which we propose D-SSCD, a novel self-supervised pretraining architecture based on differencing for scene change detection.
High Definition (HD) maps provide highly accurate details of the surrounding environment that aid in the precise localization of autonomous vehicles. To provide the most recent information, these HD maps must remain up-to-date with changes in the real world. Scene Change Detection (SCD) is a critical perception task that helps keep these maps updated by identifying changes between scenes captured at different time instances. Deep Neural Network (DNN)-based SCD methods hinge on the availability of large-scale labeled images, which are expensive to obtain. Therefore, current SCD methods depend heavily on transfer learning from the large labeled ImageNet dataset; however, this induces a domain shift that degrades change detection performance. To address these challenges, we propose D-SSCD, a novel self-supervised pretraining method for SCD that learns temporally consistent representations between pairs of images. D-SSCD uses absolute feature differencing to learn distinctive representations of the changed regions directly from unlabeled pairs of images. Our experimental results on the VL-CMU-CD and Panoramic Change Detection (PCD) datasets demonstrate the effectiveness of the proposed method: compared to the widely used ImageNet pretraining strategy, which relies on more than a million additional labeled images, D-SSCD matches or surpasses it without using any additional data. Our results also demonstrate the robustness of D-SSCD to natural corruptions, its out-of-distribution generalization, and its superior performance in limited-label scenarios.
For more details, please see the Paper and Presentation.
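The core idea above combines absolute feature differencing of the two temporal views with a redundancy-reduction (Barlow Twins-style) objective. The following is a minimal, framework-agnostic NumPy sketch of that idea, not the repository's actual training code; function names and shapes are illustrative assumptions.

```python
import numpy as np

def difference_features(f_t0, f_t1):
    """Absolute feature differencing |f(I_t0) - f(I_t1)|: the D-SSCD idea of
    focusing the representation on changed regions (illustrative sketch)."""
    return np.abs(f_t0 - f_t1)

def barlow_twins_loss(z_a, z_b, lam=5e-3):
    """Barlow Twins-style objective (sketch): push the cross-correlation
    matrix of two embedding batches toward the identity matrix.
    z_a, z_b: (batch, dim) projected features."""
    n, _ = z_a.shape
    # Standardize each feature dimension across the batch.
    z_a = (z_a - z_a.mean(0)) / (z_a.std(0) + 1e-6)
    z_b = (z_b - z_b.mean(0)) / (z_b.std(0) + 1e-6)
    c = z_a.T @ z_b / n                                  # (dim, dim) cross-correlation
    on_diag = ((np.diag(c) - 1.0) ** 2).sum()            # invariance term
    off_diag = (c ** 2).sum() - (np.diag(c) ** 2).sum()  # redundancy-reduction term
    return on_diag + lam * off_diag
```

In the actual method, the differenced features are passed through a projection head before the loss is applied; see the paper for the exact architecture.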
- python 3.6+
- opencv 3.4.2+
- pytorch 1.6.0
- torchvision 0.4.0+
- tqdm 4.51.0
- tensorboardX 2.1
Our network is tested on two datasets for street-view scene change detection.
- 'PCD' dataset from Change detection from a street image pair using CNN features and superpixel segmentation.
- Information on obtaining 'TSUNAMI', 'GSV', and the preprocessed datasets for training and testing is available here.
- 'VL-CMU-CD' dataset from Street-View Change Detection with Deconvolutional Networks.
- 'VL-CMU-CD': [googledrive]
- For D-SSCD pretraining: dataset handling is included in `DSSCD/dataset/CMU.py` and `DSSCD/dataset/PCD.py`.
- For finetuning and evaluation: please follow the preprocessing method used by the official implementation of the paper "Dynamic Receptive Temporal Attention Network for Street Scene Change Detection" (DR-TANet).
Dataset folder structure for VL-CMU-CD:
├── VL-CMU-CD
│ ├── Image_T0
│ ├── Image_T1
│ ├── Ground Truth
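Given the folder layout above, pairing the co-registered images from the two time instances can be sketched as follows. The folder names come from this README; the assumption that paired images share a filename is illustrative, not taken from the repository.

```python
from pathlib import Path

def list_image_pairs(root):
    """Pair up same-named images from Image_T0/ and Image_T1/ under the
    VL-CMU-CD root (sketch; assumes paired files share a filename)."""
    root = Path(root)
    t0_dir, t1_dir = root / "Image_T0", root / "Image_T1"
    pairs = []
    for t0 in sorted(t0_dir.glob("*.png")):
        t1 = t1_dir / t0.name
        if t1.exists():            # keep only complete (T0, T1) pairs
            pairs.append((t0, t1))
    return pairs
```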
- For training 'SSCD' on VL-CMU-CD dataset:
python3 DSSCD/train.py --ssl_batchsize 16 --ssl_epochs 400 --save_dir /outputs --data_dir /path/to/VL-CMU-CD --img_size 256 --n_proj 256 --hidden_layer 512 --output_stride 8 --pre_train False --m_backbone False --sscd_barlow_twins True --dsscd_barlow_twins False
- For training 'D-SSCD' on VL-CMU-CD dataset:
python3 DSSCD/train.py --ssl_batchsize 16 --ssl_epochs 400 --save_dir /outputs --data_dir /path/to/VL-CMU-CD --img_size 256 --n_proj 256 --hidden_layer 512 --output_stride 8 --pre_train False --m_backbone False --dsscd_barlow_twins True --sscd_barlow_twins False
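The two commands above differ only in which objective flag is enabled (`--sscd_barlow_twins` vs. `--dsscd_barlow_twins`). A minimal sketch of how such flags might select the pretraining objective, with the flag names taken from the commands above but the selection logic assumed:

```python
def select_objective(sscd_barlow_twins: bool, dsscd_barlow_twins: bool) -> str:
    """Pick the pretraining objective from the CLI flags (sketch).
    Exactly one of the two flags should be enabled."""
    if sscd_barlow_twins == dsscd_barlow_twins:
        raise ValueError(
            "enable exactly one of --sscd_barlow_twins / --dsscd_barlow_twins"
        )
    return "SSCD" if sscd_barlow_twins else "D-SSCD"
```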
We evaluate our SSCD and D-SSCD pretraining on DR-TANet.
- Please follow the training and testing procedure used by the official implementation of the paper "Dynamic Receptive Temporal Attention Network for Street Scene Change Detection" (DR-TANet).
Start training with DR-TANet on the 'VL-CMU-CD' dataset.
python3 train.py --dataset vl_cmu_cd --datadir /path_to_dataset --checkpointdir /path_to_check_point_directory --max-epochs 150 --batch-size 16 --encoder-arch resnet50 --epoch-save 25 --drtam --refinement
Start evaluating with DR-TANet (ResNet-50 encoder) on the 'PCD' dataset.
python3 eval.py --dataset pcd --datadir /path_to_dataset --checkpointdir /path_to_check_point_directory --resultdir /path_to_save_eval_result --encoder-arch resnet50 --drtam --refinement --store-imgs
Start evaluating with DR-TANet (ResNet-18 encoder) on the 'PCD' dataset.
python3 eval.py --dataset pcd --datadir /path_to_dataset --checkpointdir /path_to_check_point_directory --resultdir /path_to_save_eval_result --encoder-arch resnet18 --drtam --refinement --store-imgs
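Evaluation for SCD is typically reported as the pixel-wise F1 score over the binary change masks. A short sketch of that metric (illustrative, not the repository's exact evaluation code):

```python
import numpy as np

def f1_change(pred, gt):
    """Pixel-wise F1 score between a predicted binary change mask and the
    ground-truth mask (sketch of the standard SCD metric)."""
    pred, gt = pred.astype(bool), gt.astype(bool)
    tp = np.logical_and(pred, gt).sum()       # changed pixels found
    fp = np.logical_and(pred, ~gt).sum()      # false alarms
    fn = np.logical_and(~pred, gt).sum()      # missed changes
    precision = tp / max(tp + fp, 1)
    recall = tp / max(tp + fn, 1)
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)
```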
We analyze our D-SSCD model under three scenarios: (1) robustness to natural corruptions, (2) out-of-distribution data, and (3) limited labeled data. For more details, please see the Paper. For ease of comparison, we provide the model checkpoints for these analyses below:
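For the natural-corruption analysis, test images are perturbed at increasing severity levels, in the style of common corruption benchmarks such as ImageNet-C. A hedged sketch of one such corruption (the noise magnitudes per severity are illustrative assumptions, not the values used in the paper):

```python
import numpy as np

def gaussian_noise(img, severity=1):
    """Apply Gaussian noise at a given severity level (1-5), in the style
    of ImageNet-C corruptions (sketch). img: float array in [0, 1]."""
    sigma = [0.04, 0.06, 0.08, 0.09, 0.10][severity - 1]  # assumed values
    rng = np.random.default_rng(0)
    noisy = img + rng.normal(scale=sigma, size=img.shape)
    return np.clip(noisy, 0.0, 1.0)                       # keep valid range
```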
If you find the code useful in your research, please consider citing our paper:
@article{ramkumarself,
  title={Self-Supervised Pretraining for Scene Change Detection},
  author={Ramkumar, Vijaya Raghavan T and Bhat, Prashant and Arani, Elahe and Zonooz, Bahram}
}