SegSlum

Official repository of SegSlum, a semi-supervised learning framework for slum detection from satellite imagery.

Minimally supervised learning on sub-meter satellite imagery reveals slum expansion during the COVID-19 pandemic
Jeasurk Yang, Sungwon Park, Hyoshin Kim, Sumin Lee, Jungwon Kim, Jihee Kim, Meeyoung Cha
Communications Sustainability, 2026 · Paper · Project Page

Overview

SegSlum detects informal settlements (slums) from 60 cm/pixel RGB satellite imagery using a semi-supervised self-training pipeline. It requires only a small number of labeled images (~3% of the dataset) to achieve mIoU > 0.9 across diverse cities and time periods.

Key features:

Semi-supervised learning with adaptive pseudo-label selection
GRAM-matrix texture consistency regularization for photometric robustness
Multi-temporal and cross-national slum detection
Applied to 12 cities across Asia, Africa, and Latin America (2014–2024)

This codebase builds on ST++ (Yang et al., CVPR 2022).

Environment

Tested with Python 3.8+ and CUDA-enabled PyTorch.

pip install -r requirements.txt

Dependencies: torch, torchvision, numpy, pandas, scipy, scikit-image, opencv-python, Pillow, tqdm

Data

Directory layout

--data-root must point to a directory with this structure:

<DATA_ROOT>/
├── image/
│   └── <id>.png        # 256×256 RGB satellite tile (60 cm/pixel)
└── label/
    └── <id>.tif        # binary mask (0 = non-slum, 1 = slum)

Image IDs (one per line, without extension) are listed in split files under dataset/splits/. Each subfolder corresponds to one city/region (e.g. BRA_NEW_multiyear, KEN_NEW_multiyear).

Downloads

Resource	Link
Label dataset	Download
Model checkpoints	Download

Pretrained backbones

Place backbone checkpoints in pretrained/:

File	Download
`resnet50.pth`	https://download.pytorch.org/models/resnet50-0676ba61.pth
`resnet101.pth`	https://download.pytorch.org/models/resnet101-63fe2227.pth

Training

CUDA_VISIBLE_DEVICES=0,1 python -W ignore main.py \
  --dataset <DATASET_NAME> \
  --data-root <PATH_TO_DATASET_ROOT> \
  --batch-size 16 \
  --backbone resnet50 \
  --model deeplabv3plusaux \
  --labeled-id-path   dataset/splits/<DATASET_NAME>/<SPLIT>/labeled.txt \
  --unlabeled-id-path dataset/splits/<DATASET_NAME>/<SPLIT>/unlabeled.txt \
  --pseudo-mask-path  outdir/pseudo_masks/<EXPERIMENT_NAME> \
  --save-path         outdir/models/<EXPERIMENT_NAME> \
  --reliable-id-path  outdir/reliable_ids/<EXPERIMENT_NAME> \
  --plus

--plus enables ST++ selective re-training. Omit for plain ST.
--nogram disables GRAM-matrix consistency (baseline).

CLI arguments

Argument	Default	Description
`--data-root`	required	Root directory containing `image/` and `label/`
`--dataset`	required	Name of the split folder under `dataset/splits/`
`--labeled-id-path`	required	Text file listing labeled image IDs
`--unlabeled-id-path`	required	Text file listing unlabeled image IDs
`--pseudo-mask-path`	required	Output path for pseudo masks
`--save-path`	required	Output path for model checkpoints
`--reliable-id-path`	ST++ only	Output path for reliable/unreliable ID lists
`--batch-size`	16	Batch size
`--lr`	0.01	Initial learning rate
`--epochs`	100	Epochs per stage
`--crop-size`	256	Input crop size
`--backbone`	resnet50	`resnet50` or `resnet50_whitening`
`--model`	deeplabv3plusaux	`deeplabv3plusaux`, `deeplabv3plus`, `deeplabv3plusauxrgl`, `deeplabv2`, `pspnet`
`--plus`	—	Enable ST++ selective re-training
`--nogram`	—	Disable GRAM-matrix consistency

Citation

@article{yang2026segslum,
  title     = {Minimally supervised learning on sub-meter satellite imagery reveals slum expansion during the {COVID}-19 pandemic},
  author    = {Yang, Jeasurk and Park, Sungwon and Kim, Hyoshin and Lee, Sumin and Kim, Jungwon and Kim, Jihee and Cha, Meeyoung},
  journal   = {Communications Sustainability},
  volume    = {1},
  pages     = {52},
  year      = {2026},
  publisher = {Nature Publishing Group},
  doi       = {10.1038/s44458-026-00054-6}
}

Acknowledgement

This codebase is adapted from ST++:

Lihe Yang, Wei Zhuo, Lei Qi, Yinghuan Shi, Yang Gao. ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation. CVPR 2022. [paper] [code]

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
dataset		dataset
model		model
pretrained		pretrained
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
augmentation.py		augmentation.py
dataloader.py		dataloader.py
main.py		main.py
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SegSlum

Overview

Environment

Data

Directory layout

Downloads

Pretrained backbones

Training

CLI arguments

Citation

Acknowledgement

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SegSlum

Overview

Environment

Data

Directory layout

Downloads

Pretrained backbones

Training

CLI arguments

Citation

Acknowledgement

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages