PTNet: Prototype-Guided Task-Adaptive Network for Remote Sensing Change Captioning

A unified framework for joint change detection and captioning on UAV-based urban construction imagery.

Data Format

split_3_images/
├── train/
│   ├── A/          # Pre-change images
│   ├── B/          # Post-change images
│   └── Label/      # Binary change masks
├── val/
└── test/

wanzhengbanbe.json  # 5 captions per image pair from 5 different VLMs
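
As a quick sanity check on the layout above, the sketch below pairs files across A/, B/, and Label/ for one split. It assumes the three folders use identical file names for a given pair, which this README does not state. `list_pairs` is a hypothetical helper, not a function from this repository.

```python
from pathlib import Path


def list_pairs(split_dir):
    """Return (pre, post, mask) path triples for one split directory.

    Assumption: A/, B/ and Label/ share file names for each image pair.
    """
    split_dir = Path(split_dir)
    pre_dir, post_dir, mask_dir = (split_dir / n for n in ("A", "B", "Label"))
    triples = []
    for pre in sorted(p for p in pre_dir.iterdir() if p.is_file()):
        post, mask = post_dir / pre.name, mask_dir / pre.name
        # Fail loudly if a post-change image or a mask is missing.
        if not (post.is_file() and mask.is_file()):
            raise FileNotFoundError(f"incomplete pair for {pre.name} in {split_dir}")
        triples.append((pre, post, mask))
    return triples
```

Calling `list_pairs("split_3_images/train")` would then return one triple per training pair, or raise if any of the three files is absent.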

Each image pair expands to 5 independent training samples (one per caption). At evaluation, a single prediction is scored against all 5 references.
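
The difference between the training and evaluation views can be sketched as follows. The caption structure (a dict mapping a pair ID to its list of captions) is an assumption, since the JSON schema is not documented here. The scoring function is a toy clipped unigram precision, chosen only to show how one prediction is compared against several references; it is not necessarily a metric this repository reports.

```python
from collections import Counter


def expand_for_training(captions_by_pair):
    """One (pair_id, caption) sample per reference caption."""
    return [(pid, cap) for pid, caps in captions_by_pair.items() for cap in caps]


def clipped_unigram_precision(prediction, references):
    """Share of predicted tokens that appear in at least one reference,
    with each token's count clipped to its maximum count in any reference."""
    pred_counts = Counter(prediction.lower().split())
    max_ref = Counter()
    for ref in references:
        for tok, n in Counter(ref.lower().split()).items():
            max_ref[tok] = max(max_ref[tok], n)
    total = sum(pred_counts.values())
    if total == 0:
        return 0.0
    matched = sum(min(n, max_ref[tok]) for tok, n in pred_counts.items())
    return matched / total


# Hypothetical captions for a single image pair.
captions = {
    "pair_0001": [
        "a new building appears",
        "a building was built",
        "new roof on site",
        "a road was paved",
        "no change",
    ]
}
train_samples = expand_for_training(captions)  # 5 samples for 1 pair
score = clipped_unigram_precision("a new building", captions["pair_0001"])
```

Training sees five (pair, caption) samples for this pair, while evaluation produces one score from one prediction and the full set of references.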

Setup

pip install -r requirements.txt

Usage

Step 1 — Build Prototype Bank (run once before training)

python scripts/build_prototypes.py --dataset uccd

This runs K-means clustering on training-set difference features from CLIP layer 12, applies RBF spatial interpolation, and saves the prototype bank to ./cache/prototypes_uccd.pt.
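
To make the clustering step concrete, here is a minimal, standard-library-only sketch of K-means over difference features. It is not the code in scripts/build_prototypes.py: the difference operator (post minus pre) is an assumption, feature extraction from CLIP is replaced by plain lists of floats, and the RBF spatial interpolation step is omitted because its details are not given here.

```python
import random


def difference_feature(pre, post):
    """Element-wise post minus pre (assumed operator, not confirmed by the README)."""
    return [b - a for a, b in zip(pre, post)]


def squared_distance(u, v):
    return sum((a - b) ** 2 for a, b in zip(u, v))


def kmeans(features, k, iters=20, seed=0):
    """Plain Lloyd's algorithm; the k returned centroids play the role of prototypes."""
    rng = random.Random(seed)
    centroids = [list(f) for f in rng.sample(features, k)]
    for _ in range(iters):
        # Assignment step: attach every feature to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for f in features:
            nearest = min(range(k), key=lambda j: squared_distance(f, centroids[j]))
            clusters[nearest].append(f)
        # Update step: move each centroid to the mean of its members.
        for j, members in enumerate(clusters):
            if members:  # keep the old centroid if a cluster ends up empty
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return centroids
```

In the real pipeline the inputs would be high-dimensional tensors and the result would be serialized to the .pt file named above; the sketch only illustrates how cluster centroids become the prototype bank.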

Step 2 — Train

Single GPU:

python train.py --dataset uccd

Key arguments:

--dataset        uccd | whu_cdc
--output_dir     path to save checkpoints and logs
--resume         path to checkpoint to resume from
--batch_size     override batch size
--img_size       override input resolution
--no_wandb       disable wandb logging
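
For readers who want to see how these flags might be declared, the following argparse sketch mirrors the list above. It is an illustration, not the parser in train.py: the types, making `--dataset` required, a single-integer `--img_size`, and `None` defaults (meaning "fall back to the configured value") are all assumptions.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the documented training flags."""
    parser = argparse.ArgumentParser(description="PTNet training (illustrative sketch)")
    parser.add_argument("--dataset", choices=["uccd", "whu_cdc"], required=True,
                        help="dataset to train on")
    parser.add_argument("--output_dir", type=str, default=None,
                        help="path to save checkpoints and logs")
    parser.add_argument("--resume", type=str, default=None,
                        help="path to checkpoint to resume from")
    parser.add_argument("--batch_size", type=int, default=None,
                        help="override batch size")
    parser.add_argument("--img_size", type=int, default=None,
                        help="override input resolution")
    parser.add_argument("--no_wandb", action="store_true",
                        help="disable wandb logging")
    return parser


# Example: parse a command line equivalent to
#   python train.py --dataset uccd --batch_size 8 --no_wandb
args = build_parser().parse_args(["--dataset", "uccd", "--batch_size", "8", "--no_wandb"])
```

Leaving the override flags at `None` lets the training script distinguish "not given" from an explicit value before applying it on top of its defaults.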

Step 3 — Test

python test.py \
    --dataset uccd \
    --checkpoint outputs/ptnet_uccd/best_model.pt \
    --split test \
    --save_predictions
