Shiyuan Shen, Zhongyun Bao, Wenju Xu, Chunxia Xiao
- Upgrade the LDM to Stable Diffusion 1.5 and condition on the input image instead of a text prompt.
- Replace Latent HDR Guidance with an equivalent ControlNet-based substitution to accelerate the fine-tuning process.
- Training epochs are changed to 40 for id_net, 50 for sg_net, 100 for asg_net, 50 for hdr_net, and 5 for controlnet.
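As a minimal sketch of the image-conditioning change, the input image could be encoded with the CLIP model shipped in pano_gen/openai/ and its patch tokens fed to the UNet's cross-attention in place of text embeddings. The wiring below is an illustrative assumption, not the repo's exact code, and it presumes the transformers package from requirements.txt:

```python
# Hedged sketch: CLIP image tokens replace the text-prompt embedding as the
# diffusion model's cross-attention conditioning. Names here are assumptions.
import torch
from PIL import Image
from transformers import CLIPImageProcessor, CLIPVisionModel

clip_path = "pano_gen/openai/clip-vit-base-patch32"
processor = CLIPImageProcessor.from_pretrained(clip_path)
encoder = CLIPVisionModel.from_pretrained(clip_path).eval()

image = Image.open("input.jpg").convert("RGB")  # hypothetical input file
pixels = processor(images=image, return_tensors="pt").pixel_values

with torch.no_grad():
    # (1, 50, 768) patch-token sequence, used where text embeddings would go
    cond = encoder(pixel_values=pixels).last_hidden_state

# cond would then be passed as the cross-attention context of the SD 1.5 UNet.
```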
IllumiDiff/
├── ckpts/ # Pre-trained model checkpoints
├── lighting_est/ # Stage 1 (id_net, sg_net, asg_net) and Stage 3 (hdr_net)
│ ├── asg_fitting_fixed_ldr_adam_batch.py # fit ASG ground truth
│ ├── sg_fitting_free_nadam.py # fit SG ground truth (see the sketch below the tree)
│ ├── dataset.py # datasets for Stage 1 and Stage 3
│ ├── dataset_processing.py # dataset processing scripts (still being organized)
│ ├── models.py # model definitions for Stage 1 and Stage 3
│ ├── modules.py # Lightning modules for Stage 1 and Stage 3
├── pano_gen/
│ ├── cldm # ControlNet core code
│ ├── configs # configuration files for model definitions
│ ├── ldm # LDM core code
│ ├── openai # CLIP model
│ ├── dataset.py # dataset for the LDM
│ ├── pano_tools.py # tools for panorama projection
│ ├── tool_add_control.py # ckpt initialization
│ ├── outpainting-mask.png # outpainting mask
├── inference_lighting_est.py # inference script for Stage 1 and Stage 3
├── inference_pano_gen.py # inference script for Stage 2
├── pipeline_lighting_est.py # pipeline for lighting estimation (Stage 1 + Stage 3)
├── pipeline_full.py # full pipeline for IllumiDiff (Stage 1 + Stage 2 + Stage 3)
├── train_lighting_est.py # training script for Stage 1 and Stage 3
├── train_pano_gen.py # training script for Stage 2
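As referenced in the tree above, sg_fitting_free_nadam.py fits spherical-Gaussian (SG) lighting ground truth. Below is a minimal sketch of that kind of fitting; beyond the NAdam optimizer named in the filename, the lobe count, parameterization, and MSE loss are all assumptions:

```python
# Minimal sketch of SG fitting: optimize N spherical-Gaussian lobes so their
# sum reproduces a target panorama. Parameterization and loss are assumptions.
import torch
import torch.nn.functional as F

def pano_directions(h, w):
    """Unit view directions for each pixel of an equirectangular panorama."""
    v, u = torch.meshgrid(torch.arange(h), torch.arange(w), indexing="ij")
    theta = (v + 0.5) / h * torch.pi        # polar angle
    phi = (u + 0.5) / w * 2 * torch.pi      # azimuth
    return torch.stack([torch.sin(theta) * torch.cos(phi),
                        torch.sin(theta) * torch.sin(phi),
                        torch.cos(theta)], dim=-1)  # (h, w, 3)

def render_sg(axes, sharpness, amplitude, dirs):
    """Evaluate a sum of SG lobes: A * exp(lambda * (dot(axis, d) - 1))."""
    axes = F.normalize(axes, dim=-1)                       # (n, 3)
    cos = dirs.reshape(-1, 3) @ axes.T                     # (hw, n)
    weights = torch.exp(sharpness.clamp(min=1e-2) * (cos - 1.0))
    return weights @ amplitude                             # (hw, 3)

target = torch.rand(64, 128, 3)           # stand-in for an LDR panorama
dirs = pano_directions(64, 128)
n = 12                                    # assumed lobe count
axes = torch.randn(n, 3, requires_grad=True)
sharpness = torch.full((n,), 10.0, requires_grad=True)
amplitude = torch.rand(n, 3, requires_grad=True)

opt = torch.optim.NAdam([axes, sharpness, amplitude], lr=1e-2)
for step in range(2000):
    opt.zero_grad()
    pred = render_sg(axes, sharpness, amplitude, dirs)
    loss = F.mse_loss(pred, target.reshape(-1, 3))
    loss.backward()
    opt.step()
```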
- Config files for all networks.
- Simplify the LDM code.
- Full dataset processing script.
- Train all stages together.
conda create -n illumidiff python=3.10
conda activate illumidiff
conda install pytorch==2.2.2 torchvision==0.17.2 pytorch-cuda=11.8 numpy=1.26.4 -c pytorch -c nvidia
conda install lightning -c conda-forge
pip install -r requirements.txt
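A quick sanity check that the pinned versions are active (expected numbers taken from the commands above):

```python
# Verify that the environment matches the pinned versions.
import lightning
import numpy
import torch
import torchvision

print("torch:", torch.__version__)              # expected 2.2.2
print("torchvision:", torchvision.__version__)  # expected 0.17.2
print("numpy:", numpy.__version__)              # expected 1.26.4
print("lightning:", lightning.__version__)
print("CUDA available:", torch.cuda.is_available())
```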
You can download them from OneDrive. Unzip clip-vit-base-patch32.zip to IllumiDiff/pano_gen/openai/clip-vit-base-patch32/, and put all ckpts in IllumiDiff/ckpts/. Note that control_sd15_clip_asg_sg.ckpt is required only for training from scratch.
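Before running anything, it may help to verify the layout. In the sketch below, only control_sd15_clip_asg_sg.ckpt is named by this README; extend the list with the actual checkpoint file names from the download:

```python
# Verify the checkpoint layout. Only control_sd15_clip_asg_sg.ckpt is named
# in this README; add the remaining file names from the OneDrive download.
from pathlib import Path

root = Path("IllumiDiff")
required = [
    root / "pano_gen/openai/clip-vit-base-patch32/config.json",
    root / "ckpts/control_sd15_clip_asg_sg.ckpt",  # training from scratch only
]
for path in required:
    status = "ok" if path.exists() else "MISSING"
    print(f"{status:7s} {path}")
```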
Full pipeline inference (the input is a single image):
python pipeline_full.py --input_path <path> --output_path <path>
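To process a folder of images, a thin wrapper can invoke the pipeline once per image; this sketch uses only the two flags shown above, and the folder names are hypothetical:

```python
# Run the full pipeline once per image by shelling out to pipeline_full.py,
# using only the --input_path/--output_path flags shown in this README.
import subprocess
from pathlib import Path

for img in sorted(Path("inputs").glob("*.jpg")):  # hypothetical folder
    out = Path("outputs") / img.stem
    out.mkdir(parents=True, exist_ok=True)
    subprocess.run(["python", "pipeline_full.py",
                    "--input_path", str(img),
                    "--output_path", str(out)], check=True)
```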
Stage 1 or Stage 3 only:
python pipeline_lighting_est.py --input_path <path> --input_pano_path <path> --output_path <path>
Single network only:
For id_net, sg_net, asg_net, or hdr_net:
python inference_lighting_est.py --task <network>
For pano_gen:
python inference_pano_gen.py
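To run all four Stage 1/3 networks in sequence, a small driver can shell out to the inference script; only the --task values shown above are assumed:

```python
# Run each Stage 1/3 network in sequence via the inference script above,
# using only the --task flag shown in this README.
import subprocess

for task in ["id_net", "sg_net", "asg_net", "hdr_net"]:
    subprocess.run(
        ["python", "inference_lighting_est.py", "--task", task],
        check=True,  # stop on the first failing network
    )
```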
See the paper for more details.
All networks are trained separately.
For id_net, sg_net, asg_net, or hdr_net:
python train_lighting_est.py --task <network>
For pano_gen:
python train_pano_gen.py --ckpt_path <path> --config_path <path>
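For orientation, the per-network epoch counts from the change list at the top could be wired into Lightning like this; the module and datamodule are placeholders, not the repo's actual classes (see models.py and modules.py for those):

```python
# Hedged sketch: per-task epoch counts (from the change list above) fed to a
# Lightning Trainer. The module/datamodule arguments are placeholders.
import lightning as L

EPOCHS = {"id_net": 40, "sg_net": 50, "asg_net": 100, "hdr_net": 50}

def train(task: str, module: L.LightningModule, datamodule: L.LightningDataModule):
    trainer = L.Trainer(max_epochs=EPOCHS[task], accelerator="auto")
    trainer.fit(module, datamodule=datamodule)
```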
For questions, please contact:
syshen@whu.edu.cn
@article{shen2025illumidiff,
title={IllumiDiff: Indoor Illumination Estimation from a Single Image with Diffusion Model},
author={Shen, Shiyuan and Bao, Zhongyun and Xu, Wenju and Xiao, Chunxia},
journal={IEEE Transactions on Visualization and Computer Graphics},
year={2025},
publisher={IEEE}
}