iTARFlow: Iterative Transformer Autoregressive Flows

This repository contains the code for Normalizing Flow with Iterative Denoising (iTARFlow).

Setup

pip install -r requirements.txt

Data Preparation

Place datasets under data/, e.g. data/imagenet/.

Pre-compute FID statistics for each dataset/resolution:

# ImageNet 64x64
torchrun --standalone --nproc_per_node 8 prepare_fid_stats.py --dataset imagenet --img_size 64

# ImageNet 128x128
torchrun --standalone --nproc_per_node 8 prepare_fid_stats.py --dataset imagenet --img_size 128

# ImageNet 256x256
torchrun --standalone --nproc_per_node 8 prepare_fid_stats.py --dataset imagenet --img_size 256
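
The three commands above differ only in --img_size, so they can be generated from one loop. The following is a minimal sketch and is not part of the repository: the helper name fid_stats_command is hypothetical, and it uses only the flags shown above. It prints the commands rather than running them.

```python
import shlex

RESOLUTIONS = (64, 128, 256)  # the ImageNet resolutions listed in this README


def fid_stats_command(img_size, nproc_per_node=8, dataset="imagenet"):
    """Build the prepare_fid_stats.py launch command for one resolution.

    Hypothetical helper: it only assembles flags that appear in this README.
    """
    return [
        "torchrun", "--standalone",
        "--nproc_per_node", str(nproc_per_node),
        "prepare_fid_stats.py",
        "--dataset", dataset,
        "--img_size", str(img_size),
    ]


commands = [fid_stats_command(size) for size in RESOLUTIONS]

if __name__ == "__main__":
    for cmd in commands:
        # Print only; pass cmd to subprocess.run(cmd, check=True) to launch it.
        print(shlex.join(cmd))
```

Adjust nproc_per_node if you have fewer than 8 GPUs on the node.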

Training

ImageNet 64x64

torchrun --standalone --nproc_per_node 8 train.py \
  --dataset imagenet --img_size 64 \
  --patch_sizes 2 --channels 1280 --layers_per_block 4 4 4 24 \
  --batch_size 512 --epochs 300 --cfg 2.1 --drop_label 0.1 --t0 1e-2 --t1 3e-1 \
  --reweight t --xflip 1 --ckpts_to_keep 20 \
  --sample_freq 10 --tag imagenet64

ImageNet 128x128

torchrun --standalone --nproc_per_node 8 train.py \
  --dataset imagenet --img_size 128 \
  --patch_sizes 4 --channels 1600 --layers_per_block 4 4 4 24 \
  --batch_size 512 --epochs 450 --cfg 2.1 --drop_label 0.1 --t0 1e-2 --t1 5e-1 \
  --reweight t --xflip 1 --ckpts_to_keep 20 \
  --sample_freq 10 --tag imagenet128

ImageNet 256x256

torchrun --standalone --nproc_per_node 8 train.py \
  --dataset imagenet --img_size 256 \
  --patch_sizes 8 --channels 2176 --layers_per_block 4 4 4 24 \
  --batch_size 512 --epochs 600 --cfg 2.1 --drop_label 0.0 --t0 1e-2 --t1 7e-1 \
  --reweight t --xflip 1 --ckpts_to_keep 20 \
  --sample_freq 10 --tag imagenet256
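
Across the three configurations the patch size doubles with the resolution, so the number of patches per image stays the same. The sketch below works through that arithmetic. It assumes that --patch_sizes splits the image into non-overlapping square patches of RGB pixels; this README does not confirm that, so treat the numbers as illustrative. The function name patch_grid is hypothetical.

```python
# img_size -> patch size, taken from the training commands above.
CONFIGS = {64: 2, 128: 4, 256: 8}


def patch_grid(img_size, patch_size, in_channels=3):
    """Return (tokens per image, values per token).

    Assumes non-overlapping square patches and RGB input; neither is
    confirmed by this README.
    """
    if img_size % patch_size != 0:
        raise ValueError("img_size must be divisible by patch_size")
    side = img_size // patch_size
    return side * side, in_channels * patch_size * patch_size


grid = {size: patch_grid(size, patch) for size, patch in CONFIGS.items()}

if __name__ == "__main__":
    for size, (tokens, dim) in grid.items():
        print(f"{size}x{size}: {tokens} tokens, {dim} values per token")
```

Under these assumptions every resolution yields a 32 x 32 grid (1024 tokens), while the per-token dimension grows from 12 to 48 to 192.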

Evaluation (FID)

If you changed any arguments during training, update the corresponding arguments below to match your configuration.

ImageNet 64x64

torchrun --standalone --nproc_per_node 8 evaluate_fid.py \
  --dataset imagenet --img_size 64 \
  --patch_sizes 2 --channels 1280 --layers_per_block 4 4 4 24 \
  --ckpt_file <path_to_checkpoint.pth> \
  --cfg 2.1 --t0 1e-2 --t1 3e-1 --sample_batch_size 1024 --denoising_batch_size 256

ImageNet 128x128

torchrun --standalone --nproc_per_node 8 evaluate_fid.py \
  --dataset imagenet --img_size 128 \
  --patch_sizes 4 --channels 1600 --layers_per_block 4 4 4 24 \
  --ckpt_file <path_to_checkpoint.pth> \
  --cfg 3.7 --t0 1e-2 --t1 5e-1 --sample_batch_size 600 --denoising_batch_size 200

ImageNet 256x256

torchrun --standalone --nproc_per_node 8 evaluate_fid.py \
  --dataset imagenet --img_size 256 \
  --patch_sizes 8 --channels 2176 --layers_per_block 4 4 4 24 \
  --ckpt_file <path_to_checkpoint.pth> \
  --cfg 4.2 --t0 1e-2 --t1 7e-1 \
  --sample_batch_size 720 --denoising_batch_size 144
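
In the commands of this README, the evaluation runs repeat the training values for --dataset, --img_size, --patch_sizes, --channels, --layers_per_block, --t0 and --t1, while --cfg and the sampling batch sizes are set separately. A quick way to catch a forgotten change is to compare those shared flags between a training command and an evaluation command. The following is a sketch only: parse_flags and mismatched_flags are hypothetical helpers, not part of the repository, and the training command is abbreviated.

```python
import shlex

# Flags whose values coincide between the training and evaluation commands
# in this README. --cfg is excluded because evaluation may use its own value.
SHARED_FLAGS = ("--dataset", "--img_size", "--patch_sizes", "--channels",
                "--layers_per_block", "--t0", "--t1")


def parse_flags(command):
    """Map each --flag after the script name to the values that follow it."""
    tokens = shlex.split(command.replace("\\\n", " "))  # join continued lines
    # Skip the launcher part (torchrun ... script.py) if a script is present.
    for i, token in enumerate(tokens):
        if token.endswith(".py"):
            tokens = tokens[i + 1:]
            break
    flags, current = {}, None
    for token in tokens:
        if token.startswith("--"):
            current = token
            flags[current] = []
        elif current is not None:
            flags[current].append(token)
    return flags


def mismatched_flags(train_command, eval_command):
    """Return the shared flags whose values differ between the two commands."""
    train, evaluate = parse_flags(train_command), parse_flags(eval_command)
    return [f for f in SHARED_FLAGS if train.get(f) != evaluate.get(f)]


TRAIN_64 = (
    "torchrun --standalone --nproc_per_node 8 train.py "
    "--dataset imagenet --img_size 64 "
    "--patch_sizes 2 --channels 1280 --layers_per_block 4 4 4 24 "
    "--batch_size 512 --epochs 300 --cfg 2.1 --drop_label 0.1 --t0 1e-2 --t1 3e-1"
)
EVAL_64 = (
    "torchrun --standalone --nproc_per_node 8 evaluate_fid.py "
    "--dataset imagenet --img_size 64 "
    "--patch_sizes 2 --channels 1280 --layers_per_block 4 4 4 24 "
    "--ckpt_file <path_to_checkpoint.pth> --cfg 2.1 --t0 1e-2 --t1 3e-1"
)

if __name__ == "__main__":
    # An empty list means the shared flags agree.
    print(mismatched_flags(TRAIN_64, EVAL_64))
```

An empty result means the two commands agree on every shared flag; any flag that is returned should be brought in line before evaluating a checkpoint.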
