Skip to content
/ iLAT Public

The Image Local Autoregressive Transformer (NIPS2021)

License

Notifications You must be signed in to change notification settings

ewrfcas/iLAT

Repository files navigation

The Image Local Autoregressive Transformer (NeurIPS 2021)

arXiv paper

teaser

Overview

We propose the two-stream convolution and the local autoregressive mask to maintain the image consistence without information leakage.

mask

News

  • Release the training codes.
  • Codes for pose editing.

Requirements

These codes are only tested in pytorch==1.3.1.

Preparation

  1. Download the pretrained sketch-vqgan, image-vqgan, and transformer.
  2. For masked training, we provide irregular and segmentation masks (download) with different masking rates. And you should define the mask file list before the training (flist_example.txt).

Training

The TS-VQGAN is designed a little different from VQGAN, which can trained with less GPU memory with the same performance.

  1. Remove attentions.

  2. Using InstanceNorm instead of GroupNorm.

  3. Progressive channel widths.

  4. Fewer parameters for the sketch vqgan.

  1. Generating XDoG sketches.
python generate_xdog_sketch.py --input_path <img fold> --output_path <output fold>
  1. Train sketch VQGAN.
python train_sketch_vqgan.py --path <model path> --config_path configs/vqgan_ffhq.yml --max_iters 150000 --learning_rate 1e-4 --gpu 0
  1. Train image TS-VQGAN.
python train_image_tsvqgan.py --path <model path> --config_path configs/vqgan_ffhq.yml --max_iters 150000 --learning_rate 2e-4 --gpu 0
python train_image_tsvqgan.py --path <model path> --config_path configs/vqgan_ffhq.yml --max_iters 300000 --learning_rate 4e-5 --gpu 0 --finetune
  1. Train the iLAT.
python train_transformer.py --path <model path> --config_path configs/transformer_ffhq.yml \
                            --sketch_model_path <sketch-VQGAN path> \
                            --image_model_path <image-VQGAN path> \
                            --max_iters 300000 --learning_rate 5e-5 --gpu 3

Testing

See face_editing_demo.ipynb.

About

The Image Local Autoregressive Transformer (NIPS2021)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages