Skip to content

IAmigos/avatar-image-generator

Repository files navigation

Avatar Image Generator

Faces Domain

Generated Cartoons

Based on the paper XGAN: https://arxiv.org/abs/1711.05139

The problem

This repo aims to contribute to the daunting problem of generating a cartoon given the picture of a face.

This is an image-to-image translation problem, which involves many classic computer vision tasks, like style transfer, super-resolution, colorization and semnantic segmentation. Also, this is a many-to-many mapping, which means that for a given face there are multiple valid cartoons, and for a given cartoon there are multiple valid faces too.

Dataset

Faces dataset: we use the VggFace dataset (https://www.robots.ox.ac.uk/~vgg/data/vgg_face/) from the University of Oxford

Cartoon dataset: we use the CartoonSet dataset from Google (https://google.github.io/cartoonset/), both the versions of 10000 and 100000 items.

We filtered out the data just to keep realistic cartoons and faces images. This code is in scripts. To download the dataset:

  1. pip3 install gdown
  2. gdown https://drive.google.com/uc?id=1tfMW5vZ0aUFnl-fSYpWexoGRKGSQsStL
  3. unzip datasets.zip

Directory structure

config.json: contains the model configuration to train the model and deploy the app

weights: contains weights that we saved the last time we train the model.

├── app.py
├── avatar-image-generator-app
├── config.json
├── Dockerfile
├── images
│   ├── Cartoons_example.jpeg
│   └── Faces_example.jpeg
├── LICENSE
├── losses
│   └── __init__.py
├── models
│   ├── avatar_generator_model.py
│   ├── cdann.py
│   ├── decoder.py
│   ├── denoiser.py
│   ├── discriminator.py
│   ├── encoder.py
│   └── __init__.py
├── README.md
├── requirements.txt
├── scripts
│   ├── copyFiles.sh
│   ├── download_faces.py
│   ├── keepFiles.sh
│   ├── plot_utils.py
│   └── preprocessing_cartoons_data.py
├── sweeps
│   ├── sweep-bs-1.yaml
│   └── sweep-rs-1.yaml
├── train.py
├── utils
│   └── __init__.py
└── weights
    ├── c_dann.pth
    ├── d1.pth
    ├── d2.pth
    ├── denoiser.pth
    ├── disc1.pth
    ├── d_shared.pth
    ├── e1.pth
    ├── e2.pth
    └── e_shared.pth

The model

Our codebase is in Python3. We suggest creating a new virtual environment.

  • The required packages can be installed by running pip3 install -r requirements.txt
  • Update N_CUDA by running export N_CUDA=<gpu_number> if you want to specify the GPU to use

It is based on the XGAN paper omitting the Teacher Loss and adding an autoencoder in the end. The latter was trained to learn well only the representation of the cartoons as to "denoise" the spots and wrong colorisation from the face-to-cartoon outputs of the XGAN.

The model was trained using the hyperparameters located in config.json. Weights & Biases Sweep was used to find the best hyperparameters:

  1. Change root_path in config json. It specifies where is datasets which contains the datasets.
  2. Run wandb login 17d2772d85cbda79162bd975e45fdfbf3bb18911 to use wandb to get the report
  3. Run python3 train.py --wandb --run_name <RUN_NAME> --run_notes <RUN_NOTES> or python3 train.py --no-wandb
  4. To launch an agent with a sweep configuration of wandb in bg from ssh nohup wandb agent --count <RUN_NUMBERS> stevramos/avatar_image_generator/<SWEEP_ID> &

You can see the Weights & Biases report here: https://wandb.ai/stevramos/avatar_image_generator

This is the implementation of our project created for the Made With ML Data Science Incubator (deprecated).

Docker

  1. Build the container: sudo docker build -f Dockerfile -t avatar-image-generator .
    • Run the container: sudo docker run -ti avatar-image-generator /bin/bash

    • Train the model:

      a. Create the folder: mkdir weights_trained

      b. Change the absolute path from which mount the volume. This is for both weights_trained and datasets. In this case:

      sudo docker run -v <WEIGHTS_TRAINED>:/src/weights_trained/ -v <DATASETS>:/src/datasets/ -ti avatar-image-generator /bin/bash -c "cd src/ && source activate ml && wandb login 17d2772d85cbda79162bd975e45fdfbf3bb18911 && python train.py --wandb --run_name <RUN_NAME> --run_notes <RUN_NOTES>"
      
    • Run the app locally as a daemon in docker. model_path in config.json contains the weights to use in the app sudo docker run -d -p 8000:9999 -ti avatar-image-generator /bin/bash -c "cd src/ && source activate ml && python app.py"

      a. Local server: http://0.0.0.0:8000/

About

We strive to transfer realistic features from photos of real faces to avatar styles. An end-to-end ML application using PyTorch, Weights & Biases, Flask API, Docker and ReactJS.

Topics

Resources

License

Stars

Watchers

Forks

Languages