Real-Time High-Resolution Background Matting

This is a reimplementation of the work by the same name, Real-Time High-Resolution Background Matting. The work borrows a significant bit of original code. At the same time, there are several changes, reflective of my own coding style. Please feel free to use, fork the code or leave comments if you find any inconsitency.

Environment

torch == 1.9.0
numpy == 1.20.3
kornia == 0.4.1
torchvision == 0.10.0

Installation Guideline

git clone git@github.com:Anuj040/matte.git [-b <branch_name>]
cd matte (Work Directory)

# local environment settings
pyenv local 3.8.10                                 
python -m pip install poetry
poetry config virtualenvs.create true --local
poetry config virtualenvs.in-project true --local

# In case older version of pip throws installation errors
poetry run python -m pip install --upgrade pip 

# local environment preparation
poetry install

Working with the code

Before running the code please make sure that your datasets follow the below directory hierarchy.

    matte {work_directory}
    ├── src
    ├── datasets                  
    │   ├──PhotoMatte85                                   
    │       ├──train                               # RGBA image files 
    │         ├──fgr+alpha_1.png
    │         ├──fgr+alpha_1.png
    │       ├──valid
    │         :
    │   ├──backgrounds
    │       ├──train                               
    │         ├──bg_image1.png                      # RGB image file
    │         ├──bg_image2.png
    │           :  
    │       ├──valid                                
    │         ├──bg_image3.png
    │         ├──bg_image4.png
    │           :      
    └── ...

Notes

Inputs to the model are foreground (RGBA-4 channel) and background images
Refer data_path.py to get a better idea of how to specify dataset paths

Training mode

All the code executions shall happen from work directory
There are three different model train modes (base, refine, gan). The former two are as in the original implementation. The last one is refine mode with additional discriminator assitance.
The code also includes functionality for a two step training (training base model first, and then finetuning it with training of refiner model). It can be assessed with --load_base flag. However, the path to the pretrained base model is hard coded as of now. Please change it as per your convenience.
In my limited set of experiments following the 2-step training strategy as described above leads to faster model convergence for refine model.

poetry run python src/run.py train --model_type=refine

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
.github/workflows		.github/workflows
.vscode		.vscode
src		src
.gitignore		.gitignore
.pylintrc		.pylintrc
.python-version		.python-version
Makefile		Makefile
README.md		README.md
__init__.py		__init__.py
data_path.py		data_path.py
poetry.toml		poetry.toml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Real-Time High-Resolution Background Matting

Environment

Installation Guideline

Working with the code

Notes

Training mode

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Real-Time High-Resolution Background Matting

Environment

Installation Guideline

Working with the code

Notes

Training mode

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages