Diffusart - Official implementation

Diffusart: Enhancing Line Art Colorization with Conditional Diffusion Models
Hernan Carrillo, Michaël Clément, Aurélie Bugeau, Edgar Simo-Serra.
EEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
[Paper]

Reference

Citation:

@InProceedings{Carrillo_2023_CVPR,
    author    = {Carrillo, Hernan and Cl\'ement, Micha\"el and Bugeau, Aur\'elie and Simo-Serra, Edgar},
    title     = {Diffusart: Enhancing Line Art Colorization With Conditional Diffusion Models},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
    month     = {June},
    year      = {2023},
    pages     = {3486-3490}
}

Requirements

python==3.8
pytorch==2.1.2
torchvision==0.16.2

conda create -n diffusart python=3.8
conda activate diffusart
pip install -r requirements.txt

Pretrained Model

We uploaded the pre-trained model to Google drive.

Training the model [TO DO]

Working..

Test

python test.py --sketch_dir ./samples/sketch/ --scrib_dir ./samples/scrib/ --out_dir ./samples/results/ --model_path ./checkpoint/diffusart_v1.pth

where --sketch_dir and --scrib_dir are directories that contains the line art and color scribbles images. Both images need to have the same name (i.e., ./samples/sketch/1118324.jpg ./samples/scrib/1118324.png).

Note: The color scribbles are 4 dimension images [R,G,B,mask]

Abstract

Colorization of line art drawings is an important task in illustration and animation workflows. However, this highly laborious process is mainly done manually, limiting the creative productivity. This paper presents a novel interactive approach for line art colorization using conditional Diffusion Probabilistic Models (DPMs). In our proposed approach, the user provides initial color strokes for colorizing the line art. The strokes are then integrated into the conditional DPM-based colorization process by means of a coupled implicit and explicit conditioning strategy to generates diverse and high-quality colorized images. We evaluate our proposal and show it outperforms existing state-of-the-art approaches using the FID, LPIPS and SSIM metrics.

Diffusart Framework

Overview of our proposed user-guided line art colorization. The framework is composed of two main components: a denoising model εθ , which learns to generate a denoised image, and an application-specific encoder gθ for extracting user color scribbles information.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
__pycache__		__pycache__
data		data
diagrams_img		diagrams_img
models		models
samples		samples
testing		testing
training		training
README.md		README.md
requirements.txt		requirements.txt
test.py		test.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pycache

pycache

data

data

diagrams_img

diagrams_img

models

models

samples

samples

testing

testing

training

training

README.md

README.md

requirements.txt

requirements.txt

test.py

test.py

utils.py

utils.py

Repository files navigation

Diffusart - Official implementation

Reference

Requirements

Pretrained Model

Training the model [TO DO]

Test

Abstract

Diffusart Framework

About

Releases

Packages

Languages

hernan0930/Diffusart-CVPRW

Folders and files

Latest commit

History

Repository files navigation

Diffusart - Official implementation

Reference

Requirements

Pretrained Model

Training the model [TO DO]

Test

Abstract

Diffusart Framework

About

Topics

Resources

Stars

Watchers

Forks

Languages