Skip to content

comeeasy/DALS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Abstract

We propose a framework that synthesizes artistic landscape sketches using a diffusion model-based approach. Furthermore, we suggest a three-channel perspective map (3CPM) that mimics the artistic skill used by real artists. We employ Stable Diffusion, which leads us to use ControlNet to process 3CPM in Stable Diffusion. Additionally, we adopt the Low Rank Adaptation (LoRA) method to fine-tune our framework, thereby enhancing the quality of sketch and resolving the color-remaining problem, which is a frequently observed artifact in the sketch images using diffusion models. We implement a bimodal sketch generation interface: text to sketch and image to sketch. In producing a sketch, a guide token is used so that our method synthesizes an artistic sketch in both cases. Finally, we evaluate our framework using quantitative and quantitative schemes. Various sketch images synthesized by our framework demonstrate the excellence of our study.

How to run demo

Step 1: Install AUTOMATIC1111 webui

git clone https://github.com/AUTOMATIC1111/stable-diffusion-webui.git

Step 2: Install ControlNet Extention into webui**

you can follow instructions here.

Step 3: Download weights

Step 4: Locate each weight into

  • Stable Diffusion weight: <git repo path>/stable-diffusion-webui/models/Stable-diffusion
  • LoRA weight: <git repo path>/stable-diffusion-webui/models/Lora
  • ControlNet (3CPM) weight: <git repo path>/stable-diffusion-webui/models/ControlNet

Step 5: Run webui as a api server

cd stable-diffusion-webui
bash webui.sh --api

Step 6: Run user interface for DALS

cd <git repo>
python main.py

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages