Stable Diffusion-NCNN

Stable Diffusion implemented in C++ with the ncnn framework

Zhihu: https://zhuanlan.zhihu.com/p/582552276

Video: https://www.bilibili.com/video/BV15g411x7Hc

Performance (time per iteration and RAM)

Mode (per iteration)	i7-12700 (512x512)	i7-12700 (256x256)	Snapdragon 865 (256x256)
slow	4.85s/5.24G(7.07G)	1.05s/3.58G(4.02G)	1.6s/2.2G(2.6G)
fast	2.85s/9.47G(11.29G)	0.65s/5.76G(6.20G)

News

2023-01-19: speed up & less ram in x86, dynamic shape in x86

2023-01-12: update to the latest ncnn code and use the optimized model, update Android, add memory monitor

2023-01-05: add 256x256 model to x86 project

2023-01-04: merge and finish the mha op in x86, enable fast gelu

Demo

Out of box

All models and the exe file can be downloaded from Baidu Netdisk (百度网盘) or Google Drive

x86 Windows

enter the exe folder
download the three bin files: AutoencoderKL-fp16.bin, FrozenCLIPEmbedder-fp16.bin, UNetModel-MHA-fp16.bin, and put them in the assets folder
set up your config in magic.txt; the lines are, in order:
1. resolution (only 256 and 512 are supported)
2. speed mode (0 for slow but low RAM, 1 for fast but high RAM)
3. number of steps (15 is not bad)
4. seed (set 0 for a random seed)
5. positive prompt
6. negative prompt
run stable-diffusion.exe
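
The six-line layout of magic.txt described above can be generated and sanity-checked with a small script. This is only a sketch: it assumes each value sits alone on its own line with no labels, and the helper names `build_magic_config` and `write_magic_config` are made up for illustration and are not part of the project.

```python
from pathlib import Path


def build_magic_config(resolution, speed_mode, steps, seed, positive, negative):
    """Return the text of a magic.txt file: six lines in the documented order.

    Assumption: one bare value per line, no keys or comments.
    """
    if resolution not in (256, 512):
        raise ValueError("resolution must be 256 or 512")
    if speed_mode not in (0, 1):
        raise ValueError("speed mode must be 0 (slow, low RAM) or 1 (fast, high RAM)")
    if steps < 1:
        raise ValueError("number of steps must be positive")
    for prompt in (positive, negative):
        if "\n" in prompt:
            raise ValueError("each prompt must fit on a single line")
    lines = [str(resolution), str(speed_mode), str(steps), str(seed), positive, negative]
    return "\n".join(lines) + "\n"


def write_magic_config(path, **settings):
    """Write the config next to the executable (hypothetical helper)."""
    Path(path).write_text(build_magic_config(**settings), encoding="utf-8")
```

For example, `build_magic_config(512, 0, 15, 0, "a cat", "blurry")` yields a file that asks for a 512x512 image in slow mode with 15 steps and a random seed.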

x86 Linux / MacOS

build and install ncnn
build the demo with CMake

cd x86/linux
mkdir -p build && cd build
cmake ..
make -j$(nproc)

download the three bin files: AutoencoderKL-fp16.bin, FrozenCLIPEmbedder-fp16.bin, UNetModel-MHA-fp16.bin, and put them in the build/assets folder
run the demo

./stable-diffusion-ncnn

android apk

download and install the apk from the link
at the top, the first input is the step count and the second is the seed
at the bottom, the upper input is the positive prompt and the lower one is the negative prompt (leave them empty to use the default prompt)
note: the apk needs 7 GB of RAM, runs very slowly, and consumes a lot of power

Implementation Details

Note: Please comply with the requirements of the SD model and do not use it for illegal purposes

Three main steps of Stable Diffusion:
1. CLIP: text embedding
2. iterative sampling with the sampler
3. decoding the sampler result to obtain the output image
Model details:
1. Weights: Naifu (you know where to find them)
2. Sampler: Euler ancestral (k-diffusion version)
3. Resolution: dynamic shape, but must be a multiple of 128 and at least 256
4. Denoiser: CFGDenoiser, CompVisDenoiser
5. Prompt: positive and negative, both supported :)
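
To make the sampler and denoiser entries above more concrete, here is a minimal pure-Python sketch of classifier-free guidance combined with an Euler ancestral loop in the style of k-diffusion. It is not this project's C++ code: latents are flat lists of floats, `denoise` is a stand-in for the UNet wrapper, and all function names are chosen for illustration.

```python
import math
import random


def cfg_combine(uncond, cond, scale):
    """Classifier-free guidance: start at the negative-prompt prediction
    and move toward the positive-prompt prediction by `scale`."""
    return [u + (c - u) * scale for u, c in zip(uncond, cond)]


def ancestral_step_sizes(sigma_from, sigma_to, eta=1.0):
    """Split the move sigma_from -> sigma_to into a deterministic target
    (sigma_down) and the amount of fresh noise to add back (sigma_up)."""
    if sigma_to == 0.0:
        return 0.0, 0.0
    variance = sigma_to**2 * (sigma_from**2 - sigma_to**2) / sigma_from**2
    sigma_up = min(sigma_to, eta * math.sqrt(variance))
    sigma_down = math.sqrt(sigma_to**2 - sigma_up**2)
    return sigma_down, sigma_up


def sample_euler_ancestral(denoise, x, sigmas, rng):
    """denoise(x, sigma) returns the predicted clean sample.
    sigmas must be decreasing and end with 0."""
    for sigma, sigma_next in zip(sigmas[:-1], sigmas[1:]):
        denoised = denoise(x, sigma)
        sigma_down, sigma_up = ancestral_step_sizes(sigma, sigma_next)
        dt = sigma_down - sigma
        # Euler step along d = (x - denoised) / sigma.
        x = [xi + (xi - di) / sigma * dt for xi, di in zip(x, denoised)]
        if sigma_next > 0.0:
            # Ancestral part: re-inject Gaussian noise.
            x = [xi + rng.gauss(0.0, 1.0) * sigma_up for xi in x]
    return x
```

In a real pipeline, `denoise` would run the UNet twice per step (once with the negative-prompt embedding, once with the positive one) and merge the two results with `cfg_combine`.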

Code Details

Compile for x86 Windows

download the three bin files: AutoencoderKL-fp16.bin, FrozenCLIPEmbedder-fp16.bin, UNetModel-MHA-fp16.bin, and put them in the assets folder
open the VS2019 project and build the Release x64 configuration

Compile for x86 Linux / MacOS

build and install ncnn
build the demo with CMake

cd x86/linux
mkdir -p build && cd build
cmake ..
make -j$(nproc)

download the three bin files: AutoencoderKL-fp16.bin, FrozenCLIPEmbedder-fp16.bin, UNetModel-MHA-fp16.bin, and put them in the build/assets folder
run the demo

./stable-diffusion-ncnn

Compile for android

download the three bin files: AutoencoderKL-fp16.bin, FrozenCLIPEmbedder-fp16.bin, UNetModel-MHA-fp16.bin, and put them in the assets folder
open the project in Android Studio and run it

ONNX Model

I've uploaded the three ONNX models used by Stable Diffusion so that you can do some interesting work with them.

You can find them at the link above.

Statements

Please abide by the agreement of the Stable Diffusion model, and DO NOT use it for illegal purposes!
If you use these ONNX models in an open source project, please let me know; I'll follow it and look forward to your next great work :)

Instructions

FrozenCLIPEmbedder

ncnn (input & output): token, multiplier, cond, conds
onnx (input & output): onnx::Reshape_0, 2271

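# run the ONNX text encoder on the token ids
z = onnx(onnx::Reshape_0=token)
# apply the multiplier, then rescale so the overall mean is unchanged
origin_mean = z.mean()
z *= multiplier
new_mean = z.mean()
z *= origin_mean / new_mean
# append the result to the embeddings accumulated so far
conds = torch.concat([cond,z], dim=-2)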
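
The reweighting in the pseudocode above can be tried without torch or ONNX. The following sketch works on nested lists and assumes, as an illustration only, that `z` has one row per token and that `multiplier` holds one weight per token; the function names are hypothetical.

```python
def reweight_embeddings(z, multiplier):
    """Scale each token row by its weight, then rescale the whole tensor
    so its overall mean matches the mean before weighting.

    Assumed layout: z is [tokens][dim], multiplier is [tokens].
    Breaks down if the weighted mean is exactly zero.
    """
    count = sum(len(row) for row in z)
    origin_mean = sum(sum(row) for row in z) / count
    scaled = [[v * m for v in row] for row, m in zip(z, multiplier)]
    new_mean = sum(sum(row) for row in scaled) / count
    factor = origin_mean / new_mean
    return [[v * factor for v in row] for row in scaled]


def append_chunk(cond, z):
    """Concatenate along the token axis, like torch.concat(..., dim=-2)
    on a [tokens, dim] tensor."""
    return cond + z
```

With `z = [[1.0, 3.0], [2.0, 2.0]]` and `multiplier = [1.0, 2.0]`, the second token ends up larger relative to the first, while the mean of all entries stays at 2.0.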

UNetModel

ncnn (input & output): in0, in1, in2, c_in, c_out, outout
onnx (input & output): x, t, cc, out

outout = in0 + onnx(x=in0 * c_in, t=in1, cc=in2) * c_out
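
The one-line formula above has the same shape as the eps-prediction wrapper in k-diffusion's CompVisDenoiser. The sketch below assumes, without confirmation from this README, that `c_in` and `c_out` are computed the way k-diffusion does for that denoiser (`c_in = 1 / sqrt(sigma^2 + 1)`, `c_out = -sigma`); `eps_model` is a stand-in for the ONNX UNet and latents are flat lists.

```python
import math


def compvis_scalings(sigma):
    """Input/output scalings as in k-diffusion's eps-prediction denoiser
    (assumed here; the README does not say how c_in and c_out are derived)."""
    c_in = 1.0 / math.sqrt(sigma * sigma + 1.0)
    c_out = -sigma
    return c_in, c_out


def denoise(eps_model, in0, in1, in2, c_in, c_out):
    """outout = in0 + eps_model(in0 * c_in, in1, in2) * c_out"""
    scaled = [v * c_in for v in in0]
    eps = eps_model(scaled, in1, in2)
    return [v + e * c_out for v, e in zip(in0, eps)]
```

If the model predicted the noise perfectly, `denoise` would return the clean latent: the noisy input is `x0 + sigma * noise`, and adding `noise * (-sigma)` cancels the noise term.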

nihui/Stable-Diffusion-NCNN
