This repo is essentially "An Introduction to Diffusion Models",
BUT WITH KAN! (and without, for comparison)
Because KAN makes everything better! (And indeed it seems to in our diffusion case: skip to the bottom, or straight to the notebook.)
Akshually, it's simply an experiment on using an MLP to approximate the noise; the choice of diffusion model doesn't really matter, as long as it predicts the needed functions.
Since we cannot afford a zillion GPUs and the entire Laion XB dataset, let's start small and use the swiss roll dataset from the very first diffusion paper (Sohl-Dickstein et al., 2015).
We corrupt it with Gaussian noise.
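Under the hood this is just the standard DDPM forward process applied to 2D points. A minimal sketch (function names and the noise schedule here are illustrative, not the repo's exact code):

```python
import numpy as np
from sklearn.datasets import make_swiss_roll

def sample_swiss_roll(n=1000):
    # make_swiss_roll returns 3D points; the classic diffusion toy keeps the 2D roll
    points, _ = make_swiss_roll(n_samples=n)
    return (points[:, [0, 2]] / 10.0).astype(np.float32)  # x/z plane, rescaled

def noise_step(x0, t, betas):
    # closed form of the forward process:
    # x_t = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * eps
    alpha_bar = np.cumprod(1.0 - betas)[t]
    eps = np.random.randn(*x0.shape).astype(np.float32)
    return np.sqrt(alpha_bar) * x0 + np.sqrt(1.0 - alpha_bar) * eps, eps
```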
Our KAN-based model is the MLP with every linear layer replaced by a KAN layer.
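A sketch of what that can look like, assuming the pykan package (`pip install pykan`); the notebook may use a different KAN implementation, and the widths, grid size, and spline order below are illustrative:

```python
from kan import KAN  # pykan: width = layer sizes, grid = spline grid, k = spline order

# input: a noised point plus a normalized timestep (x, y, t/T);
# output: the predicted noise (eps_x, eps_y)
kan_model = KAN(width=[3, 16, 16, 2], grid=5, k=3)
```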
We train it to restore the noised samples.
And quite successfully!
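One common way to train such a denoiser is the DDPM epsilon-prediction objective: predict the injected noise and minimize the MSE. A minimal training-loop sketch (hyperparameters here are illustrative, not the repo's exact ones):

```python
import torch
import torch.nn.functional as F

T = 100
betas = torch.linspace(1e-4, 0.02, T)
alpha_bars = torch.cumprod(1.0 - betas, dim=0)

def train(model, data, steps=2000, lr=1e-3):
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    x0 = torch.as_tensor(data)
    losses = []
    for _ in range(steps):
        t = torch.randint(0, T, (x0.shape[0],))
        ab = alpha_bars[t].unsqueeze(1)
        eps = torch.randn_like(x0)
        xt = ab.sqrt() * x0 + (1 - ab).sqrt() * eps                 # forward noising
        inp = torch.cat([xt, (t.float() / T).unsqueeze(1)], dim=1)  # (x, y, t/T)
        loss = F.mse_loss(model(inp), eps)                          # predict the noise
        opt.zero_grad()
        loss.backward()
        opt.step()
        losses.append(loss.item())
    return losses
```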
Now let's train an MLP with the same (multilayered) structure.
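Under the same assumptions as the KAN sketch above, the MLP counterpart just swaps the KAN layers back for Linear + activation:

```python
import torch.nn as nn

mlp_model = nn.Sequential(
    nn.Linear(3, 16), nn.ReLU(),
    nn.Linear(16, 16), nn.ReLU(),
    nn.Linear(16, 2),
)
```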
But the loss and the results look slightly worse!
Note that the parameter counts of the structures above differ!
The KAN has 22,080 parameters.
The MLP has only 2,306.
We need to even them out (as we did with GPT) before jumping to conclusions.
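Counting is the easy part; a generic PyTorch snippet works for both models:

```python
def n_params(model):
    return sum(p.numel() for p in model.parameters() if p.requires_grad)

print(f"KAN: {n_params(kan_model)}, MLP: {n_params(mlp_model)}")
```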
To balance its parameter count against the MLP, we cross out all the inner layers, leaving only two KAN layers, similar to the models in the KAN paper.
This model has only 1,600 parameters, fewer than the MLP!
The results look slightly worse, but it's just two layers and should be more interpretable! (TODO: me, or delegate to pykan)
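Under the pykan assumptions above, the slimmed-down model is just two KAN layers around a single hidden width (the exact width here is illustrative):

```python
small_kan = KAN(width=[3, 8, 2], grid=5, k=3)  # two KAN layers, no inner ones
```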
And now, finally, a quantitative metric: the loss curves!
As we can see, KANs fare as well as or better than comparable MLP structures at predicting the denoising.
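Producing such curves with the sketches above (the `train` sketch returns per-step losses):

```python
import matplotlib.pyplot as plt

data = sample_swiss_roll(1000)
kan_losses = train(kan_model, data)
mlp_losses = train(mlp_model, data)

plt.plot(kan_losses, label="KAN")
plt.plot(mlp_losses, label="MLP")
plt.yscale("log")
plt.xlabel("step")
plt.ylabel("MSE loss")
plt.legend()
plt.show()
```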
To play around, use the kan-diffusion.ipynb notebook (Colab-launchable, as always). All training runs, even the KANs', take no more than 2 minutes on my PC.
Will KANs assist us in prompting anime girls by Greg Rutkowski? Only time will tell...