Combine any two models using a Super Mario merge(DARE) as described in the linked whitepaper.
Combine capabilities from multiple models. Works with:
- Stable Diffusion (1.5, XL/XL Turbo)
- LLMs(Mistral, Llama, etc)
- LoRas(must be same size)
- Any two homologous models
Model | Description | Image(same seed) |
---|---|---|
sd_xl_turbo | Attempting 1024 | |
sdxl base 1.0 | Attempting to use SDTurboScheduler | |
merged | Mario merged(DARE) |
python3 merge.py -p [weight drop probability] -lambda [scaling factor] [base_safetensors_model_file_or_folder] [model_to_merge] [output_path]
python3 merge.py -p 0.13 -lambda 3.0 sdxl_base.safetensors sd_xl_turbo_1.0_fp16.safetensors sdxl_merged.safetensors
Note: This also works with arguments reversed.
- https://huggingface.co/martyn/sdxl-turbo-mario-merge - SD Turbo XL merged with SDXL Base
- https://civitai.com/models/215796 - Top Rated - TurboXL+LCM - Super Mario Merge
- https://huggingface.co/martyn/llama-megamerge-dare-13b - A llama 13b mega merge created using
hf_merge.py
- https://huggingface.co/collections/martyn/dare-llm-merges-6581f52c0fb25b9aa26fb180 - A collection of LLM merges
- https://huggingface.co/collections/martyn/dare-diffusion-merges-6581f5fab1c4ab777ad43cf7 - A collection of text-to-image diffusion model merges
- Dec 27 2023: Add support for using files, folders, or hf repos in the hf_merge.py merge list.
- Dec 27 2023: Added mergekit-compatible yaml support for
hf_merge.py
. Always runs dare and ignores options outside model specification. weight is p and density is 1/λ. - Dec 12 2023: Added
hf_merge.py
for merging hf repos. - Dec 12 2023: Added support for folders. You can now merge LLMs(mistral, llama, etc) and other large models. Folders with
.bin
files are supported - the first specified model in the cli must be in.safetensors
format. - Nov 28 2023: Initial release supporting stable diffusion.
- https://arxiv.org/pdf/2311.03099.pdf - Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch
- https://github.com/yule-BUAA/MergeLM - GitHub for the linked whitepaper
- https://stability.ai/research/adversarial-diffusion-distillation - SXDL Turbo - Adversarial Diffusion Distillation