Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)
-
Updated
Jul 25, 2023 - Python
Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)
Merge transformers without using like a bajillion GB of RAM
ZhiJian: A Unifying and Rapidly Deployable Toolbox for Pre-trained Model Reuse
Advanced Transfer Learning project with the purpose of obtaining the best model by mixing three twice-fine-tuned models.
All-in-one UI for merged LLMs in Hugging Face
An easy-to-use Python library for merging PyTorch models.
DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling
[ECCV 2024] MagMax: Leveraging Model Merging for Seamless Continual Learning (official repository)
Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic
Mergecraft is a simple library to streamline model merging operations, with seamless integration with HuggingFace🤗
flow-merge is a powerful Python library that enables seamless merging of multiple transformer-based language models using the most popular merge methods such as model soups, SLERP, ties-MERGING or DARE.
The code used in the paper "DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging"
Representation Surgery for Multi-Task Model Merging. ICML, 2024.
SurgeryV2: Bridging the Gap Between Model Merging and Multi-Task Learning with Deep Representation Surgery. Arxiv, 2024.
Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
This repository serves as a template for creating new projects based on FusionBench. It includes all the necessary configurations and boilerplate code to get started quickly.
A model merging project for generalizing Featured Finite State Machines (FFSMs) to unify behaviors across Software Product Lines (SPLs)
[ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models
Exploring Model Kinship for Merging Large Language Models
Add a description, image, and links to the model-merging topic page so that developers can more easily learn about it.
To associate your repository with the model-merging topic, visit your repo's landing page and select "manage topics."