# PACER: Permutation-Aligned Consensus Expert Routing

A unified framework for base-free, interference-aware model merging in Large Language Models and Vision Transformers.
## Features

- **No Base Model Required** - Synthesizes a Consensus Barycenter from the input models
- **Interference-Aware** - Dynamically decides between merging and MoE upcycling per layer
- **Smart Routing** - Zero-shot router using Subspace Projection Affinity (no training needed)
- **Vision Support** - Native ViT support with Visual Token Merging (ToMe)
- **Minimal Parameter Growth** - Only upcycles high-conflict layers to MoE
## Installation

```bash
git clone https://github.com/Akicuo/pacer.git
cd pacer
pip install -e .
```

Or install the dependencies manually:

```bash
pip install torch transformers safetensors accelerate
pip install -r requirements.txt
```

## Quick Start

```python
from pacerkit import PACERMerger

# Initialize merger with models
merger = PACERMerger([
    "fluently/FluentlyQwen3-Coder-4B-0909",
    "SamuelBang/AesCoder-4B"
])
# Run PACER merge pipeline
merged_model = merger.merge(
    interference_threshold=0.35,
    top_k_experts=2,
    output_path="./merged_model"
)
```

## CLI Usage

```bash
# Merge models using a config file
pacerkit merge --config configs/qwen_coder_merge.yaml

# Analyze interference between models
pacerkit analyze --models model1 model2 --output report.json
```

See notebooks/pacer_quickstart.ipynb for an interactive guide.
## Configuration

PacerKit uses YAML configuration files:

```yaml
project_name: "qwen-coder-merge"

models:
  - "fluently/FluentlyQwen3-Coder-4B-0909"
  - "SamuelBang/AesCoder-4B"

output:
  path: "./merged_model"
  save_format: "safetensors"

pacer:
  interference_threshold: 0.35
  top_k_experts: 2
  dropout_rate: 0.1
  anchor_strategy: "first"
  enable_moe_upcycle: true
```

See configs/ for more examples.
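The same config can also drive the Python API by hand. A minimal sketch, assuming `merge()` accepts every key under `pacer:` as a keyword argument (only `interference_threshold`, `top_k_experts`, and `output_path` are confirmed by the Quick Start above):

```python
# Sketch: load a PacerKit YAML config and call the Python API.
# Assumes merge() accepts the keys under `pacer:` as keyword arguments;
# only the three used in the Quick Start are confirmed.
import yaml
from pacerkit import PACERMerger

with open("configs/qwen_coder_merge.yaml") as f:
    cfg = yaml.safe_load(f)

merger = PACERMerger(cfg["models"])
merged_model = merger.merge(output_path=cfg["output"]["path"], **cfg["pacer"])
```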
## How It Works

PACER operates in three phases:
**Phase 1: Permutation Alignment.** Aligns the permutation symmetries of the N input models into a shared geometric basin using weight matching solved with the Hungarian algorithm.
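A toy sketch of one Git Re-Basin-style weight-matching step using `scipy.optimize.linear_sum_assignment` (the Hungarian solver); this is an illustration, not PACER's actual code:

```python
# Toy sketch of one weight-matching step (Git Re-Basin style), not
# PACER's actual code: align the output units of `w_b` to `w_a`.
import numpy as np
from scipy.optimize import linear_sum_assignment

def match_layer(w_a: np.ndarray, w_b: np.ndarray) -> np.ndarray:
    """Permutation of w_b's output units that best matches w_a."""
    similarity = w_a @ w_b.T                 # unit-by-unit similarity
    _, perm = linear_sum_assignment(similarity, maximize=True)
    return perm

rng = np.random.default_rng(0)
w_a = rng.normal(size=(8, 16))
w_b = w_a[rng.permutation(8)]                # same units, shuffled order
perm = match_layer(w_a, w_b)
assert np.allclose(w_b[perm], w_a)           # shuffled units recovered
# A real pipeline also applies `perm` to the next layer's input
# dimension so the aligned network stays functionally identical.
```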
**Phase 2: Consensus Barycenter.** Computes the Fréchet mean of the aligned models to create a synthetic "base model", then calculates each model's deviation vector from that barycenter.
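Because alignment happens in ordinary Euclidean weight space, the Fréchet mean reduces to an element-wise average of the aligned weights. A minimal sketch (the helper below is hypothetical, not part of the pacerkit API):

```python
# Hypothetical helper, not pacerkit API: barycenter and deviation
# vectors over a list of permutation-aligned state_dicts.
import torch

def consensus_barycenter(aligned_state_dicts):
    keys = aligned_state_dicts[0].keys()
    # Euclidean Fréchet mean = element-wise average of aligned weights.
    barycenter = {k: torch.stack([sd[k] for sd in aligned_state_dicts]).mean(0)
                  for k in keys}
    # Deviation vector of each model from the synthetic base.
    deviations = [{k: sd[k] - barycenter[k] for k in keys}
                  for sd in aligned_state_dicts]
    return barycenter, deviations
```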
**Phase 3: Interference-Aware Routing.** Each layer is scored for conflict between the deviation vectors and handled accordingly (a decision sketch follows the list):

- Low-interference layers → DARE-TIES merge (0% parameter increase)
- High-interference layers → MoE upcycling with zero-shot routing
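One plausible interference score is the TIES-style sign-conflict mass between deviation vectors. The sketch below illustrates the per-layer decision under that assumption; it is not PACER's documented metric (see Methodology for the real definition):

```python
# Hypothetical sketch of the per-layer routing decision. The
# interference score here (TIES-style sign-conflict mass between two
# deviation vectors) is an assumption, not PACER's documented metric.
import torch

def layer_interference(dev_a: torch.Tensor, dev_b: torch.Tensor) -> float:
    conflict = (torch.sign(dev_a) * torch.sign(dev_b)) < 0   # opposing updates
    mass = dev_a.abs() + dev_b.abs()
    return (mass[conflict].sum() / mass.sum().clamp_min(1e-12)).item()

def route_layer(dev_a, dev_b, interference_threshold: float = 0.35) -> str:
    # Low conflict: fold both deviations into one dense layer (DARE-TIES).
    # High conflict: keep deviations as separate experts (MoE upcycling).
    score = layer_interference(dev_a, dev_b)
    return "dare_ties" if score < interference_threshold else "moe_upcycle"
```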
## Efficiency

Parameter counts are relative to a single input model; the "4x" column assumes four models are merged:

| Metric | Dense Ensemble (4x) | Standard MoE | PACER |
|---|---|---|---|
| Total Params | 400% | 400% | ~136% |
| Active Params | 400% | 100% | ~100% |
| Interference | None | Low | None |
## Documentation

- Methodology - Full technical details
- Configuration Reference - All config options
- API Reference - Python API documentation
## Contributing

Contributions are welcome! Please feel free to submit a Pull Request.
## License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.
## Acknowledgements

Built on research from:
- Git Re-Basin (Ainsworth et al.)
- TIES-Merging (Yadav et al.)
- Token Merging (Bolya et al.)
- MergeME (Model Merging for MoEs)