# FVAE-LoRA

Official implementation of FVAE-LoRA, introduced in our NeurIPS 2025 paper: "Latent Space Factorization in LoRA".

FVAE-LoRA uses a Variational Autoencoder (VAE) to split the LoRA latent space into two parts:

- 🎯 Task-salient features: Dedicated to your specific downstream task.
- 🌪️ Residual information: Captures the remaining variance.

The result? Better performance across text, audio, and image tasks compared to standard LoRA. 🚀
- [Overview](#overview)
- [Quick Start](#quick-start)
- [Installation](#installation)
- [Image Classification Experiments](#image-classification-experiments)
- [Repository Structure](#repository-structure)
- [PEFT Library Modifications](#peft-library-modifications)
- [Citation](#citation)
- [Contact](#contact)
- [License](#license)
## Overview

FVAE-LoRA is a Parameter-Efficient Fine-Tuning (PEFT) method that enhances LoRA's expressiveness through latent space factorization. This repository includes:

- 🛠️ Modified 🤗 PEFT Library: An extended version of Hugging Face PEFT with built-in FVAE-LoRA support.
- 🖼️ Image Classification Suite: Everything you need to reproduce our results on ViT.
## Quick Start

FVAE-LoRA is designed as a drop-in replacement for standard PEFT methods. If you know how to use the Hugging Face PEFT API, you already know how to use FVAE-LoRA.
```python
from peft import FVAEPEFTConfig, get_peft_model
from transformers import AutoConfig, AutoModelForImageClassification

# 1. Define your FVAE-LoRA config
fvae_peft_config = FVAEPEFTConfig(
    peft_type="FVAE_PEFT",
    # Latent space
    latent_dim=16,                        # dimensionality of the factorized latent space
    latent_fusion="concat",               # fuse the two latents by concatenation
    # Encoder
    enc_num_of_layer=1,
    enc_hidden_layer=16,
    enc_dropout=0.1,
    encoder_use_common_hidden_layer=True,
    # Decoder
    dec_num_of_layer=3,
    dec_hidden_layer=128,
    # z2 prior and regularization
    z2_latent_mean=1.5,
    z2_latent_std=1,
    z1z2_orthogonal_reg=0,
    # Loss weights (see the lambda tuning notes below)
    lambda_downstream=1000,
    lambda_reconstruction=1,
    lambda_z2_l2=1,
    lambda_z1_l2=1,
    lambda_kl_z1=1,
    # Standard PEFT options
    target_modules=["query", "value"],
    modules_to_save=["classifier"],
)
# 2. Load any model supported by HF Transformers and PEFT
num_labels = 42
model_name_or_path = "google/vit-base-patch16-224-in21k"
config = AutoConfig.from_pretrained(
    model_name_or_path,
    num_labels=num_labels,
)
model = AutoModelForImageClassification.from_pretrained(
    model_name_or_path,
    config=config,
)
# 3. Convert to FVAE-LoRA 💪
model = get_peft_model(model, fvae_peft_config)
model.print_trainable_parameters()
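# 4. Train as usual! For example, with the 🤗 Trainer — a minimal sketch;
# `train_ds` and the TrainingArguments below are illustrative assumptions,
# not part of this repo:
#
#   from transformers import Trainer, TrainingArguments
#   trainer = Trainer(
#       model=model,
#       args=TrainingArguments(output_dir="out"),
#       train_dataset=train_ds,
#   )
#   trainer.train()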
```

## Installation

```bash
git clone https://github.com/idiap/FVAE-LoRA.git
cd FVAE-LoRA
conda env create -f env.yaml
conda activate fvae-lora
pip install -r requirements.txt
```

You must install the local version of PEFT included in this repo:

```bash
pip install -e ./peft
```

Update `path_constants.py` with your local directories.
💡 Tip: This is required for reproducing the paper's image experiments but optional for the custom usage described in [Quick Start](#quick-start).
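For reference, a minimal sketch of what `path_constants.py` might contain. Only `LARGE_MODELS_PATH` is referenced in the steps below; the other names are illustrative assumptions, not necessarily the repo's actual constants:

```python
# path_constants.py — a sketch; point these at your local directories.
LARGE_MODELS_PATH = "/path/to/large_models"  # pretrained backbones (used below)
DATA_PATH = "/path/to/datasets"              # hypothetical: dataset cache root
EXP_PATH = "exp"                             # hypothetical: experiment outputs
```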
## Image Classification Experiments

To run the image experiments:

- Download the ViT model from `google/vit-base-patch16-224-in21k` (Hugging Face).
- Inside your `LARGE_MODELS_PATH` directory, create a folder named `vit-base-patch16-224-in21k`.
- Place the downloaded model files inside that folder (one way to do this is sketched after this list).
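One way to fetch the checkpoint into that layout is with `huggingface_hub` (a sketch; it assumes `LARGE_MODELS_PATH` is importable from `path_constants.py`):

```python
from pathlib import Path

from huggingface_hub import snapshot_download

from path_constants import LARGE_MODELS_PATH

# Download the ViT checkpoint into the folder layout expected by the scripts.
snapshot_download(
    repo_id="google/vit-base-patch16-224-in21k",
    local_dir=Path(LARGE_MODELS_PATH) / "vit-base-patch16-224-in21k",
)
```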
We provide scripts to replicate image classification results on multiple benchmark datasets.
The following datasets are supported (automatically downloaded from Hugging Face 🤗):
- DTD - `tanganke/dtd`
- EuroSAT - `tanganke/eurosat`
- GTSRB - `tanganke/gtsrb`
- RESISC45 - `tanganke/resisc45`
- SUN397 - `tanganke/sun397`
- SVHN - `ufldl-stanford/svhn`
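Since these are standard Hub datasets, you can sanity-check any of them with the `datasets` library (a quick preview, not a repo requirement):

```python
from datasets import load_dataset

# Quick sanity check: pull DTD from the Hub and inspect one example.
# The "train" split name is an assumption; adjust if the dataset differs.
dtd = load_dataset("tanganke/dtd", split="train")
print(dtd[0])
```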
Run the full suite across 3 seeds (1, 2, 42):
```bash
bash scripts/train_image_fvae_lora.sh
```

Important:

- SLURM: The scripts default to SLURM. If running locally, remove the submission commands from the `*.sh` files (see the sketch after this list).
- Project Name: Replace `<your-project>` in the scripts with your actual project name.
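If you strip the SLURM wrapper, the script body conceptually reduces to something like the sketch below. The `--seed` flag and the trailing arguments are assumptions about the script's internals, not documented options:

```bash
# Hypothetical local equivalent of the submission loop; flag names are
# assumptions — check scripts/train_image_fvae_lora.sh for the real ones.
for seed in 1 2 42; do
  python image_main.py --seed "$seed"  # ...plus the dataset/lambda flags from the script
done
```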
FVAE-LoRA uses several loss components controlled by lambda hyperparameters:

- `--fvae_lambda_downstream`: Weight for the downstream task loss (default: 1000)
- `--fvae_lambda_reconstruction`: Weight for the reconstruction loss (default: 1)
- `--fvae_lambda_kl_z1`: Weight for the KL divergence on z1 (default: 1)
- `--fvae_lambda_z2_l2`: L2 regularization on z2 (default: 1)
- `--fvae_lambda_z1_l2`: L2 regularization on z1 (default: 1)
The secret sauce is in the lambda weights. For new tasks, in addition to the defaults, we recommend starting from these settings:

- `(1000, 0.1, 1, 1, 1)`
- `(1000, 0.1, 10, 1, 1)`
Refer to Section G in the paper's appendix for a detailed practical guide on tuning these values.
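For concreteness, here is how the first recommended setting would look in code, assuming the tuple order matches the flag list above (a sketch, not an officially tuned recipe):

```python
# (1000, 0.1, 1, 1, 1) expressed as FVAEPEFTConfig loss weights.
# Assumes the tuple follows the flag order listed above; other FVAE
# fields are left at their defaults for brevity.
fvae_peft_config = FVAEPEFTConfig(
    peft_type="FVAE_PEFT",
    lambda_downstream=1000,
    lambda_reconstruction=0.1,
    lambda_kl_z1=1,
    lambda_z2_l2=1,
    lambda_z1_l2=1,
    target_modules=["query", "value"],
)
```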
For comparison, scripts are provided for other PEFT methods:
```bash
# Standard LoRA
# Also supports pissa, rslora, dora, and olora:
# change fine_tuning_method="peft"  # peft, pissa, rslora, dora, olora — inside the bash script.
bash scripts/train_image_peft.sh

# Full fine-tuning
bash scripts/train_image_full_ft.sh
```

Aggregate your results into a clean summary:
```bash
python prepare_results_images.py \
    --max-depth 2 \
    --exp-base exp/exp_image/fvae_peft/vit-base-patch16-224-in21k/
```

Use `--max-depth 1` for experiments other than FVAE-LoRA.
## Repository Structure

```
.
├── peft/                       # 🛠️ Modified PEFT library (core logic)
├── scripts/                    # 📜 Bash scripts for training & baselines
│   ├── train_image_fvae_lora.sh  # FVAE-LoRA training
│   ├── train_image_peft.sh       # LoRA and variants training
│   └── train_image_full_ft.sh    # Full fine-tuning baseline
├── image_main.py               # 🚀 Main entry point for image experiments
├── image_model.py              # 🧩 Model wrapper with PEFT integration
├── image_datamodule.py         # 📚 PyTorch Lightning data module
├── prepare_results_images.py   # 📊 Results analysis script
├── path_constants.py           # ⚙️ Path configuration
├── requirements.txt            # Python dependencies
├── env.yaml                    # Conda environment specification
└── README.md                   # README
```
## PEFT Library Modifications

The included PEFT library is based on Hugging Face's PEFT with the following additions:

- `FVAEPEFTConfig`: Configuration class for FVAE-LoRA parameters
- FVAE-LoRA implementation with a factorized latent space
- Support for variational inference in the LoRA framework

See the [peft/](peft/) directory for the complete modified library.
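Because FVAE-LoRA plugs into the standard PEFT plumbing, the usual adapter save/load flow should apply. A sketch, reusing the Quick Start variables and assuming FVAE-LoRA adapters round-trip like any other PEFT adapter:

```python
from peft import PeftModel
from transformers import AutoModelForImageClassification

# Save only the adapter weights (standard PEFT behavior; assumes
# FVAE-LoRA supports the usual save/load round-trip).
model.save_pretrained("fvae_lora_adapter")

# Later: reload the adapter onto a freshly loaded base model.
base_model = AutoModelForImageClassification.from_pretrained(
    model_name_or_path, config=config
)
model = PeftModel.from_pretrained(base_model, "fvae_lora_adapter")
```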
## Citation

If you use this code or find our work helpful, please cite us:

```bibtex
@misc{kumar2025latentspacefactorizationlora,
      title={Latent Space Factorization in LoRA},
      author={Shashi Kumar and Yacouba Kaloga and John Mitros and Petr Motlicek and Ina Kodrasi},
      year={2025},
      eprint={2510.19640},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2510.19640},
}
```

## Contact

📧 Questions? Open an issue or reach out to:
- Shashi Kumar (shashi.kumar@idiap.ch)
- Yacouba Kaloga (yacouba.kaloga@idiap.ch)
## License

This project is released under the MIT License. See the LICENSES/MIT.txt file for details.

The modified PEFT library retains its original Apache 2.0 License; see peft/LICENSE.

Third-party dependencies retain their respective licenses.

Built with ❤️ at the Idiap Research Institute.