Ethan Nguyen1*, Ipsita Ghosh2*, Christian Kümmerle3,
1Department of Computer Science, University of North Carolina at Charlotte,
2Department of Computer Science, University of Central Florida,
3School of Data, Mathematical and Statistical Sciences, Department of Computer Science, University of Central Florida
*Equal Contribution
Q3R is a novel regularization technique for training low-rank neural networks. It promotes low-rank structures in weight matrices during training and can be applied to standard linear layers as well as fused layers (like QKV projections in Transformers).
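To give a feel for the mechanism, the sketch below shows one way a quadratic reweighted rank surrogate can be formed. The weighting scheme, the `eps` smoothing, and the function name are illustrative assumptions rather than the library's internals; see the paper for the exact formulation.

```python
import torch

# Illustrative sketch (assumption, not the library's code): a quadratic
# surrogate for the rank of W. The weights come from the spectrum of a
# previous iterate and are detached, so between SVD refreshes the penalty
# is a plain weighted quadratic in W and cheap to differentiate.
def quadratic_reweighted_penalty(W, W_prev, eps=1e-3, lmbda=0.1):
    U, S, _ = torch.linalg.svd(W_prev.detach(), full_matrices=False)
    w = 1.0 / (S**2 + eps**2)      # small singular values get large weights
    H = U @ torch.diag(w) @ U.T    # reweighting operator from the old spectrum
    # trace(W^T H W) ~= sum_i sigma_i^2 / (sigma_prev_i^2 + eps^2), a smoothed rank
    return lmbda * torch.trace(W.T @ H @ W)
```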
Key Features:
- Rank regularization for weight matrices
- Fused layer support (e.g., Q, K, V slices)
- Distributed training (DDP) support
- Integrated AdamQ3R optimizer
Two ways to use Q3R:

Option 1: AdamQ3R (Recommended)

```python
from Functions.AdamQ3R import AdamQ3R
from main_helper import extract_linear

trainable_modules = extract_linear(model, config)
optimizer = AdamQ3R(model.parameters(),
                    trainable_modules=trainable_modules,
                    target_rank=0.2,
                    lmbda=0.1,
                    steps=5)
```
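Assuming AdamQ3R applies the regularization and periodic low-rank projection inside `step()` (an assumption consistent with it being described as an integrated optimizer), the training loop itself stays standard:

```python
# Sketch: the loop is unchanged from ordinary Adam training;
# the low-rank regularization happens inside optimizer.step().
for inputs, targets in train_loader:
    optimizer.zero_grad()
    loss = criterion(model(inputs), targets)
    loss.backward()
    optimizer.step()
```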
Option 2: Q3R Regularizer

```python
from Functions.Q3R import Q3R
from main_helper import extract_linear

trainable_modules = extract_linear(model, config)
q3r = Q3R(trainable_modules=trainable_modules,
          target_rank=0.2,
          lmbda=0.1,
          steps=5)

# In training loop
q3r.update()
total_loss = loss + q3r.val
total_loss.backward()
```
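For context, here is a fuller sketch of where these calls sit in a typical loop; `train_loader` and `criterion` are placeholders for your data pipeline and loss:

```python
for inputs, targets in train_loader:
    optimizer.zero_grad()
    loss = criterion(model(inputs), targets)
    q3r.update()                   # refresh the regularizer (SVD recomputed every `steps` calls)
    total_loss = loss + q3r.val    # add the Q3R penalty to the task loss
    total_loss.backward()
    optimizer.step()
```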
Installation:

Ensure you have CUDA installed. This project was tested with CUDA 12.6.

- Install PyTorch:

  ```bash
  pip install torch==2.6.0+cu126 torchvision==0.21.0+cu126 --extra-index-url https://download.pytorch.org/whl/cu126
  ```

- Install dependencies:

  ```bash
  pip install -r requirements.txt
  ```

- Verify the installation:

  ```bash
  python -c "import torch; print(torch.cuda.is_available())"
  ```
Basic AdamQ3R Training:

```bash
python main.py --dataset CIFAR10 --model VIT_Tiny --learning_rate 0.0004 --epoch 100 --technique AdamQ3R --lmbda 0.1 --target_rank 0.05 --target_modules qkv
```

LoRITa + Q3R:

```bash
python main.py --dataset CIFAR10 --model VIT_Tiny --learning_rate 0.00004 --epoch 100 --technique LoRITaQuaRS --depth_lorita=1 --weight_decay_alpha=0.1 --target_modules qkv --target_rank 16 --epsilon_schedule linear --N 46875
```

| Parameter | Type | Default / Example | Description |
|---|---|---|---|
| `lr` | float | 0.00004 | Base learning rate for the optimizer. |
| `trainable_modules` | dict | `extract_linear(model, config)` | Linear modules that will receive Q3R updates. |
| `target_rank` | float (0–1) | 0.2 | Fraction of singular values to retain for low-rank projection. |
| `lmbda` | float | 0.1 | Scaling factor for the Q3R regularization term. |
| `steps` | int | 5 | Update period for SVD calculations (higher = faster, less frequent). |
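As a worked example of the fractional `target_rank`, assuming it is applied to the smaller weight dimension (the exact rounding rule may differ), a hypothetical helper would resolve it to an integer rank like this:

```python
# Hypothetical helper (not part of the library): map a fractional
# target_rank to an integer rank for a weight of shape (out, in).
def resolve_rank(out_features, in_features, target_rank=0.2):
    return max(1, round(target_rank * min(out_features, in_features)))

print(resolve_rank(768, 768, 0.2))   # 0.2 * 768 = 153.6 -> 154
print(resolve_rank(768, 192, 0.05))  # 0.05 * 192 = 9.6  -> 10
```

Note that the LoRITa + Q3R quickstart above passes an integer (`--target_rank 16`), which suggests the CLI also accepts an absolute rank directly.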
Q3R supports fused modules where multiple linear projections are concatenated into a single weight matrix. Provide slice indices to regularize each component independently:
```python
# Fused QKV layer with output dimension 768 (256 for Q, 256 for K, 256 for V)
qkv_slices = [(0, 256), (256, 512), (512, 768)]

trainable_modules = {
    model.attention.qkv: qkv_slices,
    model.fc1: None,  # None means use the full weight matrix
}

optimizer = AdamQ3R(
    model.parameters(),
    trainable_modules=trainable_modules,
    target_rank=0.1,
    lmbda=0.1,
)
```

The gradients for each slice are computed independently and "stuffed" back into the full gradient tensor using `pad_tensor_with_slice_bounds`, ensuring correct regularization without physically splitting weights.
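To illustrate the "stuffing" step, here is a simplified reimplementation of what a helper like `pad_tensor_with_slice_bounds` plausibly does; the real signature and semantics may differ:

```python
import torch

# Simplified sketch: embed a gradient computed on one output slice of a fused
# weight into a zero tensor of the full weight's shape, so the slice-wise
# regularization gradients can be summed into one gradient for the fused layer.
def pad_grad_with_slice_bounds(slice_grad, full_shape, bounds):
    start, end = bounds
    full = torch.zeros(full_shape, dtype=slice_grad.dtype, device=slice_grad.device)
    full[start:end, :] = slice_grad  # nn.Linear weights are (out_features, in_features)
    return full

# e.g. the K slice (rows 256:512) of a fused QKV weight of shape (768, 192)
k_grad = torch.randn(256, 192)
fused_grad = pad_grad_with_slice_bounds(k_grad, (768, 192), (256, 512))
```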
Q3R automatically supports PyTorch DDP. Regularizers are distributed across ranks for efficient computation:
```python
# Standard DDP setup
model = torch.nn.parallel.DistributedDataParallel(model, ...)

# Q3R will automatically distribute work across GPUs
optimizer = AdamQ3R(model.parameters(), trainable_modules=trainable_modules, ...)
```
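A typical multi-GPU run would then be launched with `torchrun` (assuming `main.py` initializes the process group itself; the script flags below simply mirror the quickstart example):

```bash
# 4-GPU launch, one process per GPU
torchrun --nproc_per_node=4 main.py --dataset CIFAR10 --model VIT_Tiny \
    --learning_rate 0.0004 --epoch 100 --technique AdamQ3R --lmbda 0.1 \
    --target_rank 0.05 --target_modules qkv
```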
If you use Q3R in your research, please cite:

```bibtex
@article{nguyen2025q3r,
  title={Q3R: Quadratic Reweighted Rank Regularizer for Effective Low-Rank Training},
  author={Nguyen, Ethan and Ghosh, Ipsita and K{\"u}mmerle, Christian},
  journal={arXiv preprint arXiv:2511.04485},
  year={2025}
}
```