
Support Basis: Fast Attention Beyond Bounded Entries

Maryam Aliakbarpour · Vladimir Braverman · Junze Yin · Haochen Zhang
AISTATS 2026 Spotlight

📄 Paper: Support Basis: Fast Attention Beyond Bounded Entries (OpenReview: https://openreview.net/forum?id=IgpnZIGFsD)
🔗 Code: https://github.com/yinj66/support_basis


Overview

Softmax attention has quadratic complexity in the sequence length n, which limits the scalability of large language models (LLMs). Prior sub-quadratic algorithms rely on the bounded-entry assumption, which rarely holds in practice.

This repository implements Support-Basis Decomposition, a new framework that:

  • Removes the bounded-entry assumption
  • Achieves sub-quadratic runtime
  • Matches the error guarantees of prior work

The repository includes:

  • 🔬 Computational efficiency experiments
  • 📊 Entry distribution visualizations
  • 🧠 Downstream benchmark evaluations
  • 🤖 Modified LLaDA model implementation

Repository Structure

├── Computational efficiency experiment
│   └── supp_basis_optimization.py
├── Downstream task experiment
│   ├── generate.py
│   ├── run_all.sh
│   ├── run_chebyshev_ARC_challenge.py
│   ├── run_chebyshev_ARC_easy.py
│   ├── run_chebyshev_GPQA.py
│   ├── run_chebyshev_hellaswag.py
│   └── run_chebyshev_llmu.py
├── Entry distributions
│   ├── opt_tinyllama_phi.py
│   ├── visualize_llada.py
│   └── visualize_tiny_llama.py
├── LLaDA-8B-Instruct-B&S-light
│   ├── modeling_llada.py
│   ├── configuration_llada.py
│   ├── config.json
│   └── ...
└── README.md

1️⃣ Computational Efficiency Experiment

File:

Computational efficiency experiment/supp_basis_optimization.py

This script implements:

  • Exact attention
  • Chebyshev polynomial approximation
  • Hybrid support-basis attention
  • Runtime comparison

To run:

cd "Computational efficiency experiment"
python supp_basis_optimization.py

You may modify:

  • N (sequence length)
  • d (hidden dimension)
  • DEGREE (Chebyshev polynomial degree)
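
For intuition on the Chebyshev component, here is a minimal standalone sketch (an illustration, not the script's actual code; the NumPy backend and concrete values are assumptions) that fits a degree-DEGREE Chebyshev polynomial to exp on a bounded interval and compares it with the exact entrywise exponential of the attention logits:

```python
import numpy as np
from numpy.polynomial import chebyshev as cheb

# Mirror the script's knobs (values here are illustrative)
N, d, DEGREE = 1024, 64, 8

rng = np.random.default_rng(0)
Q = rng.standard_normal((N, d))
K = rng.standard_normal((N, d))

A = Q @ K.T / np.sqrt(d)     # attention logits
B = np.abs(A).max()          # approximation interval [-B, B]

# Degree-DEGREE Chebyshev interpolant of exp on [-B, B]
p = cheb.Chebyshev.interpolate(np.exp, DEGREE, domain=[-B, B])

exact = np.exp(A)            # exact entrywise exponential
approx = p(A)                # polynomial surrogate, applied entrywise
print("max abs error:", np.abs(exact - approx).max())
```

This sketch only illustrates approximation quality; the sub-quadratic runtime in the paper comes from exploiting the low-degree polynomial structure of the dense part rather than materializing the full n × n matrix as done here.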

2️⃣ Entry Distribution Visualization

We empirically validate that the entries of Q and K are sub-Gaussian.

Supported models:

  • TinyLlama/TinyLlama-1.1B-Chat-v1.0
  • microsoft/phi-2
  • GSAI-ML/LLaDA-8B-Base
  • facebook/opt-1.3b

Scripts:

Entry distributions/
├── opt_tinyllama_phi.py
├── visualize_llada.py
└── visualize_tiny_llama.py

Each script:

  • Extracts Q/K entries
  • Plots entry distributions
  • Draws ±√(log n) thresholds

Example:

cd "Entry distributions"
python visualize_tiny_llama.py

These experiments support the theoretical assumption that large entries are rare.
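
As a hedged sketch of what these scripts do (the layer path assumes TinyLlama's Llama-style module names; the repository's plotting details may differ), one can hook the first layer's Q/K projections and histogram their outputs against the ±√(log n) thresholds:

```python
import math
import torch
import matplotlib.pyplot as plt
from transformers import AutoModelForCausalLM, AutoTokenizer

name = "TinyLlama/TinyLlama-1.1B-Chat-v1.0"
tok = AutoTokenizer.from_pretrained(name)
model = AutoModelForCausalLM.from_pretrained(name)
model.eval()

inputs = tok("The quick brown fox jumps over the lazy dog.", return_tensors="pt")

captured = []
def grab(module, inp, out):
    captured.append(out.detach().flatten())

# Hook the first layer's Q and K projections (Llama-style module names)
attn = model.model.layers[0].self_attn
hooks = [attn.q_proj.register_forward_hook(grab),
         attn.k_proj.register_forward_hook(grab)]
with torch.no_grad():
    model(**inputs)
for h in hooks:
    h.remove()

entries = torch.cat(captured).float().numpy()
n = inputs["input_ids"].shape[1]
thr = math.sqrt(math.log(n))     # the ±sqrt(log n) threshold

plt.hist(entries, bins=200, density=True)
plt.axvline(+thr, color="red"); plt.axvline(-thr, color="red")
plt.savefig("qk_entry_distribution.png")
```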


3️⃣ Downstream Task Experiments

Directory:

Downstream task experiment/

We evaluate hybrid Chebyshev attention on multiple benchmarks.

Datasets

  • ARC-Challenge
  • ARC-Easy
  • GPQA
  • HellaSwag
  • MMLU

Each script modifies the LLaDA model configuration:

model.config.hybrid_exact_ratio = args.exact_ratio
model.config.hybrid_chebyshev_degree = args.chebyshev_degree

Example: Run ARC Challenge

python run_chebyshev_ARC_challenge.py \
  --model_name "path/to/LLaDA-8B-Instruct-B&S" \
  --exact_ratio 0.2 \
  --chebyshev_degree 4

Run All Benchmarks

bash run_all.sh

This runs (see the sketch below):

  • Chebyshev degrees 4 and 6
  • Exact ratio = 0.2
  • All downstream tasks
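
For reference, the sweep performed by run_all.sh corresponds roughly to the following Python driver (file names come from the tree above; the --model_name flag from the earlier example is omitted here, and the exact flags are assumptions):

```python
import subprocess

# Task suffixes match the run_chebyshev_*.py files listed above
tasks = ["ARC_challenge", "ARC_easy", "GPQA", "hellaswag", "llmu"]

for degree in (4, 6):                      # degrees swept by run_all.sh
    for task in tasks:
        subprocess.run(
            ["python", f"run_chebyshev_{task}.py",
             "--exact_ratio", "0.2",       # fixed exact ratio
             "--chebyshev_degree", str(degree)],
            check=True,
        )
```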

Parameters

| Parameter        | Meaning                              |
|------------------|--------------------------------------|
| exact_ratio      | Fraction of entries computed exactly |
| chebyshev_degree | Polynomial degree for the dense part |
| steps            | Reverse diffusion steps              |
| block_length     | Semi-autoregressive block size       |

4️⃣ LLaDA-8B Hybrid Model

Directory:

LLaDA-8B-Instruct-B&S-light/

Contains:

  • Modified attention implementation
  • Hybrid support-basis logic
  • Config hooks

Key parameters:

model.config.hybrid_exact_ratio
model.config.hybrid_chebyshev_degree

These control:

  • Single-threshold support basis
  • Polynomial degree
  • Exact computation sparsity
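
A hedged loading sketch (the exact entry point may differ; trust_remote_code loads the modified modeling_llada.py shipped in this directory):

```python
from transformers import AutoModel

# Local path to the modified checkpoint directory
path = "LLaDA-8B-Instruct-B&S-light"
model = AutoModel.from_pretrained(path, trust_remote_code=True)

# The two hybrid knobs exposed by the modified attention
model.config.hybrid_exact_ratio = 0.2       # fraction computed exactly
model.config.hybrid_chebyshev_degree = 4    # degree on the dense part
```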

Algorithm Summary

Single-Threshold Support Basis

  1. Split Q and K into small entries (dense) and large entries (sparse).
  2. Compute exact attention on the sparse support and a Chebyshev polynomial approximation on the dense support.
  3. Combine via

     $$ \exp(A^{(S)}) + \exp(A^{(L)}) - \mathbf{1}_{n \times n} $$

     where $A^{(S)}$ and $A^{(L)}$ contain the attention logits on the small and large supports (zeros elsewhere). Since every entry lies in exactly one support, each zero entry contributes $\exp(0) = 1$, and subtracting the all-ones matrix removes this double count.
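
A minimal NumPy sketch of this single-threshold combination (illustrative only: the thresholding rule here is a simplified assumption, and it materializes the full n × n matrix, whereas the paper's algorithm exploits the split to stay sub-quadratic):

```python
import numpy as np
from numpy.polynomial import chebyshev as cheb

def support_basis_exp(Q, K, tau, degree=4):
    """Single-threshold split of exp(QK^T / sqrt(d)), per the formula above."""
    n, d = Q.shape
    A = Q @ K.T / np.sqrt(d)

    # An entry is "large" if its row of Q or its row of K has an entry above tau
    big_q = (np.abs(Q) > tau).any(axis=1)
    big_k = (np.abs(K) > tau).any(axis=1)
    large = big_q[:, None] | big_k[None, :]

    A_L = np.where(large, A, 0.0)     # sparse support (large entries)
    A_S = np.where(large, 0.0, A)     # dense support (small entries)

    # Exact exponential on the sparse part
    exp_L = np.exp(A_L)               # equals 1 off-support since exp(0) = 1

    # Chebyshev approximation of exp on the (bounded) dense part
    B = np.abs(A_S).max() + 1e-12
    p = cheb.Chebyshev.interpolate(np.exp, degree, domain=[-B, B])
    exp_S = np.where(large, 1.0, p(A_S))

    # Every entry lies in exactly one support; subtract the double-counted 1s
    return exp_L + exp_S - np.ones((n, n))
```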


Dependencies

Install:

pip install torch transformers datasets scipy matplotlib tqdm

For GPU experiments:

  • CUDA 11+
  • PyTorch ≥ 2.0 recommended

Reproducing Paper Results

Runtime Experiments

Computational efficiency experiment/

Entry Distributions

Entry distributions/

Downstream Accuracy

Downstream task experiment/

Citation

If you use this work, please cite:

@inproceedings{aliakbarpour2026support,
  title     = {Support Basis: Fast Attention Beyond Bounded Entries},
  author    = {Maryam Aliakbarpour and Vladimir Braverman and Junze Yin and Haochen Zhang},
  booktitle = {The 29th International Conference on Artificial Intelligence and Statistics},
  year      = {2026},
  url       = {https://openreview.net/forum?id=IgpnZIGFsD}
}
