⚡ cuLA - Fast Linear Attention on CUDA

🧭 What cuLA does

cuLA is a GPU tool for linear attention variants. It uses CUDA, CuTe DSL, and CUTLASS C++ to run attention work on NVIDIA graphics cards.

As an end user, you download the project and run its build steps on a Windows PC with a supported NVIDIA GPU.

🖥️ What you need

To run cuLA on Windows, you need:

Windows 10 or Windows 11
An NVIDIA GPU
A recent NVIDIA driver
CUDA Toolkit installed
Enough free disk space for source files and build files
Python 3.10 or newer for helper tools, if used by the project
CMake for build setup
Visual Studio Build Tools or Visual Studio with C++ support

A modern RTX card gives the best results. Older NVIDIA cards may still work if they support the CUDA version used by the project.
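The requirements above can be checked before you start a build. The following is a minimal sketch, not part of cuLA: it assumes the tools use their usual executable names (`nvidia-smi`, `nvcc`, `cmake`) and only tests whether they are on the system path. The helper names are hypothetical. The Visual Studio compiler is left out because it is normally available only inside a developer command prompt.

```python
import shutil
import sys

# Command-line tools implied by the requirements list. These executable names
# are the usual ones and are an assumption, not taken from the cuLA repository.
REQUIRED_TOOLS = ["nvidia-smi", "nvcc", "cmake"]


def find_missing_tools(tools, which=shutil.which):
    """Return the tools that cannot be found on PATH."""
    return [tool for tool in tools if which(tool) is None]


def python_is_new_enough(version_info=sys.version_info, minimum=(3, 10)):
    """True when the interpreter meets the Python 3.10 minimum listed above."""
    return tuple(version_info[:2]) >= minimum


if __name__ == "__main__":
    missing = find_missing_tools(REQUIRED_TOOLS)
    print("Missing tools:", ", ".join(missing) if missing else "none")
    print("Python 3.10 or newer:", python_is_new_enough())
```

If a tool is reported missing right after you installed it, open a new terminal and run the check again.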

📥 Download cuLA

Visit the project page to download cuLA:

https://github.com/Thibault7410/cuLA

If the page shows a release, download the Windows files from there. If it shows source code only, use the source archive or the green Code button on GitHub to get the files onto your computer.

🚀 Install on Windows

Open the cuLA GitHub page
Download the project files to a folder you can find again, such as Downloads or Desktop
If you downloaded a ZIP file, right-click it and choose Extract All
Open the extracted folder
Install the CUDA Toolkit if it is not already on your PC
Install Visual Studio Build Tools with the C++ desktop workload
Install CMake if it is not already installed
Open a Command Prompt or PowerShell window in the cuLA folder
Run the build command shown in the project files
Wait for the build to finish
Run the generated program or test file, if the repository includes one

🛠️ Setup steps

If you are starting from a clean Windows system, follow this order:

1. Install NVIDIA driver

Download and install the latest driver for your GPU from NVIDIA. Restart your PC when the installer asks.

2. Install CUDA Toolkit

Use a CUDA version that matches the project setup. The toolkit adds the compiler and GPU libraries needed by cuLA.

3. Install CMake

CMake generates the build files that the C++ build tools use. During setup, let it add itself to the system path if that option appears.

4. Install C++ build tools

Install Visual Studio Build Tools or Visual Studio. Make sure the C++ workload is selected.

5. Get the cuLA files

Download the project from GitHub and extract it to a simple folder path, such as:

C:\cuLA
C:\Users\YourName\Downloads\cuLA

Avoid folders with long paths or special characters. That keeps build steps simple.
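The path advice above can be turned into a quick check. This is a sketch under stated assumptions: 260 characters is the classic Windows path limit, the 120-character allowance for build output is an arbitrary illustrative value, and "special characters" is taken to mean anything other than letters, digits, underscore, hyphen, and dot. The function name is hypothetical.

```python
import re

# 260 is the classic Windows MAX_PATH. The headroom reserved for files that a
# build creates below the project folder is an assumed, illustrative number.
MAX_PATH = 260
BUILD_HEADROOM = 120

# Drive letter, then only letters, digits, underscore, hyphen, dot, backslash.
SAFE_PATH = re.compile(r"^[A-Za-z]:\\[A-Za-z0-9_\-\\.]*$")


def path_problems(folder):
    """Return a list of reasons the folder may cause build trouble."""
    problems = []
    if len(folder) + BUILD_HEADROOM > MAX_PATH:
        problems.append("path is long enough that build outputs may exceed MAX_PATH")
    if not SAFE_PATH.match(folder):
        problems.append("path contains spaces or special characters")
    return problems


if __name__ == "__main__":
    for candidate in (r"C:\cuLA", r"C:\My Projects\cuLA (copy)"):
        print(candidate, "->", path_problems(candidate) or "looks fine")
```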

▶️ How to run

After setup, open a terminal in the cuLA folder and follow the build steps in the project files.

A common Windows flow looks like this:

Open Start
Type cmd
Open Command Prompt
Move into the cuLA folder with the cd command
Run the build command from the README or project notes
Wait for the build to complete
Run the output file that the build creates

If the project includes sample input files or test scripts, use those first. They help you confirm that your GPU setup works before you use your own data.

🔧 Common build path on Windows

Many CUDA projects on Windows follow a path like this:

Clone or download the source
Configure with CMake
Build with Visual Studio tools
Run the compiled binary or test target

If cuLA includes preset build files, use them exactly as written. If it includes a Visual Studio solution, open the .sln file and build from there.
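For the generic configure-then-build path above, the two CMake invocations can be sketched as below. This assumes a `CMakeLists.txt` at the top of the source folder and a `Release` configuration, neither of which is confirmed for cuLA, so prefer the commands in the project's own README if they differ. The function names are hypothetical. The `-S`, `-B`, `--build`, and `--config` options are standard CMake command-line options.

```python
import subprocess
from pathlib import Path


def cmake_commands(source_dir, build_dir, config="Release"):
    """Return the configure and build command lines as argument lists."""
    configure = ["cmake", "-S", str(source_dir), "-B", str(build_dir)]
    build = ["cmake", "--build", str(build_dir), "--config", config]
    return [configure, build]


def run_build(source_dir, build_dir, config="Release"):
    """Run configure, then build, stopping at the first step that fails."""
    for command in cmake_commands(source_dir, build_dir, config):
        print("Running:", " ".join(command))
        subprocess.run(command, check=True)


if __name__ == "__main__":
    # Only print the commands here. Call run_build() once the toolchain is installed.
    for command in cmake_commands(Path("."), Path("build")):
        print(" ".join(command))
```

Building into a separate `build` folder keeps generated files out of the source tree, so a failed attempt can be cleaned up by deleting that one folder.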

📁 Folder layout

You may see folders and files like these:

src — source files
include — header files
tests — test cases
examples — sample runs
CMakeLists.txt — build setup file
README.md — main instructions
LICENSE — usage terms

This layout helps you find the parts you need without opening every file.

🧪 Check that it works

After you build cuLA, check for these signs:

The build finishes without errors
The output file appears in the build folder
Your NVIDIA GPU is listed during the run
Sample tests complete

If the project prints timing data or kernel info, that is a good sign that the GPU code is running.
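Independently of cuLA's own output, you can confirm that the driver sees your GPU. The sketch below calls `nvidia-smi` with its CSV query options and parses the result. It is an illustration rather than something the project ships, and the function names are hypothetical.

```python
import shutil
import subprocess

# Ask the driver for each GPU's name and driver version as plain CSV.
QUERY = ["nvidia-smi", "--query-gpu=name,driver_version", "--format=csv,noheader"]


def parse_gpu_lines(text):
    """Turn 'name, driver' CSV lines into (name, driver_version) tuples."""
    gpus = []
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split on the last comma so a comma in the GPU name would be kept.
        name, _, driver = line.rpartition(",")
        gpus.append((name.strip(), driver.strip()))
    return gpus


def list_gpus():
    """Return detected GPUs, or an empty list when nvidia-smi is unavailable."""
    if shutil.which("nvidia-smi") is None:
        return []
    result = subprocess.run(QUERY, capture_output=True, text=True)
    return parse_gpu_lines(result.stdout) if result.returncode == 0 else []


if __name__ == "__main__":
    gpus = list_gpus()
    if not gpus:
        print("No NVIDIA GPU reported (is the driver installed?)")
    for name, driver in gpus:
        print(f"{name} (driver {driver})")
```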

🧩 If something goes wrong

If the build does not work, check these items:

CUDA Toolkit version
NVIDIA driver version
CMake install path
Visual Studio C++ tools
Folder path length
GPU support for your CUDA version

If Windows says a command is not found, restart the terminal after you install tools. That lets Windows pick up the new paths.

If the build fails near CUDA files, reinstall the toolkit and make sure you used the same version the project expects.
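To compare the installed toolkit with the version the project expects, you can read the release number from `nvcc --version`. The sketch below assumes the usual `release X.Y` wording in that output. The `EXPECTED_RELEASE` value is a placeholder, not cuLA's actual requirement, so replace it with the version named in the project's documentation. The function names are hypothetical.

```python
import re
import shutil
import subprocess

# Placeholder only: set this to the CUDA release the project documentation names.
EXPECTED_RELEASE = (12, 4)


def parse_nvcc_release(text):
    """Extract (major, minor) from nvcc's version banner, or None if absent."""
    match = re.search(r"release (\d+)\.(\d+)", text)
    return (int(match.group(1)), int(match.group(2))) if match else None


def installed_cuda_release():
    """Return the installed toolkit release, or None when nvcc is not on PATH."""
    if shutil.which("nvcc") is None:
        return None
    result = subprocess.run(["nvcc", "--version"], capture_output=True, text=True)
    return parse_nvcc_release(result.stdout)


if __name__ == "__main__":
    installed = installed_cuda_release()
    print("Installed CUDA release:", installed)
    print("Matches expected:", installed == EXPECTED_RELEASE)
```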

📌 What this project is for

cuLA is useful if you want to:

test linear attention kernels on NVIDIA GPUs
compare GPU kernel speed
work with CUDA source code
study CuTe DSL and CUTLASS code
build attention layers for research or experiments

It is a source-based project, so you should expect to compile it on your own machine.

🔗 Source and updates

Use the GitHub page for the latest files, issues, and updates:

https://github.com/Thibault7410/cuLA

🧱 Build notes

If you are new to Windows builds, keep these habits:

Use a short folder path
Keep your CUDA version matched to the one the project expects
Restart after installs
Read the project README before you build
Start with a sample or test target if one exists

These habits reduce setup problems and make it easier to get a clean build.
