Check if a HuggingFace model will run on your GPU.
fitgpu takes a HuggingFace model ID and tells you whether the model's weights will fit in your GPU's available VRAM.
- Fetches the model's file metadata from HuggingFace (weights are not downloaded)
- Sums the sizes of all weight files (`.safetensors`/`.bin`)
- Queries your GPU's free VRAM using the NVIDIA driver
- Compares the two and shows the result
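The steps above can be sketched in a few lines of Python. This is a minimal illustration and not fitgpu's actual source: in the real tool the file list comes from the HuggingFace Hub metadata API and the free-VRAM figure from the NVIDIA driver, while here both are passed in as plain values.

```python
# Minimal sketch of the fit check (illustrative; not fitgpu's actual code).
# `files` stands in for the HuggingFace file metadata as (name, size-in-bytes)
# pairs; `free_vram_bytes` stands in for the free VRAM reported by the driver.

WEIGHT_EXTS = (".safetensors", ".bin")

def weight_bytes(files):
    """Sum the on-disk sizes of the weight files only."""
    return sum(size for name, size in files if name.endswith(WEIGHT_EXTS))

def fits(files, free_vram_bytes):
    """True if the summed weight sizes fit in the reported free VRAM."""
    return weight_bytes(files) <= free_vram_bytes
```

For `google/gemma-2-2b`, for example, the comparison is 4.89 GB of weights against 22.31 GB of free VRAM, so the result is `fits`.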
```
pip install fitgpu
```

```
fitgpu <model_id> [--token TOKEN]
```
- `model_id`: HuggingFace model ID (e.g. `google/gemma-2-2b`)
- `--token TOKEN`: optional HuggingFace API token for gated/private models
```
fitgpu google/gemma-2-2b
fitgpu meta-llama/Llama-2-7b-hf --token hf_YOUR_TOKEN
```

```
$ fitgpu google/gemma-2-2b
model : google/gemma-2-2b
size  : 4.89 GB (weights on disk)
GPU 0 : NVIDIA RTX 4090
VRAM  : 24.00 GB total, 22.31 GB free
result: fits
```
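The human-readable sizes in the output can be produced by a small formatter like the one below. This is a hypothetical helper, not fitgpu's own code, and it assumes binary gigabytes (1 GB = 1024³ bytes), which may differ from fitgpu's actual convention.

```python
def fmt_gb(n_bytes: int) -> str:
    # Hypothetical formatter (assumed, not fitgpu's source); uses binary GB.
    return f"{n_bytes / 1024**3:.2f} GB"
```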