Skip to content
puma-community-bot edited this page May 24, 2026 · 2 revisions

PUMA Community Wiki

PUMA Community is a public hub where users of the PUMA benchmarking tool share their local LLM evaluation results. Every submission is cryptographically integrity-checked, schema-validated, and PII-scanned before publication. The submission flow lives entirely on GitHub — there is no central server, no account beyond your GitHub account, and no fee.

Why this exists

Local LLM benchmarks depend heavily on hardware: a 7-billion-parameter model behaves differently on a 4 GB GPU than on a 24 GB GPU than on CPU. No commercial benchmark captures that diversity. By pooling verified results across machines, PUMA Community builds a community-owned record of how local models actually perform across the configurations practitioners run.

Explore the Wiki

  • Why PUMA Community? — the rationale and the value proposition for sharing your results.
  • Running a Benchmark — a short overview of running PUMA locally; the full reference lives in the PUMA Wiki.
  • Submitting Results — step-by-step submission flow, including PAT creation, dry-run, and publish.
  • Submission Format — what's inside a submission JSON, what every field means, and accepted values.
  • Anonymity and Privacy — what is and isn't published, how the PII scanner works, how to withdraw a submission.
  • Validation Process — what happens after you open a submission PR.
  • Mirror Locations — status of Hugging Face, Zenodo, and Kaggle mirrors.
  • FAQ — common questions about validation, hardware, models, and integrity hashes.
  • For Maintainers — operator-facing guide for those with write access to this repository.

Quick links

  • PUMA Community repo — the canonical submission archive and the schema.
  • PUMA tool repo — the benchmarking framework that produces the submissions.
  • Latest badges — submission count, model count, scenario count, latest submission.

Clone this wiki locally