Replies: 1 comment
-
|
@li-yifei — never a bother, you are very welcome here. 5090 owners absolutely belong in this club. I want to address the name directly, because you're not the first to feel this way (see #27 and others in the last 24 hours): "club-3090" is a tribute, not a hardware filterThe 3090 holds a unique place in 2026: it's the last GPU class that's actually still affordable for someone getting started with AI (~$700-900 used in most markets vs $2-3K for a 4090, $3-4K for a 5090, $30K+ for an H100). The name honors that. It's the card you can buy to learn this stack on without taking a loan. But this repo is about squeezing the absolute maximum out of whatever GPU you happen to have. The methodology — TQ3 KV / MTP K=3 / Genesis patches / Cliff 1+2 closures / verify-stress as ground truth — applies on every CUDA card from SM 8.0 (A100) through SM 12.0 (5090) and Blackwell datacenter. We just ship default configs tuned for what most people own. What's interesting about 5090 specifically5090 is Blackwell consumer SM 12.0 — a regime where most of our shipped patches will need re-tuning, and several blockers we work around on Ampere simply disappear:
What we'd love from any 5090 ownerThe fastest path to a useful data point is ~15 min: git clone https://github.com/noonghunna/club-3090
cd club-3090
bash scripts/setup.sh qwen3.6-27b
# Try the dual-turbo or default compose:
docker compose -f models/qwen3.6-27b/vllm/compose/docker-compose.yml up -d
# Then:
bash scripts/verify-stress.shPost the boot log + verify-stress output as a discussion. We'll turn it into:
If a The diversity push is in #27 if you want context — same applies here. Welcome aboard. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Sry to bother
Beta Was this translation helpful? Give feedback.
All reactions