Right now gpucomm-bot is simulation-level: it receives GitHub webhooks and logs events.
To make it real, we should pick an implementation track:
(A) Secure webhook + GitHub App auth
- Verify
X-Hub-Signature-256 using a WEBHOOK_SECRET (reject unsigned/invalid payloads).
- Add GitHub App JWT + installation token flow.
- Start taking actions: comment/label PRs, set a “GPU required” check, etc.
(B) Real GPU CI first
- Add a self-hosted runner with an NVIDIA GPU.
- Update
gpu.yml to run actual CUDA/PyTorch/nvcc build + tests/benchmarks (instead of simulated steps).
Question: which track do we want next, (A) or (B)?
Suggested decision criteria:
- If we need the bot to interact with GitHub (comments/labels/checks), pick (A).
- If we need real GPU validation ASAP, pick (B).
Checklist after decision:
Right now
gpucomm-botis simulation-level: it receives GitHub webhooks and logs events.To make it real, we should pick an implementation track:
(A) Secure webhook + GitHub App auth
X-Hub-Signature-256using aWEBHOOK_SECRET(reject unsigned/invalid payloads).(B) Real GPU CI first
gpu.ymlto run actual CUDA/PyTorch/nvcc build + tests/benchmarks (instead of simulated steps).Question: which track do we want next, (A) or (B)?
Suggested decision criteria:
Checklist after decision:
Update (2026-03-22): pushed Track B enablement changes.
Code changes (now on
main):gpu-requiredlabel OR title contains gpu/cuda (avoids endless queue if no GPU runner online):.github/workflows/gpu.ymlGPUCOMM_ENFORCE_CUDA_VERSION=true, usingconfig/gpu.json.allowed_cuda_versions:scripts/gpu-ci.sh*private-key*.pem) and removed a stray key file locally:.gitignoreNext actions (non-code):
self-hosted, linux, x64, gpuwith NVIDIA driver + CUDA toolkit (nvcc) available.PYTORCH_INDEX_URL(e.g.