
docs: add scoring system, rules, and contribution track #45

Merged
msaroufim merged 23 commits into gpu-mode:main from yf225:docs/scoring-and-rules
Mar 14, 2026

Conversation


@yf225 yf225 commented Mar 13, 2026

Summary

  • Add scoring section with point allocation table and ranking formula (top 10 per kernel earn performance points)
  • Add rules & requirements (correctness, submission format)
  • Add open-ended contribution track (autotuner improvements, bug fixes, tooling, docs)

Test plan

  • Verify the scoring formula matches the hackathon spec
  • Verify the point allocations are correct
  • Check that the rules match the latest requirements

🤖 Generated with Claude Code

yf225 and others added 9 commits March 13, 2026 15:42
Documents ENABLE_TILE=0 vs ENABLE_TILE=1 and the TileIR compilation
pipeline available via nvtriton on B200 instances. Covers how to enable
TileIR with Helion (ENABLE_TILE=1 + HELION_BACKEND=tileir), the
different tunables (num_ctas/occupancy vs num_warps/maxnreg), and how
to hardcode TileIR configs in submissions.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
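The setup this commit documents can be sketched in Python. This is a hedged sketch: the variable names (ENABLE_TILE, HELION_BACKEND) come from the commit message above, and their exact semantics depend on the nvtriton build on the B200 instances.

```python
import os

# Both variables must be set before importing helion so the TileIR
# pipeline is picked up (names per the commit message; behavior is
# build-dependent).
os.environ["ENABLE_TILE"] = "1"          # opt in to TileIR compilation
os.environ["HELION_BACKEND"] = "tileir"  # route Helion through TileIR

# import helion  # must come after the environment is configured
```

Using `os.environ` this way covers both local autotuning and submissions, since the variables are set in-process before the backend is selected.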
Group both sections under a single "Optional: Extra Performance Knobs"
heading to emphasize neither is required. Streamline both into
step 1 (autotune) / step 2 (hardcode) format. Add a "Which combination"
section showing all 4 options to try.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Remove duplicate bash export block — the Python os.environ in the
code example is sufficient for both local autotuning and submissions.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Add point allocation table, scoring formula (correctness + performance
ranking), rules & requirements, and the separate open-ended contribution
track for non-kernel Helion contributions.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

codecov bot commented Mar 13, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.


yf225 and others added 10 commits March 13, 2026 16:21
Each submission uses one static helion.Config for all shapes, not
per-shape configs. Simplified rules to reflect this.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Use a factory function (_make_kernel) to create kernel variants with
different helion.Config objects, and dispatch in custom_kernel() based
on input tensor shapes. This lets participants optimize each benchmark
shape independently.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
…st vs benchmark shapes

Test shapes: TODO to replace with default config or any config that passes correctness.
Benchmark shapes: TODO to replace with autotuned config.
Also add instructions on getting default config via autotune_effort="none".

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
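The default-config path mentioned above might look like this. A hedged sketch, assuming the Helion API referenced in this PR: passing autotune_effort="none" skips the autotuning search and compiles with Helion's default config. It is guarded so the file also loads where Helion is not installed, and the exact decorator kwargs may differ between versions.

```python
try:
    import torch
    import helion
    import helion.language as hl

    @helion.kernel(autotune_effort="none")  # skip search, use defaults
    def add(x: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
        out = torch.empty_like(x)
        for tile in hl.tile(out.size(0)):
            out[tile] = x[tile] + y[tile]
        return out
except ImportError:
    helion = None  # Helion/torch unavailable in this environment
```

A default config obtained this way is a reasonable placeholder for test shapes, while benchmark shapes should still get autotuned configs.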
Per-shape configs are the recommended approach. Remove mentions of
using a single config across all shapes.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Configs are always participant-provided.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@yf225 yf225 closed this Mar 14, 2026
@yf225 yf225 reopened this Mar 14, 2026
yf225 and others added 4 commits March 13, 2026 18:19
The previous description incorrectly stated geometric mean of 100 runs.
The actual helion eval uses CUDA graphs with L2 cache clearing, 10
measurements, and arithmetic mean.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
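The corrected methodology can be sketched as a small timing loop. This is a hedged sketch of the structure only: in the real harness `fn` is a replay of a captured CUDA graph and `clear_l2` overwrites a large device buffer; simple stand-ins keep the sketch runnable anywhere.

```python
import statistics
import time

def bench(fn, clear_l2=lambda: None, n_iters=10):
    # Per the eval described above: clear the L2 cache before each of
    # the 10 measurements, then report the arithmetic mean (not the
    # geometric mean of 100 runs, as previously stated).
    times = []
    for _ in range(n_iters):
        clear_l2()
        start = time.perf_counter()
        fn()  # real harness: graph.replay()
        times.append(time.perf_counter() - start)
    return statistics.mean(times)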
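The corrected methodology can be sketched as a small timing loop. This is a hedged sketch of the structure only: in the real harness `fn` is a replay of a captured CUDA graph and `clear_l2` overwrites a large device buffer; simple stand-ins keep the sketch runnable anywhere.

```python
import statistics
import time

def bench(fn, clear_l2=lambda: None, n_iters=10):
    # Per the eval described above: clear the L2 cache before each of
    # the 10 measurements, then report the arithmetic mean (not the
    # geometric mean of 100 runs, as previously stated).
    times = []
    for _ in range(n_iters):
        clear_l2()
        start = time.perf_counter()
        fn()  # real harness: graph.replay()
        times.append(time.perf_counter() - start)
    return statistics.mean(times)
```

Clearing L2 before every measurement keeps each replay cold-cache-comparable, which is why the arithmetic mean over 10 runs is stable enough for ranking.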
The LOC-based rule was gameable (denominator inflation with padding code),
so switch to a qualitative rule: inline triton/asm is allowed as escape
hatches, but predominantly inline submissions may be disqualified.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Spawn mode isolates each autotuner trial in a subprocess with timeout
protection, preventing hangs or crashes from killing the entire run.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
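What spawn-mode isolation buys can be illustrated with a generic subprocess-with-timeout sketch (hedged: this is not Helion's autotuner code, just the isolation pattern the commit describes).

```python
import subprocess
import sys

def run_trial(cmd, timeout=10.0):
    # Each autotuner trial runs in its own process with a timeout, so a
    # hang or crash kills only that trial, not the whole autotuning run.
    try:
        proc = subprocess.run(cmd, capture_output=True, timeout=timeout)
        return proc.returncode  # 0 on success, nonzero on crash
    except subprocess.TimeoutExpired:
        return None  # trial hung: record a failure and move on

# e.g. one trial that finishes and one that would hang:
ok = run_trial([sys.executable, "-c", "pass"])
hung = run_trial([sys.executable, "-c", "import time; time.sleep(30)"],
                 timeout=0.5)
```

The parent process survives both outcomes, which is exactly the property that keeps long autotuning runs from dying on a single bad config.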
@msaroufim msaroufim merged commit 506fbd4 into gpu-mode:main Mar 14, 2026
7 of 8 checks passed