feat: migrate cuda.tile_experimental.autotune_launch → cuda.tile.tune.exhaustive_search & other updates by hannahli-nv · Pull Request #114 · NVIDIA/TileGym

hannahli-nv · 2026-04-23T09:23:38Z

Description

Update code.

This PR contains 5 new commit(s).

Commits included:

e899fc7 Fix attention calling error
968f54c Fix CUPTI flag
333ff32 Use cutile new autotuner for remaining kernels
12002f9 feat(flashinfer): Add flashinfer kernel, support flashinfer
a808f48 feat: migrate cuda.tile_experimental.autotune_launch → cuda.tile.tune.exhaustive_search

CI Configuration

config:
  build: true
  # valid options are "ops" and "benchmark"
  test: []

Checklist

Code formatted and imports sorted via repo specifications (./format.sh)
Documentation updated (if needed)
CI configuration reviewed

copy-pr-bot · 2026-04-23T09:23:42Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

hannahli-nv · 2026-04-23T09:27:53Z

/ok to test e899fc7

hannahli-nv · 2026-04-24T01:21:16Z

/ok to test 4fdd159

hannahli-nv · 2026-04-24T05:58:36Z

/ok to test 86d96a3

hannahli-nv · 2026-04-24T08:04:40Z

/ok to test e971116

hannahli-nv · 2026-04-24T14:45:01Z

/ok to test 4765bdc

hannahli-nv · 2026-04-24T14:55:33Z

/ok to test 2e9e9ad

….exhaustive_search

… budgets

The migration to cuda.tile.tune.exhaustive_search exhaustively searches the entire config space and has no built-in per-config compile timeout, so slow-to-compile configs on sm120 can stall CI. Scope the compile timeout to autotune only, and raise CI step/job budgets to absorb the longer adaptive-repeat measurement loop in the new tune API.

- Wrap every cuda.tile.tune.exhaustive_search call site (13 across 10 op files) with `with ct.compiler_timeout(5):` so individual slow configs are killed and routed to result.failures while non-autotune ct.launch compiles remain unaffected.
- Bump the test-benchmark job timeout 40 -> 70 min and the "Pull and run benchmarks" step timeout 35 -> 60 min.
- Bump the per-benchmark subprocess timeout in run_all_json.py from 10 min -> 20 min.

hannahli-nv · 2026-04-25T01:36:07Z

/ok to test 161ef03

hannahli-nv requested a review from xjmxyt April 23, 2026 09:59

xjmxyt approved these changes Apr 23, 2026


hannahli-nv force-pushed the tilegym_update branch from 2e9e9ad to 1a424f0 Compare April 25, 2026 01:19

xjmxyt and others added 6 commits April 25, 2026 09:34

feat: migrate cuda.tile_experimental.autotune_launch → cuda.tile.tune.exhaustive_search

6329cb9

feat(flashinfer): Add flashinfer kernel, support flashinfer

8b2392d

Use cutile new autotuner for remaining kernels

dbe9b79

Fix CUPTI flag

423d79b

Fix attention calling error

2cb19a0

hannahli-nv force-pushed the tilegym_update branch from 1a424f0 to 161ef03 Compare April 25, 2026 01:34

hannahli-nv merged commit 6311a1e into main Apr 25, 2026
13 checks passed

hannahli-nv deleted the tilegym_update branch April 25, 2026 01:41

hannahli-nv merged 6 commits into main from tilegym_update
