Skip to content

Refactor and release Transformers patching and auto-kernelize loop & other updates#123

Merged
hannahli-nv merged 10 commits into
mainfrom
tilegym_update
May 8, 2026
Merged

Refactor and release Transformers patching and auto-kernelize loop & other updates#123
hannahli-nv merged 10 commits into
mainfrom
tilegym_update

Conversation

@hannahli-nv
Copy link
Copy Markdown
Collaborator

@hannahli-nv hannahli-nv commented May 8, 2026

Description

Update codes.

This PR contains 10 new commit(s).

Commits included:

081005d Add cutile-autotuning skill
7192508 [skill] update improve-cutile-perf skill
a2c8a8d [skill] refine monkey-patch-kernels-to-transformers based on feedback
87d29d5 skills: Drop HTML-comment headers from non-SKILL.md markdown files
970478e Batch cuTile cat copies
2d4cf0f Fix per-call replace_hints JIT-cache invalidation in decode loops
d58a3a5 [benchmark] add DUMP_CUPTI_EVENTS dump in benchmark_fn_cupti
fba356e bump version to 1.3.0
3fa4469 test: remove xfail marker for test_rope_quantize_fp8.py
7ba2266 Refactor and release Transformers patching and auto-kernelize loop

CI Configuration

config:
  build: true
  # valid options are "ops" and "benchmark"
  test: ["ops", "benchmark"]

Checklist

  • Code formatted and imports sorted via repo specifications (./format.sh)
  • Documentation updated (if needed)
  • CI configuration reviewed

@copy-pr-bot
Copy link
Copy Markdown

copy-pr-bot Bot commented May 8, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@hannahli-nv hannahli-nv changed the title Refactor and release Transformers patching and auto-kernelize loop & test: remove xfail markers fixed by & other updates Refactor and release Transformers patching and auto-kernelize loop & other updates May 8, 2026
@hannahli-nv
Copy link
Copy Markdown
Collaborator Author

/ok to test a2c8a8d

Signed-off-by: Yiwen Zhang <yiwenz@nvidia.com>
@hannahli-nv
Copy link
Copy Markdown
Collaborator Author

/ok to test 081005d

@hannahli-nv hannahli-nv merged commit 9b9574f into main May 8, 2026
31 checks passed
@hannahli-nv hannahli-nv deleted the tilegym_update branch May 8, 2026 07:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants