-
Notifications
You must be signed in to change notification settings - Fork 64
Pull requests: ROCm/aiter
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Refine ck instance and update a8w8_bpreshuffle_tuned_gemm.csv
#621
opened Jul 7, 2025 by
solinzby1
Loading…
[TRITON]: Fix num_warps typo which was causing performance issues
#604
opened Jul 2, 2025 by
valechen
Loading…
[TRITON] Refactor Triton RMSNorm and LayerNorm unit tests
triton
#598
opened Jul 1, 2025 by
lucas-santos-amd
Loading…
[TRITON]: Standardize GEMM weight shape to (N, K) and TN memory layout (by default)
#597
opened Jul 1, 2025 by
willzhou-amd
Loading…
3 tasks done
[TRITON]: Kernel benchmarking improvements (for op_benchmarks/triton)
#594
opened Jun 30, 2025 by
willzhou-amd
Loading…
2 of 5 tasks
add num_kv_splits_indptr to mla for mtp<=4 case for now
#584
opened Jun 26, 2025 by
valarLip
Loading…
[TRITON] Add LayerNorm Backward Triton Kernels
triton
#546
opened Jun 16, 2025 by
lucas-santos-amd
Loading…
[TRITON][GFX950] Proton Capability & Metadata for GEMM fp4
#494
opened May 29, 2025 by
jtang10
Loading…
Add
act_first
flag in activation op to support flexible input ordering
#478
opened May 26, 2025 by
Conless
Loading…
1 task done
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-06-07.