-
Notifications
You must be signed in to change notification settings - Fork 1.3k
Pull requests: NVIDIA/cutlass
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix epilogue::thread::Convert cannot be used with DefaultEpilogue
inactive-30d
#2333
opened May 26, 2025 by
solrex
Loading…
Add SM80/89 blockwise scaling kernel, support FP8 block/groupwise on Ada, INT8 on Ampere
inactive-30d
#2328
opened May 24, 2025 by
solrex
Loading…
Ensure compatibility with "-Wimplicit-fallthrough" when compiling with Clang
inactive-30d
#2324
opened May 22, 2025 by
wenxin0319
Loading…
More generic interface for Group GEMM problem size
inactive-30d
#2318
opened May 20, 2025 by
nandor
Loading…
Support N={48, 80, 96, 112, ...} for SM100 EpilogueTileAuto
inactive-30d
#2269
opened Apr 29, 2025 by
Algy
Loading…
Limit the number of SMs (sm_count) to user-provided value during profiling.
inactive-30d
#2257
opened Apr 22, 2025 by
manishucsd
Loading…
Adding Blackwell support for distributed GEMM.
inactive-30d
#2179
opened Mar 16, 2025 by
whatdhack
Loading…
Fix CUTE_DEVICE for cast_smem_ptr_to_unit
inactive-30d
#2171
opened Mar 13, 2025 by
monellz
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.