v26.01
What's Changed
Features & Enhancements
- HSTU KV Cache Manager V2 by @geoffreyQiu in #251
- workload balancer and datasets folder refactor by @JacoCheung in #275
- Fea unify pooling to dynamic embedding table by @shijieliu in #301
- Optimize EmbeddingBagCollection preliminarily by @jiashuy in #268
- refactor unique to stateless op by @shijieliu in #290
- Optimize dedup indices and segmented unique by @shijieliu in #293
- optimize backward local_reduce, use fwd unique results by @shijieliu in #299
Bug Fixes
- Fix devel build failure by @JacoCheung in #288
- Fix wrong evicted values when insert failed/busy in insert_and_evict. by @jiashuy in #284
- Fix test_jagged_tensor import bug by @JacoCheung in #289
- Fix KVCounter initialization bug. by @z52527 in #282
- Fix mcore version in training readme by @JacoCheung in #286
- Fix issue related to empty batch by @jiashuy in #271
- Fix issue #272: dump/load score consistency in STEP mode by @ShaobinChen-AH in #298
Misc
- Pull new triton hstu kernel by @JacoCheung in #291
- Add README for embedding pooling. by @z52527 in #270
- rename wrong module name by @gameofdimension in #278
- [CI] split dynamicemb tests by @shijieliu in #273
- Update triton version by @JacoCheung in #287
- release v26.01 by @shijieliu in #309
New Contributors
- @gameofdimension made their first contribution in #278
- @ShaobinChen-AH made their first contribution in #298
Full Changelog: v25.12...v26.01