v25.08
What's Changed
Features & Enhancements
- Refactor dyanmicemb with Cache&Storage. by @jiashuy in #128
- Support Kuairand dataset inference with alignment to training by @geoffreyQiu in #122
- Support eval mode for dynamicemb and move insert in backward to forward for use_index_dedup=True by @shijieliu in #136
- export hstu arbitrary mask by @shijieliu in #148
- Optimize TP HSTU layer by @JacoCheung in #132
Bug fixs
- Fix invalid pip option: replace --no-cache with --no-cache-dir by @mia1460 in #126
- Remove HostAlloc in dataloader by @JacoCheung in #129
- Fix filtering of samples with insufficient history by @mia1460 in #134
- fix pipeline test by @shijieliu in #135
- Hkv timeline clean by @jiashuy in #137
- Fix calc flops by @shijieliu in #139
- fix(dataset): add per-user reorder by time and pre-sort to guarantee … by @mia1460 in #141
- fix preprocessor not working on absolute data path by @shijieliu in #146
- fix codespell cheking by @shijieliu in #149
- fix collective utset by @shijieliu in #151
- Fix the shape hint for offsets by @yiwenchen2025 in #153
Misc
- Update dynamicemb benchmark by @jiashuy in #138
- Update the benchmarks and results. by @geoffreyQiu in #144
- update benchmark doc by @shijieliu in #150
- update benchmark result of dynamicemb to figure by @jiashuy in #154
New Contributors
- @mia1460 made their first contribution in #126
- @yiwenchen2025 made their first contribution in #153
Full Changelog: v25.07...v25.08