What's Changed
- input_pos_maxp1 as a Python integer by @Andrei-Aksionov in #2016
- add testing for py3.12 & py3.13 by @Borda in #2025
- ci: update guardian for PRs by @Borda in #2043
- Add devcontainer by @twsl in #2035
- Remove dependency from config to utils by @lukemerrick in #2034
- fix: Add fallback chat template by @andyland in #2040
- Add optional sys prompt by @twsl in #2036
- fix: Pretraining text files with recent litdata versions by @andyland in #2048
- tests: mark
test_evaluate_script
as flaky by @Borda in #2049 - simplify the GPU testing flow by @Borda in #2053
- ci: extend testing with
ubuntu-24.04
by @Borda in #2056 - Remove litserve version constraint by @twsl in #2055
- req: pin
bitsandbytes>=0.45.2,<0.45.5
by @Borda in #2057 - ci: use Thunder dev images for testing by @Borda in #2054
- Update spacing in README.md by @Borda in #2058
- Transformers version bump by @KaelanDt in #2029
- Qwen3 Dense by @ysjprojects in #2044
- phi-4 reasoning models by @ysjprojects in #2047
- adding logger args by @ysjprojects in #1973
- Qwen3 MoE Preliminary: add intermediate_size argument to MLP modules by @ysjprojects in #2046
- OLMo 2 by @ysjprojects in #1897
- bump: testing with PT 2.7.1 by @Borda in #2063
- Add Dependabot for Pip & GitHub Actions by @Borda in #2066
- build(deps): bump litdata from 0.2.45 to 0.2.49 by @dependabot in #2068
New Contributors
- @twsl made their first contribution in #2035
- @lukemerrick made their first contribution in #2034
- @prabhuteja12 made their first contribution in #2027
- @andyland made their first contribution in #2040
- @KaelanDt made their first contribution in #2029
Full Changelog: v0.5.8...v0.5.9