What's Changed
- chore: fetch transformers version from lightning-thunder requirements, for gpu test by @bhimrazy in #2184
- build(deps): update huggingface-hub requirement from >=0.30,<0.35 to >=0.30,<1.3 by @dependabot[bot] in #2139
- build(deps-dev): bump litdata from 0.2.51 to 0.2.59 by @dependabot[bot] in #2176
- [pre-commit.ci] pre-commit suggestions by @pre-commit-ci[bot] in #2180
- Doc: add uv options advanced installation fix by @sanggusti in #2185
- Fix rope_cache_length computation for models with rope_local_base_freq by @adi776borate in #2179
- feat: add generate_strategy option to litgpt serve by @adi776borate in #2188
- fix: gemma-3 checkpoint conversion from litgpt to hf by @adi776borate in #2195
- Allow train.max_steps for profiling runs by @MukeshK17 in #2193
- build(deps-dev): update bitsandbytes requirement from <0.43,>=0.42 to >=0.42,<0.50 by @dependabot[bot] in #2199
- build(deps): update huggingface-hub requirement from <1.3,>=0.30 to >=0.30,<1.4 by @dependabot[bot] in #2200
- feat(logger) : Add LitLogger Support for Lightning.ai Experiment Tracking by @bhimrazy in #2198
- fix: correct LoRA index computation uses wrong feature size by @Mr-Neutr0n in #2201
- fix: default val_split_fraction to 0.05 for single-file JSON data by @dario-fumarola in #2204
- fix: flaky
test_litgpt_chat_endtoendby @bhimrazy in #2209 - chore(workflows): Sync GPU CI workflow image with lightning-thunder repo by @bhimrazy in #2208
- Add YaRN rope adjustment + DeepSeekV3 rope_interleave by @ysjprojects in #2202
- fix: handle empty prompt list in R1Base.apply() by @LincolnBurrows2017 in #2216
- Added correct rope_base to all Mistral configs by @adi776borate in #2218
- chore(ruff): bump target-version to py310 and update type annotations by @bhimrazy in #2221
- fix: raise error when prefill exceeds sliding window KV cache size by @alvinttang in #2215
- LoRAQKVLinear.lora_ind does not cache index by @mseeger in #2225
- feat: Introduce a pre-flight validation script for LitGPT checkpoints… by @OnePunchMonk in #2214
- build(deps): bump psutil from 7.1.3 to 7.2.2 by @dependabot[bot] in #2226
- [pre-commit.ci] pre-commit suggestions by @pre-commit-ci[bot] in #2229
- fix(finetune): run generate_example only on rank 0 in multi-GPU training by @Br1an67 in #2213
- fix(tests): Fix 3 test failures introduced by the validate PR by @bhimrazy in #2233
- fix: make sample() produce greedy decoding when temperature=0 by @CrepuscularIRIS in #2239
- fix(tests): xfail DynamoThunder tests with torch>=2.12 by @bhimrazy in #2254
- security: remove
pull_request_targettrigger from CPU tests workflow by @bhimrazy in #2253 - chore: pin GitHub Actions to verified commit SHAs by @bhimrazy in #2256
- ci: migrate workflows from pip to uv by @bhimrazy in #2257
- fix(ci): unblock check-links and stabilize GPU serve tests by @bhimrazy in #2270
- fix(tests): skip tokenizer tests for only gated model in fork prs by @bhimrazy in #2268
- Omit ChatML system turn when there is no system message (Qwen3 emitted literal 'None') by @lollinng in #2258
- fix(tests): use Gutenberg mirror to fix flaky book downloads by @bhimrazy in #2272
- docs: add Code of Conduct and rename LICENSE filename by @bhimrazy in #2273
- ci: migrate PyPI release to trusted publishing (OIDC) by @bhimrazy in #2261
- docs: add CONTRIBUTING.md by @bhimrazy in #2276
- docs: update CODEOWNERS with patterns and retired committers by @bhimrazy in #2275
- fix(docs): update broken links in README by @bhimrazy in #2277
- chore: bump version to 0.5.13 by @bhimrazy in #2279
New Contributors
- @sanggusti made their first contribution in #2185
- @adi776borate made their first contribution in #2179
- @MukeshK17 made their first contribution in #2193
- @Mr-Neutr0n made their first contribution in #2201
- @dario-fumarola made their first contribution in #2204
- @LincolnBurrows2017 made their first contribution in #2216
- @alvinttang made their first contribution in #2215
- @OnePunchMonk made their first contribution in #2214
- @Br1an67 made their first contribution in #2213
- @CrepuscularIRIS made their first contribution in #2239
- @lollinng made their first contribution in #2258
Full Changelog: v0.5.12...v0.5.13