Skip to content

v0.5.13

Latest

Choose a tag to compare

@lianakoleva lianakoleva released this 29 Jun 15:53
2685705

What's Changed

  • chore: fetch transformers version from lightning-thunder requirements, for gpu test by @bhimrazy in #2184
  • build(deps): update huggingface-hub requirement from >=0.30,<0.35 to >=0.30,<1.3 by @dependabot[bot] in #2139
  • build(deps-dev): bump litdata from 0.2.51 to 0.2.59 by @dependabot[bot] in #2176
  • [pre-commit.ci] pre-commit suggestions by @pre-commit-ci[bot] in #2180
  • Doc: add uv options advanced installation fix by @sanggusti in #2185
  • Fix rope_cache_length computation for models with rope_local_base_freq by @adi776borate in #2179
  • feat: add generate_strategy option to litgpt serve by @adi776borate in #2188
  • fix: gemma-3 checkpoint conversion from litgpt to hf by @adi776borate in #2195
  • Allow train.max_steps for profiling runs by @MukeshK17 in #2193
  • build(deps-dev): update bitsandbytes requirement from <0.43,>=0.42 to >=0.42,<0.50 by @dependabot[bot] in #2199
  • build(deps): update huggingface-hub requirement from <1.3,>=0.30 to >=0.30,<1.4 by @dependabot[bot] in #2200
  • feat(logger) : Add LitLogger Support for Lightning.ai Experiment Tracking by @bhimrazy in #2198
  • fix: correct LoRA index computation uses wrong feature size by @Mr-Neutr0n in #2201
  • fix: default val_split_fraction to 0.05 for single-file JSON data by @dario-fumarola in #2204
  • fix: flaky test_litgpt_chat_endtoend by @bhimrazy in #2209
  • chore(workflows): Sync GPU CI workflow image with lightning-thunder repo by @bhimrazy in #2208
  • Add YaRN rope adjustment + DeepSeekV3 rope_interleave by @ysjprojects in #2202
  • fix: handle empty prompt list in R1Base.apply() by @LincolnBurrows2017 in #2216
  • Added correct rope_base to all Mistral configs by @adi776borate in #2218
  • chore(ruff): bump target-version to py310 and update type annotations by @bhimrazy in #2221
  • fix: raise error when prefill exceeds sliding window KV cache size by @alvinttang in #2215
  • LoRAQKVLinear.lora_ind does not cache index by @mseeger in #2225
  • feat: Introduce a pre-flight validation script for LitGPT checkpoints… by @OnePunchMonk in #2214
  • build(deps): bump psutil from 7.1.3 to 7.2.2 by @dependabot[bot] in #2226
  • [pre-commit.ci] pre-commit suggestions by @pre-commit-ci[bot] in #2229
  • fix(finetune): run generate_example only on rank 0 in multi-GPU training by @Br1an67 in #2213
  • fix(tests): Fix 3 test failures introduced by the validate PR by @bhimrazy in #2233
  • fix: make sample() produce greedy decoding when temperature=0 by @CrepuscularIRIS in #2239
  • fix(tests): xfail DynamoThunder tests with torch>=2.12 by @bhimrazy in #2254
  • security: remove pull_request_target trigger from CPU tests workflow by @bhimrazy in #2253
  • chore: pin GitHub Actions to verified commit SHAs by @bhimrazy in #2256
  • ci: migrate workflows from pip to uv by @bhimrazy in #2257
  • fix(ci): unblock check-links and stabilize GPU serve tests by @bhimrazy in #2270
  • fix(tests): skip tokenizer tests for only gated model in fork prs by @bhimrazy in #2268
  • Omit ChatML system turn when there is no system message (Qwen3 emitted literal 'None') by @lollinng in #2258
  • fix(tests): use Gutenberg mirror to fix flaky book downloads by @bhimrazy in #2272
  • docs: add Code of Conduct and rename LICENSE filename by @bhimrazy in #2273
  • ci: migrate PyPI release to trusted publishing (OIDC) by @bhimrazy in #2261
  • docs: add CONTRIBUTING.md by @bhimrazy in #2276
  • docs: update CODEOWNERS with patterns and retired committers by @bhimrazy in #2275
  • fix(docs): update broken links in README by @bhimrazy in #2277
  • chore: bump version to 0.5.13 by @bhimrazy in #2279

New Contributors

Full Changelog: v0.5.12...v0.5.13