v0.2.0: Web UI Refactor, LongLoRA

@hiyouga released this 15 Oct 13:06
· 2636 commits to main since this release

New features

  • Support LongLoRA for the LLaMA models
  • Support training the Qwen-14B and InternLM-20B models
  • Support training state recovery for the all-in-one Web UI
  • Support Ascend NPU by @statelesshz in #975
  • Integrate MMLU, C-Eval and CMMLU benchmarks

Modifications

  • Rename repository to LLaMA Factory (formerly LLaMA Efficient Tuning)
  • Use the cutoff_len argument instead of max_source_length and max_target_length #944
  • Add a train_on_prompt option #1184
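
A rough sketch of how the two argument changes above might interact during preprocessing. It assumes that cutoff_len caps the combined length of prompt and response tokens, and that train_on_prompt decides whether prompt tokens contribute to the loss. The helper name build_example and its signature are hypothetical and are not taken from the repository.

```python
# Label value that PyTorch's cross-entropy loss skips by default (ignore_index).
IGNORE_INDEX = -100


def build_example(source_ids, target_ids, cutoff_len, train_on_prompt=False):
    """Hypothetical helper, not the project's actual preprocessing code.

    source_ids: token ids of the prompt
    target_ids: token ids of the response
    cutoff_len: assumed to be a single cap on the combined sequence length
    train_on_prompt: assumed to keep prompt tokens in the loss when True
    """
    # One length budget for the whole sequence instead of separate
    # max_source_length / max_target_length limits.
    input_ids = (source_ids + target_ids)[:cutoff_len]

    # Mask the prompt out of the loss unless train_on_prompt is set.
    if train_on_prompt:
        prompt_labels = list(source_ids)
    else:
        prompt_labels = [IGNORE_INDEX] * len(source_ids)
    labels = (prompt_labels + target_ids)[:cutoff_len]

    return {"input_ids": input_ids, "labels": labels}


if __name__ == "__main__":
    print(build_example([1, 2, 3], [4, 5], cutoff_len=4))
    print(build_example([1, 2, 3], [4, 5], cutoff_len=4, train_on_prompt=True))
```

With a single budget, a long prompt can crowd out the response, so the real implementation may split the budget differently; see #944 and #1184 for the actual behavior.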

Bug fix

[1] huggingface/transformers#25598 (comment)