Skip to content

Latest commit

 

History

History
28 lines (20 loc) · 1.83 KB

README.md

File metadata and controls

28 lines (20 loc) · 1.83 KB

Fine-tuning Open-Source LLMs to Small Languages

Link: bit.ly/praguellm

Slides:

Exercises:

Benchmarks:

  • mlprague: The benchmark we created together during the workshop, 111 A/B/C/D questions (🇨🇿 41, 🇸🇰 27, 🇮🇹 8, 🇫🇷 7, 🇺🇦 6...)
  • synczech50: Synthetic dataset of 50 A/B/C/D questions for quick evaluation how the LMM understands Czech and Czech specific knowledge.

Small Czech LLM:

  • cswikimistral_0.1: Mistral7B model fine-tuned with 4bit-QLoRA on Czech Wikipedia data