Skip to content

Releases: nlpxucan/WizardLM

Release v1.6

27 Aug 10:47
a85f9bf
Compare
Choose a tag to compare

🚀Major Update: Introducing WizardCoder 34B trained from Code Llama


WizardCoder-34B surpasses GPT-4, ChatGPT-3.5 and Claude-2 on HumanEval with 73.2% pass@1

Release v1.5

13 Aug 15:24
f57ff95
Compare
Choose a tag to compare

🚀Major Update: Introducing WizardMath, the third member of Wizard Family

WizardMath 70B achieves:

  1. Surpasses ChatGPT-3.5, Claude Instant-1, PaLM-2 and Chinchilla on GSM8k with 81.6 Pass@1

  2. Surpasses Text-davinci-002, GAL, PaLM, GPT-3 on MATH with 22.7 Pass@1

Release v1.4

09 Aug 13:57
e74ac66
Compare
Choose a tag to compare

🚀Major Update: Introducing WizardLM-70B-V1.0 trained from Llama-2

Compared with Llama-2-70b-chat, there are the following updates:

Release v1.3

25 Jul 15:36
e253267
Compare
Choose a tag to compare

🚀Major Update: Introducing WizardLM-13B-V1.2 trained from Llama-2

Compared with WizardLM-13B-V1.1, there are the following updates:

Release v1.2

06 Jun 11:44
c64b61f
Compare
Choose a tag to compare

🚀Major Update: Introducing WizardLM 30B Version.

  • On difficulty-balanced Evol-Instruct testset, evaluated by GPT-4: WizardLM-30B achieves 97.8% of ChatGPT, Guanaco-65B achieves 96.6%, and WizardLM-13B achieves 89.1%.
  • We provide a comparison between the performance of the WizardLM-30B and ChatGPT on different skills to establish a reasonable expectation of WizardLM's capabilities.

Release v1.1

26 May 15:53
0966834
Compare
Choose a tag to compare

🚀Major Update: Introducing WizardLM 13B Version.

  • On difficulty-balanced Evol-Instruct testset, evaluated by GPT-4: WizardLM-13B achieves 89.1% of ChatGPT, Vicuna-13B achieves 86.9%, and WizardLM-7B achieves 78%.
  • The 13B version is trained on instruction data evolved from real-world human conversations (ShareGPT), while the 7B version is trained on instruction data evolved from machine-generated data (Alpaca).
  • We provide a comparison between the performance of the WizardLM-13B and ChatGPT on different skills to establish a reasonable expectation of WizardLM's capabilities.