
SFT of LLama 2 #3582

Closed
gsaivinay opened this issue Jul 18, 2023 · 9 comments

Comments

@gsaivinay (Contributor)

Hello,

Just today, Meta open-sourced the Llama 2 models. Wondering if the OA team is considering these.

@andreaskoepf (Collaborator)

Absolutely. We are already evaluating the models and will start our own fine-tuning runs soon (within a couple of hours).

@Billyroot commented Jul 18, 2023 via email

@gsaivinay (Contributor, Author)

> We will start our own fine-tuning runs soon (within a couple of hours).

This is really great. Please keep us posted on the training and evaluation progress, if possible.

@gsaivinay changed the title from "LLama 2 is released" to "SFT of LLama 2" on Jul 19, 2023
@flozi00 commented Jul 19, 2023

Is there any place to track the training?

I'm running some experiments using LoRA: no resized embeddings for the OA special tokens, and a sequence length of 4096 tokens. The loss of the 13B model is stable and lower than that of Falcon 7B at 2048 tokens.
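
For anyone who wants to try a similar setup, here is a minimal sketch using the Hugging Face transformers and peft libraries. The LoRA rank, alpha, dropout, and target modules are illustrative assumptions, not the exact configuration from these experiments:

```python
# Minimal LoRA fine-tuning sketch for Llama 2 13B (illustrative;
# hyperparameters are assumptions, not the exact OA run config).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

model_name = "meta-llama/Llama-2-13b-hf"  # gated; requires an accepted license

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

# LoRA adapters on the attention projections; rank/alpha/dropout are assumptions.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    bias="none",
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()

# As in the comment above: the embedding matrix is NOT resized for the
# OA special tokens, and examples are packed/truncated to 4096 tokens.
max_seq_len = 4096
```

Note that without resized embeddings, markers like `<|prompter|>` are tokenized as ordinary text rather than as single special tokens.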

@gsaivinay (Contributor, Author)

> Is there any place to track the training?

I'm assuming the training progress will show up here as soon as the process is started, but I might be wrong.

@andreaskoepf (Collaborator)

> I'm assuming the training progress will show up here as soon as the process is started, but I might be wrong.

We publish/copy successful runs to the public-sft Weights & Biases (wandb) project afterwards.

@Billyroot

So we just have to wait for the release :). Do you think it might happen next week?

@gsaivinay (Contributor, Author)

Well, the 13B model is here: https://huggingface.co/OpenAssistant/llama2-13b-orca-8k-3319, and it has an 8k context length 🤯
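
A minimal inference sketch for this model follows. The `<|prompter|>`/`<|assistant|>` template below is the usual OA convention and an assumption here; the model card has the authoritative prompt format and notes any trust_remote_code requirement for the 8k RoPE scaling:

```python
# Hedged sketch: loading the 8k-context 13B model linked above.
# Verify the prompt template and loading flags against the HF model card.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "OpenAssistant/llama2-13b-orca-8k-3319"

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

# Usual OpenAssistant-style prompt convention (an assumption; check the card).
prompt = "<|prompter|>What is RoPE scaling?</s><|assistant|>"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(output[0]))
```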

@olliestanley (Collaborator)

Here is the first oasst-dataset SFT model for Llama 2 70B. As usual, the details are on the HF page, and future models will also be on HF, so I will close this issue now.

https://huggingface.co/OpenAssistant/llama2-70b-oasst-sft-v10
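
For completeness, a hedged loading sketch for the 70B model. The 4-bit bitsandbytes quantization is an assumption purely to fit the model on less hardware (it is not part of the released model), and the ChatML-style prompt below should be verified against the model card:

```python
# Sketch: loading the 70B SFT model with 4-bit quantization to reduce
# the GPU memory footprint (quantization is this sketch's assumption).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_name = "OpenAssistant/llama2-70b-oasst-sft-v10"

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    quantization_config=bnb_config,
    device_map="auto",
)

# ChatML-style template (verify the exact format on the model card).
prompt = (
    "<|im_start|>user\nWhat datasets was this model fine-tuned on?<|im_end|>\n"
    "<|im_start|>assistant\n"
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=200)
print(tokenizer.decode(output[0]))
```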
