
Is there a way to fine tune the llama.7B model? #166

Open
TNTTheLagger opened this issue Mar 20, 2023 · 9 comments

@TNTTheLagger

TNTTheLagger commented Mar 20, 2023

I want to try and fine-tune this model to see if I can make it into a sort of chatbot.
I have plenty of chat data in JSON files, but I don't know exactly how I would fine-tune the LLaMA model.

Does anyone have any references or tutorials like videos or GitHub repos on this subject?
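
For context, alpaca-lora (mentioned below) trains on JSON records with `instruction`/`input`/`output` fields, so chat logs would first have to be reshaped into that layout. A minimal sketch of such a conversion, assuming alternating user/bot messages (the file names and message schema here are hypothetical):

```python
import json

def chat_to_records(messages):
    """Pair alternating user/bot messages into instruction/output records."""
    records = []
    for user_msg, bot_msg in zip(messages[::2], messages[1::2]):
        records.append({
            "instruction": user_msg["text"],
            "input": "",
            "output": bot_msg["text"],
        })
    return records

with open("chat.json") as f:        # hypothetical: a list of {"text": ...} turns
    records = chat_to_records(json.load(f))

with open("train.json", "w") as f:  # the file you would point alpaca-lora at
    json.dump(records, f, indent=2)
```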

@DrakoHyena

DrakoHyena commented Mar 20, 2023

It wouldn't be feasible to fine-tune it yourself; you would need quite a few powerful machines. The alpaca model that comes installable is already fine-tuned to be a chatbot of sorts.

@TNTTheLagger
Author

Are you referring to the alpaca.7B?

@DrakoHyena

Yes I am

@TNTTheLagger
Author

I don't know, it acts a bit weird. It also tends to just repeat the same sentence or sentences over and over without stopping, no matter what I set the output token count to.

@rafaelpleite

Try writing the prompt as below:

Below is an instruction that describes a task. Write a response that appropriately completes the request.
Question: [Your question]

like

Below is an instruction that describes a task. Write a response that appropriately completes the request.
Question: What is an alpaca? How is it different from a llama?
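
A tiny Python helper makes the template reusable (a hypothetical wrapper, just formatting the text above):

```python
def build_prompt(question: str) -> str:
    """Wrap a question in the instruction template shown above."""
    return (
        "Below is an instruction that describes a task. "
        "Write a response that appropriately completes the request.\n"
        f"Question: {question}\n"
    )

print(build_prompt("What is an alpaca? How is it different from a llama?"))
```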

@Juleffel

Juleffel commented Mar 21, 2023

https://github.com/tloen/alpaca-lora with a few changes seems to be a good option (just change generate_prompt to something more chat-conversation-oriented than instruction-oriented).

https://github.com/deep-diver/Alpaca-LoRA-Serve is also a good option: he is using the alpaca-lora model with a generated prompt giving the instruction "give an answer to the following conversation remembering the context", but it doesn't seem particularly trained for conversation, so it might perform worse than a model trained on your chat dataset.

I'll try to fork alpaca-lora to make it train on conversations, if it hasn't already been done by the time I'm on it 😅
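
For the curious, here is a rough idea of what a conversation-oriented generate_prompt could look like in such a fork (the function name matches alpaca-lora's; the template and message schema are my own invention):

```python
def generate_prompt(history, reply=""):
    """Render a chat history into a single training/inference prompt."""
    turns = "\n".join(f"{m['speaker']}: {m['text']}" for m in history)
    return (
        "Below is a conversation. Write the next reply, "
        "remembering the context of the conversation.\n\n"
        f"{turns}\nAssistant: {reply}"
    )

# Example:
print(generate_prompt([{"speaker": "User", "text": "What is an alpaca?"}]))
```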

@mouchourider


I just trained the 7B model using the https://github.com/tloen/alpaca-lora project. But I don't get how you use the adapter along with the dalai project.

Does someone know how to do it?

@Juleffel

@mouchourider this is issue #135
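
For anyone finding this later: one common way to fold a LoRA adapter back into the base weights is peft's merge_and_unload. A rough sketch (my own, not the exact procedure from #135; it assumes HF-format LLaMA weights and a peft version that ships merge_and_unload, with paths as placeholders):

```python
import torch
from transformers import LlamaForCausalLM
from peft import PeftModel

# Load the HF-format base weights, then lay the trained adapter on top.
base = LlamaForCausalLM.from_pretrained(
    "decapoda-research/llama-7b-hf", torch_dtype=torch.float16
)
model = PeftModel.from_pretrained(base, "./lora-adapter")  # your adapter dir

# Fold the LoRA deltas into the base matrices and save a plain checkpoint;
# that checkpoint still has to be converted to the ggml format dalai reads.
merged = model.merge_and_unload()
merged.save_pretrained("./llama-7b-chat-merged")
```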

@tobiasgrossmann


Did you find a solution? I've got the instructions together, but no idea how to get them into an adapter or model. The Alpaca one seems to be a binary. (Sorry, total beginner.)
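
The downloadable alpaca file is a quantized binary meant for inference; fine-tuning happens on the full-precision HF weights and produces a small adapter. A stripped-down sketch of that step, loosely following alpaca-lora's finetune.py (dataset handling omitted; the model name and paths are placeholders):

```python
from transformers import LlamaForCausalLM, LlamaTokenizer
from peft import LoraConfig, get_peft_model

base = "decapoda-research/llama-7b-hf"          # HF-converted LLaMA weights
model = LlamaForCausalLM.from_pretrained(base)
tokenizer = LlamaTokenizer.from_pretrained(base)

# LoRA config roughly matching alpaca-lora's defaults.
config = LoraConfig(
    r=8,
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)
model.print_trainable_parameters()  # only the adapter weights train

# ...tokenize your instruction JSON and run a transformers.Trainer here...

model.save_pretrained("./lora-adapter")  # writes the small adapter files
```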
