
How do we finetune the model with new data? #466

Closed
ekolawole opened this issue Mar 24, 2023 · 17 comments
Labels: enhancement (New feature or request), stale

Comments

ekolawole commented Mar 24, 2023

Can we have a finetune.cpp or finetune.exe to incorporate new data into the model? The use case is to design an AI model that can do more than just general chat: it can become very knowledgeable in the specific topics it is fine-tuned on. Also, please ensure no GPU is required for the entire process, because that is what makes this repo awesome in the first place.

Green-Sky added the enhancement (New feature or request) label Mar 24, 2023
Green-Sky (Collaborator) commented Mar 24, 2023

Sounds cool. But this is not on the short-term Roadmap.

ekolawole (Author) commented
The goal of these integrations is to enable academia to adapt to the new era of AI, and to simplify the intricacies involved. So users should be able to finetune their models to suit their data needs. I was running the 30B model this morning, and the AI does not have important data about LangChain and other recent use cases from 2021 until now. I believe the data used to build the models is old. My team is looking for a no-GPU deployment like this one that can also support finetuning. What can be done to move this request ahead on the roadmap?

rupakhetibinit commented
What you're talking about is training/finetuning, which is theoretically possible on CPU but practically infeasible, because you would be training for literal months instead of days; you need a GPU to actually finetune this. This repository is only for inference/running the model.
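
(As a rough sanity check on the "months on CPU" claim, here is a back-of-envelope estimate; every constant below is an assumption for illustration, not a benchmark.)

```python
# Back-of-envelope: full fine-tuning time, CPU vs. GPU.
params = 7e9                 # LLaMA 7B
tokens = 100e6               # assumed size of a fine-tuning corpus, in tokens
train_flops = 6 * params * tokens   # common rule of thumb: ~6 FLOPs/param/token for training

cpu_flops_per_s = 0.2e12     # assumed sustained desktop-CPU throughput (~0.2 TFLOP/s)
gpu_flops_per_s = 100e12     # assumed sustained training-GPU throughput (~100 TFLOP/s)

day = 86_400
print(f"CPU: ~{train_flops / cpu_flops_per_s / day:,.0f} days")   # ~243 days
print(f"GPU: ~{train_flops / gpu_flops_per_s / day:,.1f} days")   # ~0.5 days
```

Under these assumptions, full fine-tuning lands around eight months on CPU versus roughly half a day on GPU, which is exactly the gap being described.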

PriNova commented Mar 24, 2023

> What you're talking about is training/finetuning, which is theoretically possible on CPU but practically infeasible, because you would be training for literal months instead of days; you need a GPU to actually finetune this. This repository is only for inference/running the model.

I think it depends on the approach to fine-tuning.
If the LoRA approach is used (training only the k, q, v projection layers, as far as I understand it), then it could be done on CPU, and we could transfer and share the LoRA adapters.
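
(For readers unfamiliar with the idea PriNova is describing: LoRA freezes the original weight matrix W and learns only a low-rank update BA added on top, so very few parameters are trained. A minimal sketch in plain PyTorch — shapes, rank, and names are illustrative, and this is not llama.cpp code:)

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen linear layer plus a trainable low-rank update: y = x W^T + x (BA)^T * (alpha/r)."""
    def __init__(self, base: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        self.base.weight.requires_grad_(False)          # original weights stay frozen
        self.lora_a = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.lora_b = nn.Parameter(torch.zeros(base.out_features, r))  # zero init: no change at start
        self.scale = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + (x @ self.lora_a.T @ self.lora_b.T) * self.scale

# Wrap e.g. a q-projection; only lora_a/lora_b are trainable.
q_proj = LoRALinear(nn.Linear(4096, 4096, bias=False), r=8)
trainable = sum(p.numel() for p in q_proj.parameters() if p.requires_grad)
print(trainable)  # 65,536 trainable vs. 16,777,216 frozen weights
```

Since only the two small matrices are updated, the gradient computation and the shareable artifact are both a tiny fraction of the full model.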

Green-Sky (Collaborator) commented
Loading only the LoRA part IS on the short-term roadmap: #457

leszekhanusz commented
There is the lxe/simple-llama-finetuner repo available for finetuning, but you need a GPU with at least 16GB of VRAM to finetune the 7B model.
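
(For anyone who does have such a GPU, the usual Hugging Face PEFT recipe looks roughly like the sketch below. This is a generic example rather than the simple-llama-finetuner code; the model path and hyperparameters are placeholders.)

```python
# Rough sketch of LoRA fine-tuning on GPU with Hugging Face transformers + peft.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained("path/to/llama-7b")
tokenizer = AutoTokenizer.from_pretrained("path/to/llama-7b")

config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=["q_proj", "v_proj"],  # attention projections, as in the LoRA paper
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)
model.print_trainable_parameters()  # typically well under 1% of the full model

# ...train with a normal loop or transformers.Trainer, then save only the
# small adapter with: model.save_pretrained("lora-adapter")
```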

Free-Radical commented
Is there a way to fine-tune these models for reading my documents, etc., utilizing cloud hardware but with no OpenAI, Pinecone, or other non-free third-party dependencies? Code examples would be awesome (I've seen LangChain's docs, but they are not detailed enough, at least not for me). @leszekhanusz @Green-Sky @PriNova @rupakhetibinit @ekolawole

ch3rn0v commented Apr 17, 2023

@Free-Radical, try a vector store such as Weaviate. Your query string can contain natural-language text; the response is based on vector similarity between that string and the documents in the store. I also tried Vespa, but it didn't work at all, due to a design choice that I find questionable; see vespa-engine/pyvespa#499 for details. There are other open source vector storage solutions too.
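
(To make the retrieval idea concrete: you embed the documents and the query as vectors and return the documents most similar to the query. A storage-agnostic toy sketch with numpy; a real deployment would use a learned embedding model and a vector store such as Weaviate instead of this bag-of-words stand-in.)

```python
# Minimal illustration of vector-similarity retrieval (the idea behind Weaviate et al.).
import numpy as np

docs = [
    "LoRA adapters make fine-tuning cheaper",
    "Weaviate is an open source vector database",
    "llama.cpp runs LLaMA inference on CPU",
]

vocab = sorted({w for d in docs for w in d.lower().split()})

def embed(text: str) -> np.ndarray:
    """Toy bag-of-words embedding, unit-normalized so dot product = cosine similarity."""
    v = np.array([text.lower().split().count(w) for w in vocab], dtype=float)
    norm = np.linalg.norm(v)
    return v / norm if norm else v

doc_vecs = np.stack([embed(d) for d in docs])
query = embed("vector database for documents")
best = int(np.argmax(doc_vecs @ query))   # most similar document
print(docs[best])                         # -> the Weaviate document
```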

Free-Radical commented
@ch3rn0v Thanks man, Weaviate looks good, better than going "raw" with FAISS. Will check out Vespa too.

Green-Sky (Collaborator) commented
@Free-Radical you can look at https://github.com/tloen/alpaca-lora
LoRA adapter loading support was also merged today (#820), so I suggest you stay on the LoRA side (lower quality, but way, way faster to train).

Green-Sky (Collaborator) commented
Since @xaedes contributed the backward versions of the necessary tensors, this could now be within reach. #1360

(afaik this is the tracking issue for finetuning)

Sovenok-Hacker commented
> The goal of these integrations is to enable academia to adapt to the new era of AI, and to simplify the intricacies involved. So users should be able to finetune their models to suit their data needs. I was running the 30B model this morning, and the AI does not have important data about LangChain and other recent use cases from 2021 until now. I believe the data used to build the models is old. My team is looking for a no-GPU deployment like this one that can also support finetuning. What can be done to move this request ahead on the roadmap?

I agree. It would be helpful to be able to fine-tune LLaMA models on CPU using only llama.cpp.

Sovenok-Hacker commented May 20, 2023

> What you're talking about is training/finetuning, which is theoretically possible on CPU but practically infeasible, because you would be training for literal months instead of days; you need a GPU to actually finetune this. This repository is only for inference/running the model.

I disagree. What if we only need to add a little data? That could be done in hours, so why not add a small fine-tuning utility?

Dampfinchen commented
Hopefully this will be possible someday. Like many others, I do not have the VRAM to fine-tune or create a LoRA for models.

I wonder if it's possible to use the newly added CUDA acceleration in llama.cpp to fine-tune quantized models, so it doesn't take ages compared to a CPU-only approach.

Tom-0727 commented
> > What you're talking about is training/finetuning, which is theoretically possible on CPU but practically infeasible, because you would be training for literal months instead of days; you need a GPU to actually finetune this. This repository is only for inference/running the model.
>
> I disagree. What if we only need to add a little data? That could be done in hours, so why not add a small fine-tuning utility?

I'm afraid it's not as simple as a little fine-tuning utility. While you may only want to add a small amount of data, fine-tuning requires updating many weights in the model. Even a small change can have a significant impact on the entire model, so it typically involves retraining or adjusting a considerable portion of the weights.

Sovenok-Hacker commented
> > > What you're talking about is training/finetuning, which is theoretically possible on CPU but practically infeasible, because you would be training for literal months instead of days; you need a GPU to actually finetune this. This repository is only for inference/running the model.
> >
> > I disagree. What if we only need to add a little data? That could be done in hours, so why not add a small fine-tuning utility?
>
> I'm afraid it's not as simple as a little fine-tuning utility. While you may only want to add a small amount of data, fine-tuning requires updating many weights in the model. Even a small change can have a significant impact on the entire model, so it typically involves retraining or adjusting a considerable portion of the weights.

Yes, but a small amount of data means a small number of iterations. We can also use LoRA or QLoRA to train only an adapter and make fine-tuning simpler.
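
(To put rough numbers on that: a LoRA adapter is tiny compared to the full model, which is why the iteration count, not the weight count, dominates. A quick estimate with assumed LLaMA-7B-like shapes and a typical rank:)

```python
# Rough trainable-parameter count for a LoRA adapter vs. the full model.
# Shapes approximate LLaMA 7B; rank r=8 is a typical choice, not a rule.
n_layers, d_model, r = 32, 4096, 8
full_params = 7e9

# LoRA on the q and v projections: each gets A (r x d) and B (d x r) per layer.
adapter_params = n_layers * 2 * (r * d_model + d_model * r)

print(f"adapter: {adapter_params/1e6:.1f}M params "
      f"({adapter_params/full_params:.4%} of the full model)")   # ~4.2M, ~0.06%
```

About four million trainable parameters, i.e. well under 0.1% of the 7B weights, which is what makes adapter training plausible even on modest hardware.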

AAbushady pushed a commit to AAbushady/llama.cpp that referenced this issue Jan 27, 2024 (feat: oai-adapter)
github-actions bot added the stale label Mar 25, 2024
github-actions bot commented
This issue was closed because it has been inactive for 14 days since being marked as stale.
