
How to train model with databricks-dolly-15k.jsonl dataset format. #13

Closed

TapendraBaduwal opened this issue Sep 8, 2023 · 4 comments
@TapendraBaduwal

How do I train the model with the databricks-dolly-15k.jsonl dataset format?

Can we fine-tune using BitsandBytes and SFT?

@abrdgrt

abrdgrt commented Sep 8, 2023

..

@VatsaDev

VatsaDev commented Sep 8, 2023

@TapendraBaduwal you should probably wait until the model is fully trained before asking about this, but SFT was mentioned in one of the closed issues.

@jzhang38
Owner

jzhang38 commented Sep 12, 2023

Our model can largely be plugged and played into repos that support Llama 2 (including BitsandBytes and SFT repos like FastChat). For your case, you need to find a training script that supports the databricks-dolly-15k.jsonl dataset format and change the model name to our released checkpoint. Just make sure you have the latest version of HF Transformers to support MQA. We are working on fine-tuning our model as well and will be releasing something, probably next week.
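
For illustration, a minimal sketch of that "swap the model name" step, assuming the Hugging Face datasets and transformers libraries; the checkpoint id below is a placeholder, not the actual released name, and the prompt template should follow whatever your training script expects:

```python
# Minimal sketch: point an existing SFT script at the released checkpoint and
# feed it databricks-dolly-15k. The checkpoint id is a placeholder; substitute
# the actual released TinyLlama checkpoint.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint = "RELEASED_TINYLLAMA_CHECKPOINT"  # placeholder model id

tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint)

# databricks-dolly-15k records have "instruction", "context", and "response" fields.
dolly = load_dataset("databricks/databricks-dolly-15k", split="train")

def to_text(example):
    # Fold the dolly fields into a single training string.
    prompt = example["instruction"]
    if example["context"]:
        prompt += "\n\n" + example["context"]
    return {"text": f"### Instruction:\n{prompt}\n\n### Response:\n{example['response']}"}

train_data = dolly.map(to_text)
# train_data["text"] is what most SFT training scripts consume.
```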

@TapendraBaduwal
Author

@jzhang38 For fine-tuning I am using Parameter-Efficient Fine-Tuning (PEFT). PEFT supports the QLoRA method, which fine-tunes a small fraction of the LLM parameters on top of 4-bit quantization and then merges the adapter weights. Is this the right way to fine-tune this tiny model?
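
For reference, a rough sketch of that QLoRA setup with PEFT and BitsandBytes, assuming recent transformers/peft/bitsandbytes releases; the checkpoint id and LoRA hyperparameters are illustrative placeholders, not values from this repo:

```python
# Rough QLoRA sketch: 4-bit quantized base model + LoRA adapters via PEFT.
import torch
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

checkpoint = "RELEASED_TINYLLAMA_CHECKPOINT"  # placeholder model id

# Load the frozen base model with 4-bit NF4 quantization.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(
    checkpoint, quantization_config=bnb_config, device_map="auto"
)
model = prepare_model_for_kbit_training(model)

# Attach a small LoRA adapter to the attention projections (values illustrative).
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()

# After training, merge the adapter weights back into the base model
# (depending on your peft version you may need to reload the base model in
# fp16/bf16 before calling merge_and_unload()):
# merged = model.merge_and_unload()
# merged.save_pretrained("tinyllama-dolly-qlora-merged")
```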
