Training data #2
Great work and repo.
Whilst I'm aware the actual training likely follows general LLM training scripts/flows, it would be nice to see the training scripts. Is there any plan to upload them?
Comments
Thank you very much for your interest. We mostly modified the training architecture of FastChat for the currently released parts of MentaLLaMA (mostly SFT), so I'll point you to their repo for now. But we are working towards further enhancing MentaLLaMA with other techniques such as RLHF, and we will release that code as well. Stay tuned!
Thanks for the reply. Very helpful, and looking forward to what is to come.
Actually, one quick question. To perform the SFT for MentaLLaMA with the instruction training data for, say, the DR task: do you treat this as a standard auto-regressive objective and combine the "query" and the "gpt-3.5-turbo" response? I'm hoping to experiment with training some smaller models/architectures myself.
Yes, this is the standard instruction tuning paradigm. I suggest you build on foundation models that have already been through SFT/RLHF (e.g. LLaMA2-chat, Vicuna), as they will make your training easier, especially with small training datasets.
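For anyone reading along, here is a minimal sketch of that setup, not the authors' released code: the query and the gpt-3.5-turbo response are concatenated into one sequence and trained with a standard causal-LM loss, optionally masking the prompt tokens so the loss is only computed on the response (FastChat's trainer does something along these lines). The base model name and prompt template below are placeholder assumptions, not the exact ones used for MentaLLaMA.

```python
# Hedged SFT sketch: concatenate instruction/query + response and apply the
# usual auto-regressive objective. Model name and prompt template are assumed.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "meta-llama/Llama-2-7b-chat-hf"  # assumed base; Vicuna etc. would also work
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Hypothetical DR-style example; the real instruction data comes from the IMHI dataset.
query = "Consider this post: ... Question: Does the poster suffer from depression?"
response = "Reasoning: ... Answer: Yes."  # gpt-3.5-turbo explanation from the training data

prompt = f"### Instruction:\n{query}\n\n### Response:\n"
full_text = prompt + response + tokenizer.eos_token

enc = tokenizer(full_text, return_tensors="pt")
labels = enc["input_ids"].clone()

# Optionally mask the prompt tokens so the loss covers only the response;
# dropping these two lines gives a plain LM loss over the whole sequence.
prompt_len = tokenizer(prompt, return_tensors="pt")["input_ids"].shape[1]
labels[:, :prompt_len] = -100

out = model(input_ids=enc["input_ids"],
            attention_mask=enc["attention_mask"],
            labels=labels)
out.loss.backward()  # an optimizer step would follow in a real training loop
```

In practice you would wrap this in a Trainer or a standard training loop over the whole instruction dataset; the point is just that query and response are packed into a single sequence for the causal-LM objective.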
Thought so, was just double checking. Thanks for the prompt reply! I'll keep you posted if I develop anything that could be brought into this repo.
Thanks! Any contributions will be appreciated!