Add example to fine-tune StarCoder for chat-based applications #17

lewtun · 2023-05-08T09:13:21Z

TODO

Add a "news" entry to the main README

lvwerra

Looks good overall. Left only a few minor comments.

I am mostly wondering if we could have used the SFTTrainer to reduce a bit the amount of code needed for train.py or run a simple preprocessing script that pushes the formated dataset to the hub for the script to reduce a bit the amount of code. Don't think we need to change anything now but I think we should provide a bit more utility to be able to make such examples more concise.

lvwerra · 2023-05-09T09:11:34Z

chat/README.md

+# Fine-tuning StarCoder for chat-based applications
+
+This is an educational example to fine-tune `StarCoderBase` on a corpus of multi-turn dialogues and thus create a coding assistant that is chatty and helpful. Check out our [blog post](ADD LINK) for more details.
+


maybe some examples of what the model can do after tuning would be cool to show here.

chat/README.md

lewtun · 2023-05-09T12:33:24Z

Looks good overall. Left only a few minor comments.

I am mostly wondering if we could have used the SFTTrainer to reduce a bit the amount of code needed for train.py or run a simple preprocessing script that pushes the formated dataset to the hub for the script to reduce a bit the amount of code. Don't think we need to change anything now but I think we should provide a bit more utility to be able to make such examples more concise.

Yes I agree this would be nice to do & I'll fix this in a follow-up PR (I ran out of time to test everything with the new API)

lewtun added 5 commits May 6, 2023 12:46

Add StarChat files

0a0e065

Clean up

07a3418

Fix readme

cc8785c

Tweak

bd5e167

Clean up

18b32a3

lewtun requested review from lvwerra and ArmelRandy May 8, 2023 09:13

lewtun added 2 commits May 8, 2023 09:20

Final polish

bde49ab

Fix steps

daf5ad3

lvwerra approved these changes May 9, 2023

View reviewed changes

philschmid approved these changes May 9, 2023

View reviewed changes

lewtun added 3 commits May 9, 2023 13:48

Final tweaks

3c6cc84

Delete dead code

f660186

Fix typo

83b3b78

lewtun merged commit 54ba27e into main May 9, 2023

lewtun deleted the starchat branch May 9, 2023 16:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add example to fine-tune StarCoder for chat-based applications #17

Add example to fine-tune StarCoder for chat-based applications #17

Uh oh!

lewtun commented May 8, 2023 •

edited

Loading

Uh oh!

lvwerra left a comment

Uh oh!

lvwerra May 9, 2023

Uh oh!

Uh oh!

lewtun commented May 9, 2023

Uh oh!

Uh oh!

		# Fine-tuning StarCoder for chat-based applications

		This is an educational example to fine-tune `StarCoderBase` on a corpus of multi-turn dialogues and thus create a coding assistant that is chatty and helpful. Check out our [blog post](ADD LINK) for more details.

Add example to fine-tune StarCoder for chat-based applications #17

Add example to fine-tune StarCoder for chat-based applications #17

Uh oh!

Conversation

lewtun commented May 8, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

TODO

Uh oh!

lvwerra left a comment

Choose a reason for hiding this comment

Uh oh!

lvwerra May 9, 2023

Choose a reason for hiding this comment

Uh oh!

Uh oh!

lewtun commented May 9, 2023

Uh oh!

Uh oh!

lewtun commented May 8, 2023 •

edited

Loading