
Edits from finetuning #459

Merged 12 commits into main on Jan 12, 2024
Conversation

granawkins (Member):

Fix the fine-tuning output format, add a way for fine-tune models to exclude the system_prompt, and patch up some issues I found.

Pull Request Checklist

  • Documentation has been updated, or this change doesn't require that

@@ -170,6 +175,10 @@ class Model:
     "text-embedding-ada-002": Model(
         "text-embedding-ada-002", 8191, 0.0001, 0, embedding_model=True
     ),
+    # Fine-tuned on Jan-6 2024 with `sampler-one-hundred-v1.jsonl` data
+    "ft:gpt-3.5-turbo-1106:abante::8dsQMc4F": Model(
jakethekoenig (Member), review comment on this diff:
Unfortunately only Abante AI org API keys will be able to use this model, so we shouldn't add it to the list.

@PCSwingle (Member) left a comment:

I'm not sure this makes much sense; like @jakethekoenig mentioned, the fine-tuned models can only be used by us. How about we just make include_system_prompt a config option?

@granawkins (Member, Author):

I get what you mean about the fine-tuned 3.5; makes sense, I'll remove that one for now.

I do think it's good to build in support for it though. I was planning to do a lil video about the end-to-end process of making our fine-tuned 3.5, with the thought that any of our users could do the same thing with their data.

Maybe known_models should try to match a base model if it's not immediately recognized? Just like I've done with the encoding function.

@jakethekoenig (Member):

> I do think it's good to build in support for it though. I was planning to do a lil video about the end-to-end process of making our fine-tuned 3.5, with the thought that any of our users could do the same thing with their data.

I like this idea. Back when I was doing this, I thought we could eventually integrate it so that you could run a /finetune command: it would run on your own git repo, generate fine-tuning data, start a fine-tuning job, and automatically use the resulting model when done. I think that approach to making training data didn't make much sense, but the idea of eventually making /finetune a command, or more simply distributing .jsonl files and documenting how to use them, does make sense given that fine-tuning is relatively cheap compared to inference.

> Maybe known_models should try to match a base model if it's not immediately recognized? Just like I've done with the encoding function.

I like this idea. Mentat should know the costs to use a fine-tuned gpt-3.5.

@granawkins (Member, Author):

Added a wrapper class around known_models to handle fine-tuned models.

Removed the new requires_system_prompt stuff; if using a fine-tuned model that doesn't need a system prompt, users can use the existing --no-parser-prompt arg:

mentat -a --model ft:gpt-3.5-turbo-1106:abante::8dsQMc4F --no-parser-prompt
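The base-model fallback idea discussed above could be sketched roughly like this. This is a minimal sketch, not the actual Mentat implementation: `lookup_model`, the `Model` fields, and the pricing figures are all assumptions for illustration.

```python
# Hypothetical base-model fallback for fine-tuned model names.
# OpenAI fine-tune names look like "ft:<base>:<org>::<id>", so an
# unknown name can fall back to the base model's metadata
# (context size, per-token costs).
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class Model:
    name: str
    context_size: int
    input_cost: float   # assumed units: $ per 1K tokens
    output_cost: float


known_models: Dict[str, Model] = {
    "gpt-3.5-turbo-1106": Model("gpt-3.5-turbo-1106", 16385, 0.001, 0.002),
}


def lookup_model(name: str) -> Optional[Model]:
    if name in known_models:
        return known_models[name]
    if name.startswith("ft:"):
        # "ft:gpt-3.5-turbo-1106:abante::8dsQMc4F" -> "gpt-3.5-turbo-1106"
        base = name.split(":")[1]
        return known_models.get(base)
    return None
```

With this, `lookup_model("ft:gpt-3.5-turbo-1106:abante::8dsQMc4F")` resolves to the gpt-3.5-turbo-1106 entry, so costs and context size are known for any fine-tune of a known base model.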

mentat/config.py Outdated
@@ -36,17 +36,19 @@ class Config:
# Model specific settings
     model: str = attr.field(
         default="gpt-4-1106-preview",
-        metadata={"auto_completions": list(known_models.keys())},
+        metadata={"auto_completions": list(known_models.asdict().keys())},
Member:

Do we have to do asdict? Isn't it technically a dict since it extends Dict?

@granawkins (Member Author):
tbh I spent an hour trying to figure out how to have an overridden keys method return a type that my linter was happy with. Tried typing.KeysView, tried returning self.model.keys() directly; it wouldn't take.

attrs also has an asdict method which we use here and there so it seemed ok.

EDIT: nevermind I figured it out.
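For reference, the Dict-subclass approach raised above can be sketched as follows. The `KnownModels` class and its contents here are hypothetical, not the actual Mentat code; the point is that a class extending `Dict` inherits a correctly typed `keys()`, so no `asdict()` shim is needed, and `__missing__` gives the base-model fallback for free.

```python
# Hypothetical Dict subclass: keys()/items() come from dict itself,
# so list(known_models.keys()) type-checks with no asdict() helper.
from typing import Dict


class KnownModels(Dict[str, str]):
    def __missing__(self, key: str) -> str:
        # Fall back to the base model for "ft:<base>:<org>::<id>" names.
        if key.startswith("ft:"):
            base = key.split(":")[1]
            if base in self:
                return self[base]
        raise KeyError(key)


known_models = KnownModels({"gpt-3.5-turbo-1106": "16k context"})
```

Here `list(known_models.keys())` still only lists the explicit entries (so auto-completions stay clean), while indexing with a fine-tuned name transparently resolves to the base model.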

@PCSwingle (Member) left a comment:

LGTM

@granawkins granawkins merged commit 0214b41 into main Jan 12, 2024
16 checks passed