A model that picks the right sized model #32

Open
simonw opened this issue Jun 15, 2023 · 6 comments
Labels
enhancement New feature or request

Comments

@simonw
Owner

simonw commented Jun 15, 2023

Count tokens with tiktoken and switch to the 16k or 32k models if necessary.
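A minimal sketch of the selection logic: count the prompt's tokens (e.g. with `tiktoken.encoding_for_model("gpt-3.5-turbo").encode(prompt)`) and pick the smallest model whose context window fits. The model IDs, context limits, and `reserve` figure below are illustrative assumptions, not a definitive implementation:

```python
# Candidate models ordered smallest-first; limits are assumed values.
MODELS = [
    ("gpt-3.5-turbo", 4_096),
    ("gpt-3.5-turbo-16k", 16_384),
    ("gpt-4-32k", 32_768),
]

def pick_model(n_tokens: int, reserve: int = 512) -> str:
    """Return the first model whose context fits the prompt plus a
    reserved budget for the reply."""
    for name, limit in MODELS:
        if n_tokens + reserve <= limit:
            return name
    raise ValueError("prompt too long for any available model")
```

Ordering the list smallest-first means the cheapest model that fits always wins.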

@simonw simonw added the enhancement New feature or request label Jun 15, 2023
@simonw simonw added this to the 0.4 milestone Jun 15, 2023
@simonw
Owner Author

simonw commented Jun 15, 2023

This may be a template and not a model - perhaps `llm -t auto`

Not sure what the YAML would look like.

A model might be better though, since then you could combine a template with the `-m auto` option.

@simonw
Owner Author

simonw commented Jun 15, 2023

I think it's a special model called with `-m auto`

How should it handle some users not having GPT-4 32k access?

I think it should try anyway and error if they don't have the model - it would have errored anyway since they were over 32k tokens.

@benjamin-kirkbride
Sponsor Contributor

Also need to consider 3.5's 4k vs 16k. I'm guessing this is going to be a pattern that continues as well: models that are "the same" but differ in context length (and pricing).

I think there needs to be some concept of "flavors" of models, and in llm you should be able to select the base "flavor" you want and have the model be selected based on a number of other factors (including context length).
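One way to model "flavors" could be a family name that maps to variants sharing behaviour but differing in context length. Everything here - the variant names, limits, and the `resolve` helper - is a hypothetical sketch of the idea, not an existing `llm` API:

```python
from dataclasses import dataclass

@dataclass
class Variant:
    model_id: str
    context_tokens: int

# A "flavor" groups interchangeable variants, smallest context first.
FLAVORS = {
    "gpt-3.5-turbo": [
        Variant("gpt-3.5-turbo", 4_096),
        Variant("gpt-3.5-turbo-16k", 16_384),
    ],
}

def resolve(flavor: str, n_tokens: int) -> str:
    """Pick the cheapest variant of a flavor that fits the token count."""
    for v in FLAVORS[flavor]:
        if n_tokens <= v.context_tokens:
            return v.model_id
    raise ValueError(f"no {flavor} variant fits {n_tokens} tokens")
```

The user selects the flavor; pricing or other factors could become extra fields on `Variant` that the resolver weighs.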

@benjamin-kirkbride
Sponsor Contributor

Worth noting this is a problem that other tools are facing right now as well. I'm not aware of any consensus on how to handle it as of yet, but it's probably worth looking into.

@benjamin-kirkbride
Sponsor Contributor

This is relevant to the new `-c` flag as well: a conversation that fits in the context of one model may outgrow it, and ideally you can continue the conversation without interruption.

@simonw simonw removed this from the 0.5 milestone Jul 1, 2023
@simonw
Owner Author

simonw commented Jul 1, 2023

Dropped from the 0.5 milestone, it's not critical for that.

I'm actually thinking this might make more sense as a llm-auto plugin. It could be expanded to cover all kinds of other heuristics, not just the length of the context.
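Such a plugin's routing could be sketched as an ordered rule table, where each rule inspects the prompt and the first match names the model. The rules and model names below are illustrative assumptions (the length check uses a crude characters-divided-by-four token estimate rather than real tokenization):

```python
def route(prompt: str) -> str:
    # Each (predicate, model) rule is checked in order; first match wins.
    rules = [
        (lambda p: len(p) // 4 > 4_000, "gpt-3.5-turbo-16k"),   # crude token estimate
        (lambda p: "def " in p or "class " in p, "gpt-4"),       # looks like code
    ]
    for matches, model in rules:
        if matches(prompt):
            return model
    return "gpt-3.5-turbo"  # default for short, non-code prompts
```

New heuristics (pricing, user access, task type) would just be more rules in the table.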
