Bug in getModelNameForTiktoken method? #1872

gal-checksum · 2023-07-05T19:19:26Z

I'm using the function calculateMaxTokens to understand how many tokens I have left for a given prompt.

This function is not working properly, and upon digging further I traced the issue to getModelNameForTiktoken: an extra "-" is appended to the prefix in every startsWith check.

What is the reason for having a trailing dash for each model?

It creates a problem: when I pass gpt-3.5-turbo-16k as the model, it doesn't match gpt-3.5-turbo-16k- because of the extra "-", but it does match gpt-3.5-turbo-, so the function returns the wrong model name (gpt-3.5-turbo), and thus the wrong model.

Happy to create a PR to fix, but I'm unsure if I can just remove the "-".

const getModelNameForTiktoken = (modelName) => {
    if (modelName.startsWith("gpt-3.5-turbo-16k-")) {
        return "gpt-3.5-turbo-16k";
    }
    if (modelName.startsWith("gpt-3.5-turbo-")) {
        return "gpt-3.5-turbo";
    }
    if (modelName.startsWith("gpt-4-32k-")) {
        return "gpt-4-32k";
    }
    if (modelName.startsWith("gpt-4-")) {
        return "gpt-4";
    }
    return modelName;
};
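
A minimal sketch of one possible fix, assuming the trailing dash is only there to match suffixed variants such as gpt-3.5-turbo-0613 (I haven't confirmed that intent, and this is not necessarily what the library should ship). It accepts either the exact base name or the base name followed by "-", and checks longer base names first. BASE_MODELS and getModelNameForTiktokenSketch are hypothetical names used for illustration only:

```javascript
// Hypothetical sketch, not the library's implementation.
// Base names are ordered so that the more specific name is tested
// before the shorter name it extends.
const BASE_MODELS = [
    "gpt-3.5-turbo-16k",
    "gpt-3.5-turbo",
    "gpt-4-32k",
    "gpt-4",
];

const getModelNameForTiktokenSketch = (modelName) => {
    for (const base of BASE_MODELS) {
        // Match the bare base name, or the base name plus a "-suffix".
        if (modelName === base || modelName.startsWith(`${base}-`)) {
            return base;
        }
    }
    // Unknown names are passed through unchanged, as in the original.
    return modelName;
};

console.log(getModelNameForTiktokenSketch("gpt-3.5-turbo-16k"));
console.log(getModelNameForTiktokenSketch("gpt-3.5-turbo-0613"));
```

With this ordering, gpt-3.5-turbo-16k resolves to itself instead of falling through to the gpt-3.5-turbo- check, while suffixed names still map to their base name.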

jacoblee93 · 2023-07-12T06:18:26Z

Should be fixed by #1931 and released in 0.0.107!

chungyau97 mentioned this issue Jul 10, 2023

[FEATURE] Is there any chain for sumarizing large context ? FlowiseAI/Flowise#507

Open

jacoblee93 mentioned this issue Jul 12, 2023

fix: correct token count for larger context windows #1931

Merged

jacoblee93 closed this as completed Jul 12, 2023