increase default max_tokens for older non-chat OpenAI models so NER/spancat works #236
Conversation
This seems reasonable overall. Some notes:
- You mention setting max_tokens to 100, but the actual values are >2k. Why the divergence between the description and the code?
- Have you run the external tests with this change?
- The docs on spacy.io should be updated as well.
@honnibal mentioned we should increase the default a little higher if possible. The OpenAI docs on managing tokens (https://platform.openai.com/docs/guides/gpt/managing-tokens) say that the prompt + response tokens cannot exceed the model's context width, so I scaled the default max_tokens down for each model by its context window. This should allow the NER/SpanCat tasks to work in most cases.
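The scaling idea described above can be sketched as follows. This is an illustrative reconstruction, not the PR's actual code: the context-window values come from OpenAI's model documentation for the legacy completion models, and the fraction reserved for the prompt is an assumption.

```python
# Hypothetical sketch: derive a default max_tokens for each legacy
# completion model from its context window, since prompt + response
# tokens must fit inside that window.

# Context window sizes for some legacy completion models (as listed
# in OpenAI's model docs; illustrative, may change over time).
CONTEXT_WINDOWS = {
    "text-davinci-003": 4097,
    "text-curie-001": 2049,
    "text-babbage-001": 2049,
    "text-ada-001": 2049,
}

def default_max_tokens(model: str, prompt_fraction: float = 0.5) -> int:
    """Reserve a fraction of the context window for the prompt and use
    the remainder as the default completion budget (sketch only)."""
    window = CONTEXT_WINDOWS.get(model, 2049)
    return int(window * (1 - prompt_fraction))

print(default_max_tokens("text-davinci-003"))  # 2048
```

With half the window reserved for the prompt, davinci-class models end up with a default above 2k, which matches the ">2k" values noted in the review.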
The external tests all run correctly here, but yes, the docs still need to be updated. I'll add a link to a PR shortly.
When we change the default settings like this for these models, don't we have to bump all the versions?
Hm... I'd say we don't have to, but we should mention it in the release notes.
Hmm. Then at the very least we should have this on
Fair, changed to
Added a docs PR. I think we can merge both the docs PR and this one?
It looks like the external tests are failing with an error about writing to a frozen dict...
The only remaining failing external test should be fixed with the new
New docs PR: explosion/spaCy#12961
Description
The default max_tokens for the LLM response of the old completions endpoint for OpenAI is 16 tokens. This often produces output that is too short for the NER/SpanCat tasks (and of course for longer tasks like summarization). Setting a higher default value at the API level is probably a good bet, but we should also update the docs here.
This is only required for the legacy models; the chat completion models set this to infinity by default.
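The legacy-versus-chat distinction above can be sketched like this. The model names and the 2048 fallback are illustrative assumptions, not values taken from this PR:

```python
# Illustrative sketch: only legacy completion models need an explicit
# max_tokens; chat models can omit it, which the API treats as
# "use up to the remaining context window".
from typing import Optional

LEGACY_MODELS = {"text-davinci-003", "text-curie-001", "text-babbage-001", "text-ada-001"}

def request_params(model: str, max_tokens: Optional[int] = None) -> dict:
    params = {"model": model}
    if max_tokens is not None:
        params["max_tokens"] = max_tokens
    elif model in LEGACY_MODELS:
        # The API default of 16 tokens truncates NER/SpanCat output,
        # so pick a larger default for legacy models (value illustrative).
        params["max_tokens"] = 2048
    # Chat models: leave max_tokens unset entirely.
    return params
```

A user who hits truncation anyway can still pass an explicit max_tokens to override the default.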
Corresponding documentation PR
explosion/spaCy#12961
Types of change
enhancement
Checklist
- I ran the tests in tests and usage_examples/tests, and all new and existing tests passed. This includes:
  - pytest ran with --external
  - pytest ran with --gpu