
Token Usage Tracking #85

Merged · 33 commits into run-llama:main · Dec 18, 2022

Conversation

@teoh (Collaborator) commented on Dec 7, 2022

What is this?

From #56. This PR adds support for counting tokens used during calls to the LLM. This is done via the decorator llm_token_counter() that lives in gpt_index/utils.py.

At the moment, this decorator can only be used on class instance methods with a _llm_predictor attribute.

e.g.

    class GPTTreeIndexBuilder:
        ...
        @llm_token_counter("build_from_text")
        def build_from_text(self, documents: Sequence[BaseDocument]) -> IndexGraph:
            ...

If you run build_from_text(), it will print the output in the form below:

    [build_from_text] Total token usage: <some-number> tokens
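
For context, here is a minimal sketch of how such a decorator could work. It assumes the predictor object keeps a running `_total_tokens_used` counter (mentioned in the TODOs below); the actual implementation in gpt_index/utils.py may differ.

    from functools import wraps
    from typing import Any, Callable

    def llm_token_counter(method_name: str) -> Callable:
        """Print the net token usage of the decorated instance method.

        Sketch only: assumes `self._llm_predictor._total_tokens_used` exists
        and is incremented by the underlying LLM calls.
        """
        def wrap(f: Callable) -> Callable:
            @wraps(f)
            def wrapped(self: Any, *args: Any, **kwargs: Any) -> Any:
                # Snapshot the running total before the call ...
                start_count = self._llm_predictor._total_tokens_used
                result = f(self, *args, **kwargs)
                # ... and report the delta afterwards.
                used = self._llm_predictor._total_tokens_used - start_count
                print(f"[{method_name}] Total token usage: {used} tokens")
                return result
            return wrapped
        return wrap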

Why do we need this?

Calls to LLMs such as GPT-3 cost money. For example, per OpenAI's pricing, the Davinci endpoint costs $0.02 per 1,000 tokens.

Since gpt_index makes multiple LLM calls when building the index, it's handy to know how many tokens we're going through.
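
As a back-of-the-envelope illustration (the token count below is made up), the Davinci rate translates to dollars like this:

    # Rough cost estimate at Davinci's $0.02 per 1,000 tokens.
    DAVINCI_COST_PER_1K_TOKENS = 0.02

    tokens_used = 25_000  # e.g. the total reported by llm_token_counter
    cost = tokens_used / 1000 * DAVINCI_COST_PER_1K_TOKENS
    print(f"{tokens_used} tokens -> ${cost:.2f}")  # 25000 tokens -> $0.50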

Remaining TODOs for this PR

  • add tests specific to token tracking. We may have to patch the OpenAI object in chain_wrapper.py; we can't mock the whole LLMPredictor object, since the tests need the real _total_tokens_used instance attribute to do its thing (see the patching sketch after this list)
  • add support for remaining index/query classes that call the LLM
  • test with openai billing and usage
  • consider adding this change on the abstract class level so that we're not repeating the same code everywhere, or turn it into a decorator (this may be helpful).
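
For the patching idea in the first TODO, something along these lines might work. This is a hypothetical sketch, not the PR's actual tests: the patch target assumes the pre-1.0 `openai.Completion.create` API, and the canned usage numbers are invented.

    # Hypothetical: mock only the network call, so LLMPredictor's real
    # token-accounting code still runs and updates _total_tokens_used.
    from unittest.mock import patch

    FAKE_RESPONSE = {
        "choices": [{"text": "mocked completion"}],
        "usage": {"prompt_tokens": 5, "completion_tokens": 3, "total_tokens": 8},
    }

    with patch("openai.Completion.create", return_value=FAKE_RESPONSE):
        # Build an index or call a decorated method here; every LLM call
        # now returns FAKE_RESPONSE, and token counts accumulate for real.
        ...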

Other comments

Other implementations I considered

  • add a field to the LLM prediction response: this seemed brittle; today there are three return values, but tomorrow there might be four
  • make a separate class to count this: seemed unnecessary

We might also miss token counts if the LLM is called somewhere that isn't wrapped by the token-counting start/end logic.

For the future

  • estimate cost before running indexing or querying
  • actual dollar cost

Known issues:

Sometimes the token count is off by a few. See this issue for an example: openai/openai-python#150
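
One way to observe the discrepancy is to compare a local tokenizer count against what the API reports in its `usage` field. The snippet below is illustrative only; tiktoken and the model name are assumptions, not part of this PR:

    # Compare a local token count against the API-reported usage.
    import tiktoken

    enc = tiktoken.encoding_for_model("text-davinci-003")
    prompt = "Summarize the following document: ..."
    print(f"local prompt tokens: {len(enc.encode(prompt))}")
    # Compare against response["usage"]["prompt_tokens"] from the API call;
    # the two can disagree by a few tokens, as in openai/openai-python#150.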

@teoh changed the title from "[Work in progress] Token Usage Tracking" to "Token Usage Tracking" on Dec 12, 2022
@jerryjliu (Collaborator) left a comment:

thanks for doing this! a few comments/questions

Inline review threads (all resolved):

  • gpt_index/indices/base.py
  • gpt_index/utils.py (two threads, one outdated)
  • tests/indices/embedding/test_base.py (outdated)
  • tests/indices/keyword_table/test_base.py
@jerryjliu merged commit 9b3c262 into run-llama:main on Dec 18, 2022
viveksilimkhan1 pushed a commit to viveksilimkhan1/llama_index referencing this pull request on Oct 30, 2023:

…pt-3.5-turbo` (run-llama#85)

* update default recommended openai model from text-davinci-003 to gpt-3.5-turbo
* fix unintended update in models list in README