return usage for all providers - As OpenAI does #11

Closed
ishaan-jaff opened this issue Jul 28, 2023 · 5 comments

@ishaan-jaff
Contributor

ishaan-jaff commented Jul 28, 2023

OpenAI returns usage for requests, but other providers, e.g. Cohere, do not.

{
  "id": "chatcmpl-7hPIZUYBst5jQA4odnYgxgZ9iPZ51",
  "object": "chat.completion",
  "created": 1690579707,
  "model": "gpt-3.5-turbo-0613",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Hello! I'm an AI language model, so I don't have feelings, but I'm here to help you with any questions or conversations you have. How can I assist you today?"
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 13,
    "completion_tokens": 38,
    "total_tokens": 51
  }

Current Cohere return value from litellm - it's missing usage, which I need to calculate costs ($):

{'choices': [{'finish_reason': 'stop', 'index': 0, 'message': {'content': cohere.Generation {
	id: 3369b9a7-5755-4a4b-9da6-1ebab7c7924a
	prompt: Hello, how are you?
	text:  I am doing well, thank you. How can I help you today?
	likelihood: None
	finish_reason: None
	token_likelihoods: None
}, 'role': 'assistant'}}]}
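
Until providers report this themselves, a usage block could be backfilled by counting tokens on our side. A minimal sketch, assuming tiktoken's cl100k_base encoding as an approximation (a provider's real tokenizer will differ):

import tiktoken

# cl100k_base only approximates non-OpenAI tokenizers
encoding = tiktoken.get_encoding("cl100k_base")

def get_usage(prompt: str, completion: str) -> dict:
    # count prompt and completion tokens separately,
    # mirroring OpenAI's usage object
    prompt_tokens = len(encoding.encode(prompt))
    completion_tokens = len(encoding.encode(completion))
    return {
        "prompt_tokens": prompt_tokens,
        "completion_tokens": completion_tokens,
        "total_tokens": prompt_tokens + completion_tokens,
    }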
@krrishdholakia
Contributor

lol, exact conclusion I landed on in #10

@krrishdholakia
Contributor

@ishaan-jaff doesn't the Cohere response look broken -> it should have just returned the text. Seems a bit weird to nest the Cohere object in there.

@krrishdholakia
Contributor

For Anthropic / Replicate I just mapped the text output:

# needs: from anthropic import Anthropic, HUMAN_PROMPT, AI_PROMPT
elif model in anthropic_models:
    # anthropic defaults to os.environ.get("ANTHROPIC_API_KEY")
    # build a single Human:/Assistant: prompt string from the chat messages
    prompt = f"{HUMAN_PROMPT}"
    for message in messages:
        if "role" in message:
            if message["role"] == "user":
                prompt += f"{HUMAN_PROMPT}{message['content']}"
            else:
                prompt += f"{AI_PROMPT}{message['content']}"
        else:
            prompt += f"{HUMAN_PROMPT}{message['content']}"
    prompt += f"{AI_PROMPT}"
    anthropic = Anthropic()
    completion = anthropic.completions.create(
        model=model,
        prompt=prompt,
        max_tokens_to_sample=max_tokens
    )
    # reshape the Anthropic completion into the OpenAI response format
    new_response = {
        "choices": [
            {
                "finish_reason": "stop",
                "index": 0,
                "message": {
                    "content": completion.completion,
                    "role": "assistant"
                }
            }
        ]
    }
    print(f"new response: {new_response}")
    response = new_response
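
The same mapping could also carry a usage block. A sketch (hypothetical, not the shipped code), assuming the legacy Anthropic client's count_tokens helper:

# hypothetical extension of new_response above; count_tokens is the
# token counter exposed by the legacy Anthropic Python client
prompt_tokens = anthropic.count_tokens(prompt)
completion_tokens = anthropic.count_tokens(completion.completion)
new_response["usage"] = {
    "prompt_tokens": prompt_tokens,
    "completion_tokens": completion_tokens,
    "total_tokens": prompt_tokens + completion_tokens,
}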

@krrishdholakia
Contributor

Version 0.1.341 now returns token usage across all providers. Where possible it uses the provider's own tokenizer (e.g. Anthropic); otherwise it defaults to tiktoken.
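
A rough sketch of that fallback (illustrative names, not the actual litellm internals):

def token_count(model: str, text: str) -> int:
    # prefer the provider's own tokenizer when one exists
    if model in anthropic_models:
        from anthropic import Anthropic
        return Anthropic().count_tokens(text)
    # otherwise fall back to tiktoken
    import tiktoken
    try:
        encoding = tiktoken.encoding_for_model(model)
    except KeyError:
        # unknown model name: use a generic encoding
        encoding = tiktoken.get_encoding("cl100k_base")
    return len(encoding.encode(text))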

@krrishdholakia
Contributor

commit: 7575d7e

@krrishdholakia krrishdholakia self-assigned this Aug 5, 2023
@krrishdholakia krrishdholakia added the enhancement New feature or request label Aug 5, 2023