
[Bug]: Ollama chat tool calls are not parsed correctly #3333

Closed
jackmpcollins opened this issue Apr 27, 2024 · 4 comments · Fixed by #3469
Labels
bug Something isn't working

Comments

jackmpcollins (Contributor) commented Apr 27, 2024

What happened?

ollama_chat/llama2 returns a tool call where the function name and arguments appear together as a single JSON object in the arguments field, and name is incorrectly set to an empty string. This object should instead be parsed into the separate name and arguments fields.

ModelResponse(id='chatcmpl-ae59691c-0d45-4863-b014-d336e2bfbcfd', choices=[Choices(finish_reason='stop', index=0, message=Message(content=None, role='assistant', tool_calls=[ChatCompletionMessageToolCall(function=Function(arguments='{\n"name": "get_current_weather",\n"arguments": {\n"location": "San Francisco, CA",\n"unit": "celsius"\n}\n}', name=''), id='call_532322f5-f8b9-4243-8d9d-018ed629861a', type='function')]))], created=1714247542, model='ollama/llama2', object='chat.completion', system_fingerprint=None, usage=Usage(prompt_tokens=12, completion_tokens=42, total_tokens=54))
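
Until this is fixed, a caller can unwrap the mis-parsed object on their side. A rough sketch (my own hypothetical helper, not LiteLLM code), based on the response shape above:

import json

# Hypothetical workaround: if `name` is empty and `arguments` holds the full
# {"name": ..., "arguments": ...} object, re-parse it into separate fields.
def unwrap_tool_call(function):
    if function.name:
        # Already parsed correctly; nothing to unwrap.
        return function.name, function.arguments
    parsed = json.loads(function.arguments)
    if isinstance(parsed, dict) and "name" in parsed and "arguments" in parsed:
        return parsed["name"], json.dumps(parsed["arguments"])
    raise ValueError("Unrecognized tool call shape")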

Separately, when using ollama_chat/llama2 with stream=True, the tool call JSON appears in the content field rather than being parsed into the tool_calls field.

ModelResponse(id='chatcmpl-a60d7240-720c-408d-865e-9a9e3ae58d3e', choices=[StreamingChoices(finish_reason=None, index=0, delta=Delta(content='{', role='assistant', function_call=None, tool_calls=None), logprobs=None)], created=1714247544, model='llama2', object='chat.completion.chunk', system_fingerprint=None, usage=Usage())
ModelResponse(id='chatcmpl-a60d7240-720c-408d-865e-9a9e3ae58d3e', choices=[StreamingChoices(finish_reason=None, index=0, delta=Delta(content='\n', role=None, function_call=None, tool_calls=None), logprobs=None)], created=1714247544, model='llama2', object='chat.completion.chunk', system_fingerprint=None, usage=Usage())
ModelResponse(id='chatcmpl-a60d7240-720c-408d-865e-9a9e3ae58d3e', choices=[StreamingChoices(finish_reason=None, index=0, delta=Delta(content='"', role=None, function_call=None, tool_calls=None), logprobs=None)], created=1714247544, model='llama2', object='chat.completion.chunk', system_fingerprint=None, usage=Usage())
ModelResponse(id='chatcmpl-a60d7240-720c-408d-865e-9a9e3ae58d3e', choices=[StreamingChoices(finish_reason=None, index=0, delta=Delta(content='name', role=None, function_call=None, tool_calls=None), logprobs=None)], created=1714247544, model='llama2', object='chat.completion.chunk', system_fingerprint=None, usage=Usage())
ModelResponse(id='chatcmpl-a60d7240-720c-408d-865e-9a9e3ae58d3e', choices=[StreamingChoices(finish_reason=None, index=0, delta=Delta(content='":', role=None, function_call=None, tool_calls=None), logprobs=None)], created=1714247544, model='llama2', object='chat.completion.chunk', system_fingerprint=None, usage=Usage())
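
As a stopgap, the streamed content deltas can be concatenated and re-parsed client-side. A rough sketch (hypothetical helper, for illustration), assuming the whole stream is a single tool-call JSON object as in the chunks above:

import json

# Concatenate the streamed `content` deltas; if the result parses as a
# {"name": ..., "arguments": ...} object, treat it as a tool call.
def tool_call_from_stream(chunks):
    text = "".join(chunk.choices[0].delta.content or "" for chunk in chunks)
    try:
        parsed = json.loads(text)
    except json.JSONDecodeError:
        return None  # plain text response, not a tool call
    if isinstance(parsed, dict) and "name" in parsed and "arguments" in parsed:
        return parsed["name"], parsed["arguments"]
    return None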

Here's the streamed tool call response using gpt-3.5-turbo-1106, which is parsed correctly:

ModelResponse(id='chatcmpl-9IiNxc82AjQTSu3yBABzB4vsFZa8Q', choices=[StreamingChoices(finish_reason=None, index=0, delta=Delta(content=None, role='assistant', function_call=None, tool_calls=[ChatCompletionDeltaToolCall(id='call_5lHC4zXMKuvxDMOlp5MBASCP', function=Function(arguments='', name='get_current_weather'), type='function', index=0)]), logprobs=None)], created=1714247549, model='gpt-3.5-turbo-1106', object='chat.completion.chunk', system_fingerprint='fp_b953e4de39', usage=Usage())
ModelResponse(id='chatcmpl-9IiNxc82AjQTSu3yBABzB4vsFZa8Q', choices=[StreamingChoices(finish_reason=None, index=0, delta=Delta(content=None, role=None, function_call=None, tool_calls=[ChatCompletionDeltaToolCall(id=None, function=Function(arguments='{"lo', name=None), type=None, index=0)]), logprobs=None)], created=1714247549, model='gpt-3.5-turbo-1106', object='chat.completion.chunk', system_fingerprint='fp_b953e4de39', usage=Usage())
ModelResponse(id='chatcmpl-9IiNxc82AjQTSu3yBABzB4vsFZa8Q', choices=[StreamingChoices(finish_reason=None, index=0, delta=Delta(content=None, role=None, function_call=None, tool_calls=[ChatCompletionDeltaToolCall(id=None, function=Function(arguments='catio', name=None), type=None, index=0)]), logprobs=None)], created=1714247549, model='gpt-3.5-turbo-1106', object='chat.completion.chunk', system_fingerprint='fp_b953e4de39', usage=Usage())
ModelResponse(id='chatcmpl-9IiNxc82AjQTSu3yBABzB4vsFZa8Q', choices=[StreamingChoices(finish_reason=None, index=0, delta=Delta(content=None, role=None, function_call=None, tool_calls=[ChatCompletionDeltaToolCall(id=None, function=Function(arguments='n": "S', name=None), type=None, index=0)]), logprobs=None)], created=1714247549, model='gpt-3.5-turbo-1106', object='chat.completion.chunk', system_fingerprint='fp_b953e4de39', usage=Usage())
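
For reference, these OpenAI-style deltas reassemble by concatenating the arguments fragments per tool-call index. A small sketch of that accumulation (hypothetical helper, for illustration only):

# The first delta for each index carries the id and function name; later
# deltas append fragments to `arguments`.
def accumulate_tool_calls(chunks):
    calls = {}  # index -> {"id", "name", "arguments"}
    for chunk in chunks:
        for tc in chunk.choices[0].delta.tool_calls or []:
            call = calls.setdefault(tc.index, {"id": None, "name": None, "arguments": ""})
            call["id"] = call["id"] or tc.id
            call["name"] = call["name"] or tc.function.name
            call["arguments"] += tc.function.arguments or ""
    return [calls[i] for i in sorted(calls)]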

Fixing this would allow more features of https://github.com/jackmpcollins/magentic to be used with Ollama.
Related issue: jackmpcollins/magentic#194

Code to reproduce this:

import litellm

messages = [{"role": "user", "content": "What's the weather like in San Francisco, Tokyo, and Paris?"}]
tools = [
    {
        "type": "function",
        "function": {
            "name": "get_current_weather",
            "description": "Get the current weather in a given location",
            "parameters": {
                "type": "object",
                "properties": {
                    "location": {
                        "type": "string",
                        "description": "The city and state, e.g. San Francisco, CA",
                    },
                    "unit": {"type": "string", "enum": ["celsius", "fahrenheit"]},
                },
                "required": ["location"],
            },
        },
    }
]

# Non-streaming Ollama call: returns the mis-parsed nested tool-call object shown above
response = litellm.completion(
    model="ollama_chat/llama2",
    messages=messages,
    tools=tools,
    stream=False,
)
print(response)


print("\n---\n")
# Streaming Ollama call: the tool-call JSON arrives in the content deltas
response = litellm.completion(
    model="ollama_chat/llama2",
    messages=messages,
    tools=tools,
    stream=True,
)
for chunk in response:
    print(chunk)


print("\n---\n")
# Streaming OpenAI call for comparison: deltas arrive in tool_calls as expected
response = litellm.completion(
    model="gpt-3.5-turbo-1106",
    messages=messages,
    tools=tools,
    stream=True,
)
for chunk in response:
    print(chunk)

Relevant log output

No response

Twitter / LinkedIn details

@jackmpcollins / https://www.linkedin.com/in/jackmpcollins/

rick-github (Contributor)

I had the same issue but haven't gotten around to looking into it yet; it's possible that #1526 addresses this.

krrishdholakia (Contributor)

Merged the relevant PR in - should be live in the next litellm release - v1.35.34+

cc: @jackmpcollins @rick-github

jackmpcollins (Contributor, Author)

@krrishdholakia The stream=True response is still parsed incorrectly. The code in the description reproduces this.

Regular responses are now parsing correctly, thanks.

ChristianWeyer

> Merged the relevant PR in - should be live in the next litellm release - v1.35.34+
>
> cc: @jackmpcollins @rick-github

#2209 (comment)
