Python client: Extra slash in base_uri leads to failures in chat endpoint #1823

kcarnold · 2024-04-27T13:45:57Z

System Info

If you create a Client with a server URI ending in a slash, the generation endpoint works fine but the chat endpoint fails with an unhelpful error. In the Python client it's a JSONDecodeError, because the server returns a 404 with an empty body and an empty string isn't valid JSON (that's a separate bug, though).
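To illustrate the separate bug mentioned above, here is a minimal sketch of checking the status code before decoding, so that an empty 404 body surfaces as a readable error rather than a JSONDecodeError. The helper name `decode_response` is hypothetical and is not part of the text_generation client; it takes the status code and body text directly so it runs without a server.

```python
import json


def decode_response(status_code: int, body: str):
    """Hypothetical helper: decode a JSON body, but report empty error bodies clearly."""
    # An empty body on an error status cannot be parsed as JSON,
    # so raise a descriptive error instead of letting json.loads fail.
    if status_code >= 400 and not body.strip():
        raise RuntimeError(f"server returned HTTP {status_code} with an empty body")
    return json.loads(body)
```

With this check, a 404 from a mistyped route would mention the status code, which points more directly at a URL problem.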

(I'm also confused about the intended relationship between the Python client library in this repo and the InferenceClient one in huggingface_hub. The docs refer to both, in different places.)

Information

Docker
The CLI directly

Tasks

An officially supported command
My own modifications

Reproduction

from text_generation import Client
client = Client('http://localhost:3000/')
client.chat(messages=[
    {
        "role": "system",
        "content": "You answer in bulleted lists."
    },
    {
        "role": "user",
        "content": "Why is the sky blue?"
    }
], max_tokens=100)

Expected behavior

Same as if the trailing / in the Client URI is missing.

This could be as simple as self.base_url = base_url.rstrip('/') on:

text-generation-inference/clients/python/text_generation/client.py

Line 62 in e9f03f8

self.base_url = base_url
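A minimal sketch of why the suggested rstrip('/') would help, assuming the client builds the chat URL by appending a path such as /v1/chat/completions to base_url (that route suffix and the helper names here are assumptions for illustration, not taken from client.py):

```python
def normalize_base_url(base_url: str) -> str:
    """Drop trailing slashes so an appended path starts with exactly one '/'."""
    return base_url.rstrip("/")


def chat_url(base_url: str) -> str:
    # The route suffix is an assumed example, not confirmed from the client source.
    return f"{normalize_base_url(base_url)}/v1/chat/completions"


# Without normalization, a trailing slash produces a double slash in the path.
naive = "http://localhost:3000/" + "/v1/chat/completions"
```

Both 'http://localhost:3000' and 'http://localhost:3000/' then map to the same chat URL, while the naive concatenation contains '//v1', which a server may treat as a different route.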
