
Update response type for /v1/chat/completions and /v1/completions #1747

Merged — 1 commit merged into main on Apr 16, 2024

Conversation

@Wauplin (Contributor) commented Apr 15, 2024

`/v1/chat/completions` and `/v1/completions` have different output types depending on the `stream` parameter. This PR aims at fixing the inconsistency in the auto-generated [openapi.json](https://huggingface.github.io/text-generation-inference/openapi.json) specs.

cc @OlivierDehaene @drbh I reused what had been done for the `/` endpoint but haven't tested anything myself. Could you confirm this is the correct way of handling things?

Also, should I update the openapi.json file manually? If yes, how can I do it?
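For context, the two response shapes a client sees from `/v1/chat/completions` can be sketched with a minimal parser. This is a hedged illustration assuming the OpenAI-compatible schemas these endpoints follow (a single JSON object when `stream=false`, server-sent `data:` chunks when `stream=true`); `parse_chat_response` is a hypothetical helper for illustration, not part of this PR or of TGI:

```python
import json


def parse_chat_response(raw: str, stream: bool):
    """Parse the body of a /v1/chat/completions response.

    Non-streaming: one JSON object (ChatCompletion-like, with
    `choices[].message`). Streaming: server-sent events, one
    "data: {...}" line per chunk (`choices[].delta`), terminated
    by "data: [DONE]".
    """
    if not stream:
        return json.loads(raw)

    chunks = []
    for line in raw.splitlines():
        line = line.strip()
        if not line.startswith("data:"):
            continue  # skip blank lines and SSE comments
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break
        chunks.append(json.loads(payload))
    return chunks
```

Because the two shapes differ like this, a single declared response type in `openapi.json` cannot describe both, which is the inconsistency the PR addresses.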
@OlivierDehaene (Member) left a comment

Thanks! I will update the JSON myself afterwards.

@drbh (Collaborator) left a comment

thank you both!

@Wauplin (Contributor, Author) commented Apr 16, 2024

Thanks for the reviews! Is it safe to merge? CI is failing with:

```
ERROR integration-tests/models/test_flash_awq.py::test_flash_llama_awq - RuntimeError: Health check failed
ERROR integration-tests/models/test_flash_awq.py::test_flash_llama_awq_all_params - RuntimeError: Health check failed
ERROR integration-tests/models/test_flash_awq.py::test_flash_llama_awq_load - RuntimeError: Health check failed
```

@Wauplin Wauplin merged commit 00f3653 into main Apr 16, 2024
6 of 7 checks passed
@Wauplin Wauplin deleted the Wauplin-patch-1 branch April 16, 2024 17:26
Nilabhra pushed a commit to TII-AI-Research-Center/text-generation-inference that referenced this pull request May 14, 2024
…huggingface#1747)

kdamaszk pushed a commit to kdamaszk/tgi-gaudi that referenced this pull request May 27, 2024
…huggingface#1747)

kdamaszk pushed a commit to kdamaszk/tgi-gaudi that referenced this pull request Jun 3, 2024
…huggingface#1747)
