Include two different stages for building TGI image: #34

mfuntowicz · 2024-05-02T09:00:39Z

Default standalone image
Inference Endpoint specific image

- Default standalone image - Inference Endpoint specific image

tengomucho · 2024-05-02T09:12:48Z

text-generation-inference/docker/Dockerfile

+FROM tpu_base as final_image
+ENTRYPOINT text-generation-launcher
+CMD ["--json-output"]


this seems to revert 8756c8a, are you sure about it?

For the case of non inference-endpoints (IE) Docker image we want the image to provide their own argument, so this was correct and the changes we did were too focus on IE loosing flexibility in other deployment scenarios

Ok, in that case we might want to keep 2 docker images, one for TGI non inference-endpoints (IE), the other for IE. Or, if we are not interested in maintaining one of those, we should probably drop the image and the associated targets.

Yes but we are definitely interested in keeping both and potentially adding a third one for vertex if it makes sense later 👍🏻

tengomucho · 2024-05-02T09:19:00Z

Makefile

 	             --build-arg VERSION=$(VERSION) \
 	             --build-arg TGI_VERSION=$(TGI_VERSION) \
 				 -t huggingface/optimum-tpu:$(VERSION)-tgi .
 	docker tag huggingface/optimum-tpu:$(VERSION)-tgi huggingface/optimum-tpu:latest

+tpu-tgi-ie:


target tgi_docker_test should depend on this (and we should probably fix it at some point)

Does the tgi_docker_test needs to be run on an inference-endpoints specific docker image?

HuggingFaceDocBuilderDev · 2024-05-03T07:46:21Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

… latest

Include two different stages for building TGI image:

6e8431c

- Default standalone image - Inference Endpoint specific image

mfuntowicz requested a review from tengomucho May 2, 2024 09:00

mfuntowicz added 2 commits May 2, 2024 11:02

Include a make rule for tpu-tgi-ie

458dc37

Style

75fe850

tengomucho approved these changes May 2, 2024

View reviewed changes

tengomucho reviewed May 2, 2024

View reviewed changes

mfuntowicz added 3 commits May 2, 2024 15:33

Fix not using the latest layer correctly

7176ed6

Using the exact same as raw TGI

13e5960

Update documentation

ba19177

tengomucho approved these changes May 3, 2024

View reviewed changes

mfuntowicz added 2 commits May 3, 2024 09:56

(docs) Remove specific optimum-tpu version mention in the doc and use…

2e54ad2

… latest

(docs) Use $MODEL_ID to specify the model when starting TGI container

c699493

tengomucho approved these changes May 3, 2024

View reviewed changes

mfuntowicz merged commit c9937a9 into main May 3, 2024
2 checks passed

mfuntowicz deleted the make-tgi-image-more-generic branch May 3, 2024 08:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Include two different stages for building TGI image: #34

Include two different stages for building TGI image: #34

mfuntowicz commented May 2, 2024

tengomucho May 2, 2024

mfuntowicz May 2, 2024

tengomucho May 3, 2024

mfuntowicz May 3, 2024

tengomucho May 2, 2024

mfuntowicz May 2, 2024

HuggingFaceDocBuilderDev commented May 3, 2024

Include two different stages for building TGI image: #34

Include two different stages for building TGI image: #34

Conversation

mfuntowicz commented May 2, 2024

tengomucho May 2, 2024

Choose a reason for hiding this comment

mfuntowicz May 2, 2024

Choose a reason for hiding this comment

tengomucho May 3, 2024

Choose a reason for hiding this comment

mfuntowicz May 3, 2024

Choose a reason for hiding this comment

tengomucho May 2, 2024

Choose a reason for hiding this comment

mfuntowicz May 2, 2024

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented May 3, 2024