Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

404 for Multi-modal docs #1853

Open
1 of 4 tasks
RonanKMcGovern opened this issue May 3, 2024 · 0 comments
Open
1 of 4 tasks

404 for Multi-modal docs #1853

RonanKMcGovern opened this issue May 3, 2024 · 0 comments

Comments

@RonanKMcGovern
Copy link

System Info

NA

Information

  • Docker
  • The CLI directly

Tasks

  • An officially supported command
  • My own modifications

Reproduction

It is unclear how to query TGI for multi-modal models.

The links to LLaVA Next and IDEFICS2 give 404:

https://huggingface.co/docs/text-generation-inference/HuggingFaceM4/idefics-9b-instruct

https://huggingface.co/docs/text-generation-inference/llava-hf/llava-v1.6-mistral-7b-hf

@Narsil @VictorSanh

Expected behavior

When querying transformers, an <image> placeholder is used and the images are passed as a separate input argument to the prompt. This doesn't appear to be the case with TGI, which just expects a prompt input.

Something like this:

curl https://yd64jhjr8ylu54-8080.proxy.runpod.net/generate \
    -X POST \
    -d '{"inputs": "User: ![](http://images.cocodataset.org/val2017/000000219578.jpg)Tell me about this image<end_of_utterance>\\nAssistant:","parameters":{"max_new_tokens":20}}' \
    -H 'Content-Type: application/json'

works, although it fails when trying to do two images (the model ignores the second image):

curl https://yd64jhjr8ylu54-8080.proxy.runpod.net/generate \
    -X POST \
    -d '{"inputs": "User: ![](http://images.cocodataset.org/val2017/000000219578.jpg)Tell me about this image, and also about this second image: ![](http://images.cocodataset.org/val2017/000000039769.jpg)<end_of_utterance>\\nAssistant:","parameters":{"max_new_tokens":50}}' \
    -H 'Content-Type: application/json'
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant