
[Startup Plan]: Failed to launch GPU inference #34

Open
Matthieu-Tinycoaching opened this issue Jun 9, 2021 · 3 comments

Comments

@Matthieu-Tinycoaching
Hi community,

I have subscribed to the 7-day free trial of the Startup Plan, and I would like to test the GPU inference API on this model: https://huggingface.co/Matthieu/stsb-xlm-r-multilingual-custom

However, when running the code below:

import json
import requests

API_URL = "https://api-inference.huggingface.co/models/Matthieu/stsb-xlm-r-multilingual-custom"
headers = {"Authorization": "Bearer API_ORG_TOKEN"}  # replace with your organization token

def query(payload):
    # POST the JSON-encoded payload and decode the JSON response
    response = requests.post(API_URL, headers=headers, data=json.dumps(payload))
    return response.json()

payload1 = {
    "inputs": "Navigateur Web : Ce logiciel permet d'accéder à des pages web depuis votre ordinateur. Il en existe plusieurs téléchargeables gratuitement comme Google Chrome ou Mozilla. Certains sont même déjà installés comme Safari sur Mac OS et Edge sur Microsoft.",
    "options": {"use_cache": False, "use_gpu": True},
}

sentence_embeddings1 = query(payload1)
print(sentence_embeddings1)

I got the following error: {'error': 'Model Matthieu/stsb-xlm-r-multilingual-custom is currently loading', 'estimated_time': 44.490336920000004}

Do I have to wait some time until the model is loaded for GPU inference?

Thanks!
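One way to handle the "currently loading" response is to poll until the model reports ready, sleeping roughly as long as the `estimated_time` field suggests. A minimal sketch — the helper name `query_with_retry` and the retry policy are assumptions for illustration, not part of the Inference API:

```python
import time

def query_with_retry(query_fn, payload, max_retries=5, max_wait=120.0):
    """Call query_fn until the model stops reporting that it is loading.

    While a model is loading, the Inference API answers with a dict like
    {'error': '... is currently loading', 'estimated_time': 44.49}; this
    helper sleeps roughly that long (capped at max_wait) and retries.
    """
    for _ in range(max_retries):
        result = query_fn(payload)
        if isinstance(result, dict) and "estimated_time" in result:
            time.sleep(min(result["estimated_time"], max_wait))
            continue
        return result
    raise TimeoutError("model did not load within the retry budget")
```

Per the Inference API docs, the `options` dict also accepts `"wait_for_model": True`, which makes the server block until the model is loaded instead of returning the loading error.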

@LysandreJik
Member

Maybe of interest to @Narsil

@Narsil
Contributor

Narsil commented Jun 9, 2021

Hi @Matthieu-Tinycoaching, this is linked to #26.

Community images do not implement:

  • private models
  • GPU inference
  • acceleration

So what you are seeing is normal and expected.
If you don't mind, let's keep the discussion over there, since all three are correlated.

@Matthieu-Tinycoaching
Author

Matthieu-Tinycoaching commented Jun 9, 2021

Hi @Narsil thanks for the feedback.

However, I don't understand: how can I then test the accelerated inference (CPU + GPU) API on my custom public model?

What is actually testable on the accelerated inference API, and what should I expect to get from the Startup Plan free trial?

@LysandreJik LysandreJik transferred this issue from huggingface/huggingface_hub Mar 16, 2022
@osanseviero osanseviero transferred this issue from huggingface/hub-docs Mar 17, 2022