Merged
@@ -61,7 +61,7 @@ curl -s \
-H "Authorization: Bearer <IAM API key>" \
-H "Content-Type: application/json" \
--request POST \
- --url "https://<Deployment UUID>.ifr.fr-par.scw.cloud/v1/chat/completions" \
+ --url "https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/chat/completions" \
--data '{"model":"llama-3-70b-instruct", "messages":[{"role": "user","content": "Sing me a song about Xavier Niel"}], "max_tokens": 500, "top_p": 1, "temperature": 0.7, "stream": false}'
```

@@ -65,7 +65,7 @@ curl -s \
-H "Authorization: Bearer <IAM API key>" \
-H "Content-Type: application/json" \
--request POST \
- --url "https://<Deployment UUID>.ifr.fr-par.scw.cloud/v1/chat/completions" \
+ --url "https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/chat/completions" \
--data '{"model":"llama-3-8b-instruct", "messages":[{"role": "user","content": "There is a llama in my garden, what should I do?"}], "max_tokens": 500, "top_p": 1, "temperature": 0.7, "stream": false}'
```

@@ -60,7 +60,7 @@ curl -s \
-H "Authorization: Bearer <IAM API key>" \
-H "Content-Type: application/json" \
--request POST \
- --url "https://<Deployment UUID>.ifr.fr-par.scw.cloud/v1/chat/completions" \
+ --url "https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/chat/completions" \
--data '{"model":"llama-3.1-70b-instruct", "messages":[{"role": "user","content": "There is a llama in my garden, what should I do?"}], "max_tokens": 500, "temperature": 0.7, "stream": false}'
```

@@ -61,7 +61,7 @@ curl -s \
-H "Authorization: Bearer <IAM API key>" \
-H "Content-Type: application/json" \
--request POST \
- --url "https://<Deployment UUID>.ifr.fr-par.scw.cloud/v1/chat/completions" \
+ --url "https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/chat/completions" \
--data '{"model":"llama-3.1-8b-instruct", "messages":[{"role": "user","content": "There is a llama in my garden, what should I do?"}], "max_tokens": 500, "temperature": 0.7, "stream": false}'
```

@@ -54,7 +54,7 @@ curl -s \
-H "Authorization: Bearer <IAM API key>" \
-H "Content-Type: application/json" \
--request POST \
- --url "https://<Deployment UUID>.ifr.fr-par.scw.cloud/v1/chat/completions" \
+ --url "https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/chat/completions" \
--data '{"model":"mistral-7b-instruct-v0.3", "messages":[{"role": "user","content": "Explain Public Cloud in a nutshell."}], "top_p": 1, "temperature": 0.7, "stream": false}'
```

@@ -60,7 +60,7 @@ curl -s \
-H "Authorization: Bearer <IAM API key>" \
-H "Content-Type: application/json" \
--request POST \
- --url "https://<Deployment UUID>.ifr.fr-par.scw.cloud/v1/chat/completions" \
+ --url "https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/chat/completions" \
--data '{"model":"mistral-nemo-instruct-2407", "messages":[{"role": "user","content": "Sing me a song about Xavier Niel"}], "top_p": 1, "temperature": 0.35, "stream": false}'
```

@@ -56,7 +56,7 @@ curl -s \
-H "Authorization: Bearer <IAM API key>" \
-H "Content-Type: application/json" \
--request POST \
- --url "https://<Deployment UUID>.ifr.fr-par.scw.cloud/v1/chat/completions" \
+ --url "https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/chat/completions" \
--data '{"model":"mixtral-8x7b-instruct-v0.1", "messages":[{"role": "user","content": "Sing me a song about Scaleway"}], "max_tokens": 200, "top_p": 1, "temperature": 1, "stream": false}'
```

@@ -23,7 +23,7 @@ The Chat Completions API is designed for models fine-tuned for conversational ta

To invoke Scaleway Managed Inference's OpenAI-compatible Chat API, append the suffix `/v1/chat/completions` to your dedicated endpoint:
```
- https://<Deployment UUID>.ifr.fr-par.scw.cloud/v1/chat/completions
+ https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/chat/completions
```

### OpenAI Python client library
@@ -34,7 +34,7 @@ Use OpenAI's SDK how you normally would.
from openai import OpenAI

client = OpenAI(
- base_url='https://<Deployment UUID>.ifr.fr-par.scw.cloud/v1/',
+ base_url='https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/',
api_key='<IAM API key>'
)

@@ -94,11 +94,11 @@ The Embeddings API is designed to get a vector representation of an input that c

Use your dedicated endpoints as follows:
```
- https://<Deployment UUID>.ifr.fr-par.scw.cloud/v1/embeddings
+ https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/embeddings
```

```
- curl https://<Deployment UUID>.ifr.fr-par.scw.cloud/v1/embeddings \
+ curl https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/embeddings \
-H "Authorization: Bearer $SCW_API_KEY" \
-H "Content-Type: application/json" \
-d '{
@@ -114,7 +114,7 @@ curl https://<Deployment UUID>.ifr.fr-par.scw.cloud/v1/embeddings \
from openai import OpenAI

client = OpenAI(
- base_url='https://<Deployment UUID>.ifr.fr-par.scw.cloud/v1/',
+ base_url='https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/',
api_key='<IAM API key>'
)

@@ -143,11 +143,11 @@ The Models API returns the model(s) available for inferencing.
In the context of a Scaleway Managed Inference deployment, it returns the name of the current model being served.

```
- https://<Deployment UUID>.ifr.fr-par.scw.cloud/v1/models
+ https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/models
```

```
- curl https://<Deployment UUID>.ifr.fr-par.scw.cloud/v1/models \
+ curl https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/models \
-H "Authorization: Bearer $SCW_API_KEY" \
-H "Content-Type: application/json"
```
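
The three endpoints touched in this file (`chat/completions`, `embeddings`, `models`) all share the same host pattern after the change. As a minimal sketch of that pattern, the helper below assembles the URLs from a deployment UUID. The function name `build_endpoint` and its `region` parameter are hypothetical and exist only for illustration; the diff itself only shows the `fr-par` region.

```python
# Hypothetical helper, not part of any Scaleway SDK: it only mirrors the
# URL pattern visible in the diff (https://<uuid>.ifr.<region>.scaleway.com/v1/...).
def build_endpoint(deployment_uuid: str, path: str, region: str = "fr-par") -> str:
    """Return the OpenAI-compatible endpoint URL for a deployment."""
    host = f"{deployment_uuid}.ifr.{region}.scaleway.com"
    # Strip stray slashes so "models", "/models" and "models/" behave the same.
    return f"https://{host}/v1/{path.strip('/')}"


if __name__ == "__main__":
    # "<Deployment UUID>" is the same placeholder used in the documentation.
    for path in ("chat/completions", "embeddings", "models"):
        print(build_endpoint("<Deployment UUID>", path))
```

Keeping the host in one place means a future domain change would only need to touch a single line rather than every example.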
@@ -51,7 +51,7 @@ The Sentence-T5-XXL model is highly ranked on the [MTEB leaderboard](https://hug
To perform inference tasks with your Embedding model deployed at Scaleway, use the following command:

```bash
- curl https://<Deployment UUID>.ifr.fr-par.scw.cloud/v1/embeddings \
+ curl https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/embeddings \
-H "Authorization: Bearer <IAM API key>" \
-H "Content-Type: application/json" \
-d '{
@@ -54,7 +54,7 @@ curl -s \
-H "Authorization: Bearer <IAM API key>" \
-H "Content-Type: application/json" \
--request POST \
- --url "https://<Deployment UUID>.ifr.fr-par.scw.cloud/v1/chat/completions" \
+ --url "https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/chat/completions" \
--data '{"model":"wizardlm-70B-V1.0", "messages":[{"role": "user","content": "Say hello to Scaleway'\''s Inference"}], "max_tokens": 200, "top_p": 1, "temperature": 1, "stream": false}'
```
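
Every change in this diff is the same substitution: the `scw.cloud` host suffix becomes `scaleway.com`. For readers who have the old endpoint saved in scripts or configuration, the following is a minimal sketch of applying that rewrite to arbitrary text. The name `migrate_endpoints` is hypothetical, and matching any region label (not only `fr-par`) is an assumption; only `fr-par` appears in the diff.

```python
import re

# Matches the old host suffix seen in the diff: ".ifr.<region>.scw.cloud".
# Accepting any lowercase region label is an assumption made for this sketch.
OLD_HOST = re.compile(r"(\.ifr\.[a-z0-9-]+)\.scw\.cloud\b")


def migrate_endpoints(text: str) -> str:
    """Rewrite old Managed Inference hosts to the scaleway.com domain."""
    return OLD_HOST.sub(r"\1.scaleway.com", text)


if __name__ == "__main__":
    old = 'base_url="https://<Deployment UUID>.ifr.fr-par.scw.cloud/v1/"'
    print(migrate_endpoints(old))
```

URLs that already use the new domain are left untouched, so the function can safely be run more than once on the same file.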
