Commit a1cee6c

alekhyarao1218-wq and v-alingampally authored
Update DI update curl to latest deployment spec (#1170)
Co-authored-by: v-alingampally <v-alingampally@digitalocean.com>
1 parent 781eb48 commit a1cee6c

1 file changed

Lines changed: 9 additions & 2 deletions

specification/resources/dedicated_inferences/examples/curl/dedicated_inferences_update.yml

@@ -5,13 +5,20 @@ source: |-
  -H "Authorization: Bearer $DO_TOKEN" \
  -d '{
  "spec": {
+ "version": 1,
  "name": "renamed-dedicated-inference",
  "region": "atl1",
  "vpc": { "uuid": "997615ce-132d-4bae-9270-9ee21b395e5d" },
+ "enable_public_endpoint": true,
  "model_deployments": [{
  "model_slug": "mistral/mistral-7b-instruct-v3",
- "accelerator_slug": "gpu-mi300x1-192gb",
- "node_count": 3
+ "model_provider": "hugging_face",
+ "workload_config": {},
+ "accelerators": [{
+ "scale": 2,
+ "type": "prefill_decode",
+ "accelerator_slug": "gpu-mi300x1-192gb"
+ }]
  }]
  },
  "access_tokens": { "hugging_face_token": "$HF_TOKEN" }
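
For readers who want to check the new payload shape outside of curl, the following is a minimal Python sketch that assembles the same request body and prints it as JSON. The function name `build_update_body` is hypothetical and not part of the specification. All field names and values are copied from the added lines in this commit. The endpoint URL and HTTP method fall outside the hunk, so the sketch only builds the body and sends nothing. It also reads `HF_TOKEN` from the environment explicitly, since a shell does not expand `$HF_TOKEN` inside a single-quoted `-d '...'` argument.

```python
import json
import os


def build_update_body(hf_token: str) -> dict:
    """Assemble a request body shaped like the updated curl example.

    Hypothetical helper for illustration; values mirror the commit's diff.
    """
    return {
        "spec": {
            "version": 1,
            "name": "renamed-dedicated-inference",
            "region": "atl1",
            "vpc": {"uuid": "997615ce-132d-4bae-9270-9ee21b395e5d"},
            "enable_public_endpoint": True,
            "model_deployments": [
                {
                    "model_slug": "mistral/mistral-7b-instruct-v3",
                    "model_provider": "hugging_face",
                    "workload_config": {},
                    # Replaces the flat accelerator_slug / node_count pair
                    # that the old example carried on the deployment itself.
                    "accelerators": [
                        {
                            "scale": 2,
                            "type": "prefill_decode",
                            "accelerator_slug": "gpu-mi300x1-192gb",
                        }
                    ],
                }
            ],
        },
        "access_tokens": {"hugging_face_token": hf_token},
    }


if __name__ == "__main__":
    # An empty string is used when HF_TOKEN is unset (an assumption for the demo).
    token = os.environ.get("HF_TOKEN", "")
    print(json.dumps(build_update_body(token), indent=2))
```

Building the body as a dictionary and serializing it with `json.dumps` keeps the token substitution explicit and makes it easy to confirm that `accelerator_slug` now lives inside the `accelerators` list and that `node_count` is no longer sent.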
