Serverless and Agent Inference endpoints #1146

Merged
harshmaru7 merged 10 commits into main from poc/inference
Mar 25, 2026

@SSharma-10
Contributor

Added chat completion endpoint for serverless and agent inference

$ref: 'resources/gen-ai/genai_list_evaluation_test_cases_by_workspace.yml'


/v1/inference/chat/completions:
Contributor

Shouldn't it be /v1/chat/completions?

Contributor Author

Updated. Thanks!

- key: "type"
value: "test"
responses:
"200":
Contributor

This should be 202 since it's async.
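
The thread does not record whether this suggestion was adopted. If it were, the responses block might look like the sketch below. Only the 202 status key comes from the comment; the description text is a placeholder.

```yaml
# Hypothetical sketch: the "202" key reflects the review suggestion,
# the description wording is assumed and not taken from the PR.
responses:
  "202":
    description: Accepted. The request has been queued for asynchronous processing.
```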

type: string
description: >
The desired dimensions of the generated image. Supported values:
256x256, 512x512, 1024x1024.
Contributor

1536x1024 (landscape), 1024x1536 (portrait) - reference https://gradient-sdk.digitalocean.com/api/resources/images/methods/generate

Contributor Author

Updated
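
As an illustration of the size change discussed above, the description might read as follows after the update. This is a sketch only: the thread does not show the updated text, the property name size is inferred from the required list in this PR, and it assumes the two sizes were appended to the existing list rather than replacing it.

```yaml
# Hypothetical sketch: assumes the landscape and portrait sizes were
# added alongside the three square sizes already listed.
size:
  type: string
  description: >
    The desired dimensions of the generated image. Supported values:
    256x256, 512x512, 1024x1024, 1536x1024 (landscape), 1024x1536 (portrait).
```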

- prompt
- model
- "n"
- size
Contributor

It should be optional.
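
Assuming the comment refers to size (the line it is attached to), making it optional would mean dropping it from the list of required properties. The sketch below shows that reading; the enclosing required key is an assumption, and the thread does not show the final change.

```yaml
# Hypothetical sketch: "size" removed from the required properties,
# assuming that is the field the reviewer meant.
required:
  - prompt
  - model
  - "n"
```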

list, and delete custom VPCs as well as retrieve information about the
resources assigned to them.

- name: Serverless Inference
Contributor

Do these need to be in alphabetical order here?

Contributor Author

Yes, added in alphabetical order. TY!

@@ -0,0 +1,6 @@
lang: cURL
source: |-
curl -X GET \

@@ -0,0 +1,62 @@
type: object
Contributor

Is this file required?

@@ -0,0 +1,16 @@
lang: cURL
Contributor

Is this file required?

@@ -0,0 +1,17 @@
lang: cURL
Contributor

Do we need this file too?


2 participants