
api: Create generative AI APIs using AI subnet #2246

Merged · 11 commits merged into master on Jul 17, 2024
Conversation

@victorges (Member) commented Jul 8, 2024

What does this pull request do? Explain your changes. (required)

This creates the initial versions of the AI Generate APIs, based on the design doc.

This initial version essentially implements a proxy to an internal AI Gateway service, and thus
exposes the exact same interface as the gateway.

The main complexity was adding support for multipart requests, which the AI Gateway interface
currently uses for these APIs. This required upgrading the ajv library to a new major version,
along with some of its related packages, and adding a bit of code to handle multipart form
validation and error handling.
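The tricky part of multipart bodies is that every form field arrives as a string, so values have to be coerced to the schema's declared types before JSON-schema validation can succeed. A minimal, self-contained sketch of that idea (illustrative TypeScript only, not the PR's actual code, which uses ajv's own coercion):

```typescript
// Sketch of multipart field coercion + validation. All names here are
// hypothetical; the real implementation relies on ajv with coerceTypes.
type FieldType = "string" | "number" | "integer" | "boolean";

interface FieldSchema {
  type: FieldType;
  enum?: string[];
  required?: boolean;
}

// Coerce raw form-field strings according to a flat schema, collecting
// human-readable errors in the same spirit as ajv's error objects.
function coerceAndValidate(
  fields: Record<string, string>,
  schema: Record<string, FieldSchema>,
): { value: Record<string, unknown>; errors: string[] } {
  const value: Record<string, unknown> = {};
  const errors: string[] = [];
  for (const [name, spec] of Object.entries(schema)) {
    const raw = fields[name];
    if (raw === undefined) {
      if (spec.required) errors.push(`${name} is required`);
      continue;
    }
    switch (spec.type) {
      case "number":
      case "integer": {
        const n = Number(raw);
        if (Number.isNaN(n)) errors.push(`${name} must be a ${spec.type}`);
        else value[name] = n;
        break;
      }
      case "boolean":
        value[name] = raw === "true";
        break;
      default:
        if (spec.enum && !spec.enum.includes(raw)) {
          errors.push(`${name} must be one of ${spec.enum.join(", ")}`);
        } else {
          value[name] = raw;
        }
    }
  }
  return { value, errors };
}
```

With coercion in place, the same JSON schemas can validate both JSON bodies and multipart form fields.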

Note: I haven't added the actual API reference on purpose (the paths: section of the API schema), given we're
not yet sure how this new API is going to be advertised. Right now it is gated behind an experiment, so no
end users will be able to use it by themselves.

Specific updates (required)

  • Add schemas for the new APIs' inputs
  • Implement the JSON handler for /generate/text-to-image API
  • Implement the multipart handler for other /generate APIs
  • Create a couple tests for the API proxy behavior
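Since the API is a thin proxy, the JSON handler can be little more than forwarding the validated body to the gateway and relaying the response. A hedged sketch of that shape (the gateway URL, path layout, and function names below are illustrative assumptions, not identifiers from the PR):

```typescript
// Illustrative proxy sketch; names and URLs are assumptions, not the PR's code.
const AI_GATEWAY_URL = "http://ai-gateway.internal"; // assumed internal hostname

// Studio exposes /api/generate/<pipeline>; the gateway exposes /<pipeline>.
function gatewayUrl(pipeline: string): string {
  return `${AI_GATEWAY_URL}/${pipeline}`;
}

async function proxyTextToImage(body: {
  model_id: string;
  prompt: string;
}): Promise<unknown> {
  const res = await fetch(gatewayUrl("text-to-image"), {
    method: "POST",
    headers: { "content-type": "application/json" },
    body: JSON.stringify(body),
  });
  if (!res.ok) {
    // Relay gateway errors instead of swallowing them.
    throw new Error(`gateway returned ${res.status}`);
  }
  return res.json();
}
```

The multipart handlers follow the same pattern, except the request body is re-encoded as multipart/form-data before forwarding.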

How did you test each of these updates (required)

  • yarn test
  • Put on staging and called the APIs

Does this pull request close any open issues?
Implements ENG-2181

Checklist

  • I have read the CONTRIBUTING document.
  • My change requires a change to the documentation.
  • I have updated the documentation accordingly.
  • I have added tests to cover my changes.

vercel bot commented Jul 8, 2024

livepeer-studio: ✅ Ready (preview updated Jul 16, 2024 10:53pm UTC)

const path = `/${name}`;
return app.post(
path,
authorizer({}),

Check failure: Code scanning / CodeQL

Missing rate limiting (High): This route handler performs authorization, but is not rate-limited.

packages/api/src/controllers/generate.ts (dismissed)
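On the rate-limiting alert: a common way to address it is a per-caller limiter applied after the authorizer. A minimal in-memory fixed-window sketch (illustrative only; this is not the middleware the repo uses, and a production setup would likely back the counters with a shared store such as Redis):

```typescript
// Hypothetical fixed-window rate limiter, keyed by e.g. user or API-key id.
class FixedWindowLimiter {
  private counts = new Map<string, { windowStart: number; count: number }>();

  constructor(
    private limit: number, // max requests per window
    private windowMs: number, // window length in milliseconds
  ) {}

  // Returns true if the request is allowed, false if over the limit.
  allow(key: string, now: number = Date.now()): boolean {
    const entry = this.counts.get(key);
    if (!entry || now - entry.windowStart >= this.windowMs) {
      // New key or expired window: start a fresh window.
      this.counts.set(key, { windowStart: now, count: 1 });
      return true;
    }
    if (entry.count >= this.limit) return false;
    entry.count++;
    return true;
  }
}
```

In an Express route, `limiter.allow(req.user.id)` would gate each request between the authorizer and the proxy call, returning 429 when it fails.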
Pretend they are JSON for now, adding support for multipart forms next.
That required upgrading ajv itself, which was a bit
of a pain but worked and also found a few issues in
our schema.
@leszko (Contributor) left a comment:

Looks good. Added some minor comments. I have some other questions/comments (about billing and monitoring), but I'll open them in Discord.

packages/api/src/parse-cli.ts (outdated, resolved)
type: string
default: SG161222/RealVisXL_V4.0_Lightning
enum:
- SG161222/RealVisXL_V4.0_Lightning
@leszko (Contributor):

Do I understand correctly that currently we accept only 2 models, and that these 2 specific models are always available on Os (Orchestrators)?

Btw, what happens if the model is not available on the specific O?

@victorges (Member, Author):

@leszko Forgot to reply to this. Yeah, right now we accept only these 2 models (or others in the other APIs), which are the ones recommended by the AI Network team in the docs, e.g. https://docs.livepeer.org/ai/pipelines/text-to-image#models

Other models (called "on-demand") are still supported, but they might have less support from Orchestrators, so I opted to keep them out of this first version.

Currently, an O can keep only 1 model (from any pipeline) warmed up, as it has to keep an ai-worker container running with that model loaded on the GPU. When the requested model is not available (warm) on the O, it will just kill that running container and start another ai-worker configured with the requested model. The model is usually already on disk if the operator previously configured it (see the getting started guide), otherwise the ai-worker will download it dynamically from Hugging Face.

This means that asking for an uncommon model can make the request really slow. Just loading the model onto the GPU already takes a couple dozen seconds, and if the O has to download it as well, it could take minutes. That's why I opted to limit the models that can be requested here (though there's still a risk that no O has a given model warm and the request takes longer than usual).

I believe this whole flow is being improved by the AI network team, e.g. with the "remote worker" architecture (similar to splitting O+T nodes). Maybe @rickstaa could correct me if I said anything wrong or provide more info here 😄

packages/api/src/middleware/validators.ts (resolved)
@gioelecerati (Member) left a comment:

LGTM, a couple tests are failing

@victorges victorges merged commit a9979b7 into master Jul 17, 2024
8 checks passed
@victorges victorges deleted the vg/feat/ai-generate branch July 17, 2024 20:01