# Serverless LLM Playground with Modal [WIP]

Heavily inspired by the Vercel AI Playground and nat.dev.
*Demo video: `Kapture.2023-12-30.at.12.43.15.mp4`*
## Requirements

- Install packages via [pnpm](https://pnpm.io/installation), then run `pnpm install`.
- Copy `.env.example` to `.env` and fill in the values.
## Modal setup

- Save the Modal LLM models to the `llm/modal` directory and deploy them. These guides are good starting points:
  - https://modal.com/docs/guide/ex/openllama
  - https://modal.com/docs/guide/ex/falcon_bitsandbytes#serve-the-model
  - https://modal.com/docs/guide/ex/falcon_gptq
- Make sure to add an `AUTH_TOKEN` value to the `llm-playground-secrets` secret collection; it serves as the authentication layer for the `web_endpoint`.
- Copy the deployed Modal URL and add it to the model config under `src/model-config.ts` (see the sketch after this list).
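As a rough illustration, here is what a model entry and an authenticated call to the deployed endpoint might look like. This is a minimal sketch: the `ModelConfig` shape, the field names, the example URL, and the request body are assumptions rather than the repo's actual code; only the `AUTH_TOKEN` bearer check mirrors the setup above.

```ts
// Hypothetical shape of an entry in src/model-config.ts; the real
// field names in this repo may differ.
export interface ModelConfig {
  id: string;       // identifier used by the UI's model picker
  name: string;     // human-readable label
  endpoint: string; // deployed Modal web_endpoint URL
}

export const models: ModelConfig[] = [
  {
    id: "openllama-7b",
    name: "OpenLLaMA 7B",
    // Placeholder: use the URL printed when you deploy the Modal app.
    endpoint: "https://your-workspace--openllama-web.modal.run",
  },
];

// The AUTH_TOKEN stored in the `llm-playground-secrets` collection is
// sent as a bearer token, which the Modal web_endpoint checks before
// running generation. This assumes the same token is also available to
// the Next.js server as an environment variable.
export async function generate(model: ModelConfig, prompt: string) {
  const res = await fetch(model.endpoint, {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      Authorization: `Bearer ${process.env.AUTH_TOKEN}`,
    },
    // Request body shape is an assumption; match your Modal endpoint.
    body: JSON.stringify({ prompt }),
  });
  if (!res.ok) throw new Error(`Modal endpoint returned ${res.status}`);
  return res.body; // typically a streamed text response
}
```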
## Database setup

- Set up your Postgres database: https://vercel.com/storage/postgres
- Update your schema under `src/lib/db/schema.ts` (a sketch follows this list).
- To generate migrations: `npm run migrations:generate`
- To push the migrations to the database: `npm run migrations:push`
- To seed the database: `npm run seed`
- To drop the migrations from the database (use with caution; not needed during initial setup): `npm run migrations:drop`
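The `migrations:generate`/`migrations:push` scripts suggest Drizzle ORM; assuming that, a minimal `src/lib/db/schema.ts` could look like the following. The table and column names here are illustrative, not the project's actual schema.

```ts
// A minimal sketch of src/lib/db/schema.ts, assuming Drizzle ORM.
// Table and column names are illustrative only.
import { pgTable, serial, text, timestamp } from "drizzle-orm/pg-core";

export const users = pgTable("users", {
  id: serial("id").primaryKey(),
  // Synced from Clerk via the api/auth-webhook handler (see below).
  clerkId: text("clerk_id").notNull().unique(),
  email: text("email").notNull(),
  createdAt: timestamp("created_at").defaultNow().notNull(),
});
```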
## Clerk setup

- Follow the instructions here: https://clerk.com/docs/nextjs/get-started-with-nextjs
- Then set up a webhook that points at `api/auth-webhook` to sync Clerk data to the database (a sketch follows). Read more here: https://clerk.com/docs/users/sync-data-to-your-backend
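As a rough sketch of what that webhook handler might do, assuming the Next.js App Router and Svix signature verification as described in Clerk's docs (the route path, env var name, and event handling below are assumptions):

```ts
// Hypothetical app/api/auth-webhook/route.ts; adjust the path to the
// project's actual routing setup.
import { Webhook } from "svix";

type ClerkEvent = {
  type: string; // e.g. "user.created", "user.updated", "user.deleted"
  data: { id: string; email_addresses?: { email_address: string }[] };
};

export async function POST(req: Request) {
  const payload = await req.text();

  // Verify the Svix signature before trusting the payload.
  // CLERK_WEBHOOK_SECRET is an assumed env var name.
  const wh = new Webhook(process.env.CLERK_WEBHOOK_SECRET!);
  let event: ClerkEvent;
  try {
    event = wh.verify(payload, {
      "svix-id": req.headers.get("svix-id")!,
      "svix-timestamp": req.headers.get("svix-timestamp")!,
      "svix-signature": req.headers.get("svix-signature")!,
    }) as ClerkEvent;
  } catch {
    return new Response("Invalid signature", { status: 400 });
  }

  // Sync the Clerk user into the local database, e.g. upsert a row
  // keyed by event.data.id using the Drizzle schema sketched above.
  if (event.type === "user.created" || event.type === "user.updated") {
    // await db.insert(users).values({ clerkId: event.data.id, ... })
  }

  return new Response("OK");
}
```

Verifying the signature before touching the database matters here: the endpoint is public, so anything unsigned should be rejected outright.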
## Running the app

Start the development server with `npm run dev`.
## TODO

- Proxy the Modal endpoint through a Next.js API route (an additional layer of protection in front of the authenticated Modal endpoint)
- Dynamic model selection
- Update LLM settings via the UI
- Add a settings page
- Add token usage counts
- Add support for other models from Hugging Face, OpenAI, Anthropic, Cohere, and Replicate
## FAQ

**Q: Why not use Next.js API routes for additional protection in front of the Clerk and Modal endpoints?**

A: LLM generation on Modal can take a while (20-30 seconds), which would time out the proxied request under Vercel's serverless function limits (see https://vercel.com/docs/concepts/limits/overview#general-limits).