Skip to content
@bentoml

BentoML

The easiest way to run AI Inference in the cloud

Welcome to BentoML 👋 Twitter Follow Slack

BentoML

What is BentoML? 👩‍🍳

BentoML is an open-source model serving library for building model inference APIs and multi-model serving systems with any open-source or custom AI models. It comes with everything you need for serving optimization, model packaging, and simplifies production deployment via ☁️ BentoCloud.

Get in touch 💬

👉 Join our Slack community!

👀 Follow us on X @bentomlai and LinkedIn

📖 Read our blog

Pinned

  1. BentoML BentoML Public

    The easiest way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Multi-model Inference Graph/Pipelines, LLM/RAG apps, and more!

    Python 6.7k 761

  2. OpenLLM OpenLLM Public

    Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint in the cloud.

    Python 9.1k 582

Repositories

Showing 10 of 79 repositories
  • BentoML Public

    The easiest way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Multi-model Inference Graph/Pipelines, LLM/RAG apps, and more!

    bentoml/BentoML’s past year of commit activity
    Python 6,734 Apache-2.0 761 215 17 Updated Jun 19, 2024
  • chatgpt-lite Public Forked from blrchen/chatgpt-lite

    Fast ChatGPT UI with support for both OpenAI and Azure OpenAI. 快速的ChatGPT UI,支持OpenAI和Azure OpenAI。

    bentoml/chatgpt-lite’s past year of commit activity
    TypeScript 0 MIT 78 0 1 Updated Jun 19, 2024
  • asynq Public Forked from hibiken/asynq

    Simple, reliable, and efficient distributed task queue in Go

    bentoml/asynq’s past year of commit activity
    Go 0 MIT 695 0 2 Updated Jun 19, 2024
  • bentoml/BentoWhisperX’s past year of commit activity
    Python 6 1 0 9 Updated Jun 18, 2024
  • BentoBLIP Public

    how to build an image captioning application on top of a BLIP model with BentoML

    Python 1 0 0 3 Updated Jun 18, 2024
  • BentoCLIP Public

    building a CLIP application using BentoML

    bentoml/BentoCLIP’s past year of commit activity
    Python 4 1 0 3 Updated Jun 18, 2024
  • BentoDiffusion Public

    BentoDiffusion: A collection of diffusion models served with BentoML

    Python 325 Apache-2.0 24 9 1 Updated Jun 18, 2024
  • OpenLLM Public

    Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint in the cloud.

    bentoml/OpenLLM’s past year of commit activity
    Python 9,146 Apache-2.0 582 58 0 Updated Jun 17, 2024
  • terraform-azure-modules Public Forked from Azure/terraform-azure-modules

    Azure verified modules for Terraform

    bentoml/terraform-azure-modules’s past year of commit activity
    HCL 0 MIT 27 0 0 Updated Jun 17, 2024
  • bentoml/BentoTRTLLM’s past year of commit activity
    Python 2 1 0 0 Updated Jun 17, 2024