model-inference-service

Here are 2 public repositories matching this topic...

bentoml / BentoML

The easiest way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Multi-model Inference Graph/Pipelines, LLM/RAG apps, and more!

python machine-learning deep-learning model-serving multimodal mlops ml-engineering ai-inference llm generative-ai llmops llm-serving model-inference-service llm-inference inference-platform

Updated Aug 6, 2024
Python

bentoml / transformers-nlp-service

Star

Online Inference API for NLP Transformer models - summarization, text classification, sentiment analysis and more

nlp transformer nlp-machine-learning model-deployment model-serving mlops online-inference llm llmops model-inference-service

Updated Mar 16, 2024
Python

Improve this page

Add a description, image, and links to the model-inference-service topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the model-inference-service topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly