A flexible, high-performance serving system for machine learning models
⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end optimization, multi-platform and multi-framework support.
A flexible, high-performance carrier for machine learning models (serving deployment framework for PaddlePaddle 『飞桨』)
A scalable inference server for models optimized with OpenVINO™
A high-performance inference system for large language models, designed for production environments.
Serving inside PyTorch with multiple threads
TensorFlow Serving ARM - A project for cross-compiling TensorFlow Serving targeting popular ARM cores
[Deep learning model deployment framework] Supports tensorflow/torch/tensorrt/vllm and more NN frameworks; supports dynamic batching and streaming modes; rate-limitable, extensible, and high-performance. Helps users quickly deploy models online and serve them via HTTP/RPC interfaces.
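Dynamic batching, mentioned in the description above, is the core trick most of these serving systems share: concurrent requests are held briefly and run through the model as one batch. A minimal pure-Python sketch of the idea follows; the class and parameter names are hypothetical illustrations, not any listed framework's actual API.

```python
import queue
import threading
import time

class DynamicBatcher:
    """Illustrative dynamic-batching sketch: collect requests until the
    batch is full or a timeout expires, then run them through the model
    in one call. Real serving systems add padding, priorities, etc."""

    def __init__(self, model_fn, max_batch=8, timeout_s=0.01):
        self.model_fn = model_fn      # callable: list of inputs -> list of outputs
        self.max_batch = max_batch    # flush when this many requests are queued
        self.timeout_s = timeout_s    # ...or when this much time has passed
        self.requests = queue.Queue()
        threading.Thread(target=self._loop, daemon=True).start()

    def predict(self, x):
        """Blocking client call: enqueue one input and wait for its result."""
        done = threading.Event()
        slot = {"input": x, "done": done}
        self.requests.put(slot)
        done.wait()
        return slot["output"]

    def _loop(self):
        while True:
            batch = [self.requests.get()]        # block for the first request
            deadline = time.monotonic() + self.timeout_s
            while len(batch) < self.max_batch:   # fill until full or timed out
                remaining = deadline - time.monotonic()
                if remaining <= 0:
                    break
                try:
                    batch.append(self.requests.get(timeout=remaining))
                except queue.Empty:
                    break
            # One model invocation for the whole batch, then fan results out.
            outputs = self.model_fn([s["input"] for s in batch])
            for slot, out in zip(batch, outputs):
                slot["output"] = out
                slot["done"].set()
```

For example, `DynamicBatcher(lambda xs: [x * 2 for x in xs]).predict(3)` returns `6`, with concurrent callers transparently sharing a batch.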
TensorFlow Serving based on encrypted models, protecting model files from being stolen
PyTorch during training, libtorch during serving via gRPC
A simple TensorFlow C++ REST API server
A common REST server that can serve both tensorflow-serving and xgboost models
A TensorFlow Serving client using brpc