Run your own production inference code with SageMaker
Updated Jun 15, 2020 - Python
Basic MLPlatform includes Model Registry and Inference Server
A networked inference server for Whisper, so you don't have to wait for the audio model to reload for the hundredth time.
Serve PyTorch inference requests with Redis-backed batching for higher throughput.
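The batching idea behind a server like this can be sketched without the real dependencies: requests are pushed onto a queue one at a time, and a worker drains them in groups so the model runs once per batch instead of once per request. This is a minimal illustration, not the repo's code — a plain in-process deque stands in for a Redis list (LPUSH/BRPOP in a real deployment), and `fake_model` stands in for a PyTorch forward pass; all names are illustrative.

```python
from collections import deque

# Stand-in for a Redis list: requests are enqueued individually
# and drained in batches by a worker process.
request_queue = deque()

def fake_model(batch):
    # Placeholder for a batched PyTorch forward pass.
    return [x * 2 for x in batch]

def enqueue(request_id, payload):
    request_queue.append((request_id, payload))

def drain_batch(max_batch_size=8):
    """Pop up to max_batch_size queued requests and serve them in one model call."""
    batch = []
    while request_queue and len(batch) < max_batch_size:
        batch.append(request_queue.popleft())
    if not batch:
        return {}
    ids, payloads = zip(*batch)
    outputs = fake_model(list(payloads))
    return dict(zip(ids, outputs))

# Queue three requests, then answer all of them with a single model call.
for i, x in enumerate([1, 2, 3]):
    enqueue(f"req-{i}", x)
results = drain_batch()
```

The win is amortization: one batched forward pass is usually much cheaper than several single-item passes, at the cost of a small queueing delay.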
An AI-powered mobile crop-advisory app for farmers and gardeners that provides information about a crop from an image taken by the user. It supports 10 crops and 37 kinds of crop diseases. The AI model is a ResNet fine-tuned on crop images collected by web-scraping Google Images and from the PlantVillage dataset.
Effortlessly Deploy and Serve Large Language Models in the Cloud as an API Endpoint for Inference
Inference Server Implementation from Scratch for Machine Learning Models
Audio components for geniusrise framework
Vision and vision-multi-modal components for geniusrise framework
Roboflow's inference server to analyze video streams. This project extracts insights from video frames at defined intervals and generates informative visualizations and CSV outputs.
Text components powering LLMs & SLMs for geniusrise framework
Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)
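The core of an online-serving benchmark like this is measuring per-request latency and summarizing it as throughput plus tail percentiles. A minimal stdlib sketch of that measurement loop, assuming a synchronous client (`benchmark` and `percentile` are illustrative names, not the repo's API):

```python
import time

def percentile(samples, pct):
    """Nearest-rank percentile over a list of latency samples."""
    ordered = sorted(samples)
    k = max(0, min(len(ordered) - 1, round(pct / 100 * len(ordered)) - 1))
    return ordered[k]

def benchmark(fn, payloads):
    """Time each call to fn and report throughput plus latency percentiles."""
    latencies = []
    start = time.perf_counter()
    for p in payloads:
        t0 = time.perf_counter()
        fn(p)
        latencies.append(time.perf_counter() - t0)
    elapsed = time.perf_counter() - start
    return {
        "rps": len(payloads) / elapsed,   # requests per second
        "p50": percentile(latencies, 50),
        "p95": percentile(latencies, 95),
    }

stats = benchmark(lambda x: x, list(range(100)))
```

Real serving benchmarks add concurrency and warm-up phases, but the reporting shape — throughput plus p50/p95-style tail latencies — is the same.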
Session-based real-time hotel recommendation web application
Friendli: the fastest serving engine for generative AI
Advanced inference pipeline using NVIDIA Triton Inference Server for CRAFT text detection (PyTorch). Includes a PyTorch -> ONNX -> TensorRT converter and inference pipelines (TensorRT, Triton server, multi-format). Supported model formats for Triton inference: TensorRT engine, TorchScript, ONNX.
Deploy DL/ML inference pipelines with minimal extra code.
This is a repository for an object detection inference API using the TensorFlow framework.
The simplest way to serve AI/ML models in production
This is a repository for a no-code object detection inference API using YOLOv4 and YOLOv3 with OpenCV.