GitHub - stellasphere/inference: An easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.

👋 hello

Roboflow Inference is an opinionated tool for running inference on state-of-the-art computer vision models. With no prior knowledge of machine learning or device-specific deployment, you can deploy a computer vision model to a range of devices and environments. Inference supports object detection, classification, and instance segmentation models, and running foundation models (CLIP and SAM).

🎥 Inference in action

Check out Inference running on a video of a football game:

inference.mp4

👩‍🏫 Examples

The /examples directory contains example code for working with and extending inference, including HTTP and UDP client code and an insights dashboard, along with community examples (PRs welcome)!

💻 Why Inference?

Inference provides a scalable method through which you can manage inferences for your vision projects.

Inference is backed by:

A server, so you don’t have to reimplement things like image processing and prediction visualization on every project.
Standardized APIs for computer vision tasks, so switching out the model weights and architecture can be done independently of your application code.
Model architecture implementations, which implement the tensor parsing glue between images and predictions for supervised models that you've fine-tuned to perform custom tasks.
A model registry, so your code can be independent from your model weights & you don't have to re-build and re-deploy every time you want to iterate on your model weights.
Data management integrations, so you can collect more images of edge cases to improve your dataset & model the more it sees in the wild.

And more!

📌 Install pip vs Docker:

pip: Installs inference into your Python environment. Lightweight, good for Python-centric projects.
Docker: Packages inference with its environment. Ensures consistency across setups; ideal for scalable deployments.

💻 install

With ONNX CPU Runtime:

For CPU powered inference:

pip install inference

or

pip install inference-cpu

With ONNX GPU Runtime:

If you have an NVIDIA GPU, you can accelerate your inference with:

pip install inference-gpu

Without ONNX Runtime:

Roboflow Inference uses Onnxruntime as its core inference engine. Onnxruntime provides an array of different execution providers that can optimize inference on differnt target devices. If you decide to install onnxruntime on your own, install inference with:

pip install inference-core

Alternatively, you can take advantage of some advanced execution providers using one of our published docker images.

Extras:

Some functionality requires extra dependancies. These can be installed by specifying the desired extras during installation of Roboflow Inference.

extra	description
`http`	Ability to run the http interface
`gaze`	Ability to use the core `Gaze` model

Example install with http dependancies:

pip install inference[http]

🐋 docker

You can learn more about Roboflow Inference Docker Image build, pull and run in our documentation.

Run on x86 CPU:

docker run --net=host roboflow/roboflow-inference-server-cpu:latest

Run on NVIDIA GPU:

docker run --network=host --gpus=all roboflow/roboflow-inference-server-gpu:latest

👉 more docker run options

Run on arm64 CPU:

docker run -p 9001:9001 roboflow/roboflow-inference-server-arm-cpu:latest

Run on NVIDIA GPU with TensorRT Runtime:

docker run --network=host --gpus=all roboflow/roboflow-inference-server-trt:latest

Run on NVIDIA Jetson with JetPack 4.x:

docker run --privileged --net=host --runtime=nvidia roboflow/roboflow-inference-server-trt-jetson:latest

Run on NVIDIA Jetson with JetPack 5.x:

docker run --privileged --net=host --runtime=nvidia roboflow/roboflow-inference-server-trt-jetson-5.1.1:latest

🔥 quickstart

Docker Quickstart:

import requests

dataset_id = "soccer-players-5fuqs"
version_id = "1"
image_url = "https://source.roboflow.com/pwYAXv9BTpqLyFfgQoPZ/u48G0UpWfk8giSw7wrU8/original.jpg"
#Replace ROBOFLOW_API_KEY with your Roboflow API Key
api_key = "ROBOFLOW_API_KEY"
confidence = 0.5

url = f"http://localhost:9001/{dataset_id}/{version_id}"

params = {
    "api_key": api_key,
    "confidence": confidence,
    "image": image_url,
}

res = requests.post(url, params=params)
print(res.json())

pip Quickstart:

After installing via pip, you can run a simple inference using:

from inference.models.utils import get_roboflow_model

model = get_roboflow_model(
    model_id="soccer-players-5fuqs/1", 
    #Replace ROBOFLOW_API_KEY with your Roboflow API Key
    api_key="ROBOFLOW_API_KEY"
)

results = model.infer(image="https://source.roboflow.com/pwYAXv9BTpqLyFfgQoPZ/u48G0UpWfk8giSw7wrU8/original.jpg", confidence=0.5, iou_threshold=0.5)

print(results)

Pip quickstart using CLIP:

You can also use Roboflow Inference to infer on foundation models like CLIP:

from inference.core.data_models import (
    ClipCompareRequest,
    InferenceRequestImage
)
from inference.models.clip.clip import ClipOnnxRoboflowCoreModel

model = ClipOnnxRoboflowCoreModel(
    model_id="demo-kwgoi/1", device_id="my-pc", 
    #Replace ROBOFLOW_API_KEY with your Roboflow API Key
    api_key="ROBOFLOW_API_KEY"
)

request = ClipCompareRequest(
    subject = "https://source.roboflow.com/7fLqS2r1SV8mm0YzyI0c/yy6hjtPUFFkq4yAvhkvs/original.jpg",
    subject_type = "image",
    prompt = "a horse",
    prompt_type = "text",
)

results = model.infer(request)

**SAM Quickstart**:

You can also run inference on Meta's Segment Anything model using:

```python
from inference.core.data_models import (
    SamSegmentationRequest,
    InferenceRequestImage
)

from inference.models.sam import SegmentAnything

model = SegmentAnything(
    model_id = "sam/vit_h",
    #Replace ROBOFLOW_API_KEY with your Roboflow API Key
    api_key = "ROBOFLOW_API_KEY"
)

image = InferenceRequestImage(
    type = "url",
    value = "https://source.roboflow.com/7fLqS2r1SV8mm0YzyI0c/yy6hjtPUFFkq4yAvhkvs/original.jpg"
)

request = SamSegmentationRequest(
    image = image,
)

results = model.infer_from_request(request)

print(results)

🏗️ inference process

To standardize the inference process throughout all our models, Roboflow Inference has a structure for processing inference requests. The specifics can be found on each model's respective page, but overall it works like this for most models:

📝 license

The Roboflow Inference code is distributed under an Apache 2.0 license. The models supported by Roboflow Inference have their own licenses. View the licenses for supported models below.

model	license
`inference/models/clip`	MIT
`inference/models/gaze`	MIT, Apache 2.0
`inference/models/sam`	Apache 2.0
`inference/models/vit`	Apache 2.0
`inference/models/yolact`	MIT
`inference/models/yolov5`	AGPL-3.0
`inference/models/yolov7`	GPL-3.0
`inference/models/yolov8`	AGPL-3.0

🚀 enterprise

With a Roboflow Inference Enterprise License, you can access additional Inference features, including:

Server cluster deployment
Device management
Active learning
YOLOv5 and YOLOv8 model sub-license

To learn more, contact the Roboflow team.

📚 documentation

Visit our documentation for usage examples and reference for Roboflow Inference.

🏆 contribution

We would love your input to improve Roboflow Inference! Please see our contributing guide to get started. Thank you to all of our contributors! 🙏

💻 explore more Roboflow open source projects

Project	Description
supervision	General-purpose utilities for use in computer vision projects, from predictions filtering and display to object tracking to model evaluation.
Autodistill	Automatically label images for use in training computer vision models.
Inference (this project)	An easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
Notebooks	Tutorials for computer vision tasks, from training state-of-the-art models to tracking objects to counting objects in a zone.
Collect	Automated, intelligent data collection powered by CLIP.

Name		Name	Last commit message	Last commit date
Latest commit History 109 Commits
.github		.github
.release/pypi		.release/pypi
docker		docker
docs		docs
examples		examples
inference		inference
requirements		requirements
signatures/version1		signatures/version1
test		test
.actrc		.actrc
.gitignore		.gitignore
CITATION.cff		CITATION.cff
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
LICENSE.core		LICENSE.core
Makefile		Makefile
README.md		README.md
banner.png		banner.png
mkdocs.yml		mkdocs.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

👋 hello

🎥 Inference in action

👩‍🏫 Examples

💻 Why Inference?

📌 Install pip vs Docker:

💻 install

With ONNX CPU Runtime:

With ONNX GPU Runtime:

Without ONNX Runtime:

Extras:

🐋 docker

🔥 quickstart

🏗️ inference process

📝 license

🚀 enterprise

📚 documentation

🏆 contribution

💻 explore more Roboflow open source projects

About

Uh oh!

Releases

Packages

Languages

License

stellasphere/inference

Folders and files

Latest commit

History

Repository files navigation

👋 hello

🎥 Inference in action

👩‍🏫 Examples

💻 Why Inference?

📌 Install pip vs Docker:

💻 install

With ONNX CPU Runtime:

With ONNX GPU Runtime:

Without ONNX Runtime:

Extras:

🐋 docker

🔥 quickstart

🏗️ inference process

📝 license

🚀 enterprise

📚 documentation

🏆 contribution

💻 explore more Roboflow open source projects

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages