
AI Runtime

A versatile sidecar enabling metric standardization, model downloading, and management.

Quick Start

Installation

AI Runtime can be installed with pip.

pip install aibrix
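
One way to confirm that the package is visible to the current interpreter after installation is to query its distribution metadata. The sketch below uses only the Python standard library. The helper name `installed_version` is hypothetical, and it assumes the distribution is registered under the name `aibrix`, as in the pip command above.

```python
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # "aibrix" is assumed to match the name used with pip install.
    version = installed_version("aibrix")
    print(f"aibrix version: {version}" if version else "aibrix is not installed")
```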

Model download

The AI Runtime supports model downloading from the following storage backends:

HuggingFace
S3
TOS

For more details on model downloading, please refer to our Runtime docs.
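
To illustrate how a single entry point might route a download to one of the backends listed above, here is a minimal sketch. It is not the AI Runtime's actual API: the `Backend` enum, the `select_backend` function, and the `s3://` and `tos://` URI schemes are assumptions made for illustration, and any string without a scheme is treated as a HuggingFace repository ID.

```python
from enum import Enum
from urllib.parse import urlparse


class Backend(Enum):
    """Storage backends named in this README (enum itself is hypothetical)."""
    HUGGINGFACE = "huggingface"
    S3 = "s3"
    TOS = "tos"


def select_backend(model_uri: str) -> Backend:
    """Pick a storage backend from the URI scheme (assumed routing rule)."""
    scheme = urlparse(model_uri).scheme.lower()
    if scheme == "s3":
        return Backend.S3
    if scheme == "tos":
        return Backend.TOS
    if scheme == "":
        # No scheme: assume a HuggingFace repo ID such as "org/model".
        return Backend.HUGGINGFACE
    raise ValueError(f"unsupported scheme: {scheme!r}")
```

Raising on an unknown scheme, rather than silently falling back to HuggingFace, keeps a mistyped URI from triggering a download from the wrong place.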

Integrate with inference engines

The AI Runtime hides engine-specific implementation details behind a uniform interface for model management and for exposing inference monitoring metrics.

Currently, the vLLM engine is supported. Support for SGLang and other inference engines is planned.

For more details on integrating with vLLM, please refer to our Runtime docs.
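
As a rough picture of what metric standardization can mean in practice, the sketch below renames engine-specific metrics to engine-neutral ones. Everything in it is an assumption for illustration: the `standardize` function, the `VLLM_TO_STANDARD` table, and the target names are hypothetical, and the vLLM-style source names are examples only. The Runtime docs describe the actual behavior.

```python
from typing import Dict

# Hypothetical mapping from engine-specific metric names to common names.
VLLM_TO_STANDARD: Dict[str, str] = {
    "vllm:num_requests_running": "requests_running",
    "vllm:num_requests_waiting": "requests_waiting",
}


def standardize(raw: Dict[str, float], mapping: Dict[str, str]) -> Dict[str, float]:
    """Rename known metrics; drop metrics that have no standard name."""
    return {mapping[name]: value for name, value in raw.items() if name in mapping}


if __name__ == "__main__":
    sample = {"vllm:num_requests_running": 3.0, "engine_internal_counter": 7.0}
    print(standardize(sample, VLLM_TO_STANDARD))
```

Supporting another engine under this scheme would only require a second mapping table, which is the kind of separation the paragraph above describes.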

Contributing

We welcome contributions from the community! Check out our contributing guidelines to see how you can make a difference.

Build from source

# This may take several minutes
pip install -e .

Lint, Format and Type Check

Before contributing your code, please run the following commands to ensure that it passes the tests and linting checks.

# install dependencies
poetry install --no-root --with dev

# linting, formatting and type checking
bash ./scripts/format.sh

License

AI Runtime is licensed under the Apache License.
