This project provides a FastAPI-based server that acts as a proxy to dynamically download, load, and unload LoRA (Low-Rank Adaptation) adapters based on user requests.
- Dynamic LoRA Management: Load and unload LoRA adapters on demand.
- Proxy Server: Acts as middleware that routes requests to the appropriate LoRA adapter.
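
When started with `VLLM_ALLOW_RUNTIME_LORA_UPDATING=1` (as in the setup below), vLLM exposes `/v1/load_lora_adapter` and `/v1/unload_lora_adapter` endpoints that a proxy like this can call on demand. A minimal sketch of those calls, assuming vLLM is on port 8000; the adapter name and path are placeholders, not values from this repo:

```python
import json
import urllib.request

VLLM_URL = "http://localhost:8000"  # assumed vLLM port (matches the setup below)

def load_lora_payload(name: str, path: str) -> dict:
    """Build the JSON body for vLLM's /v1/load_lora_adapter endpoint."""
    return {"lora_name": name, "lora_path": path}

def unload_lora_payload(name: str) -> dict:
    """Build the JSON body for vLLM's /v1/unload_lora_adapter endpoint."""
    return {"lora_name": name}

def post(endpoint: str, payload: dict) -> None:
    """POST a JSON body to the vLLM server (requires a running server)."""
    req = urllib.request.Request(
        f"{VLLM_URL}{endpoint}",
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
    )
    urllib.request.urlopen(req)

# Example (placeholder adapter name/path; needs a live vLLM server to actually run):
# post("/v1/load_lora_adapter", load_lora_payload("my-adapter", "/path/to/adapter"))
# post("/v1/unload_lora_adapter", unload_lora_payload("my-adapter"))
```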
- Clone the repository:

  ```bash
  git clone https://github.com/VijayRavichander/nano-LoRAX
  ```
- Install uv, then create and activate a virtual environment:

  ```bash
  wget -qO- https://astral.sh/uv/install.sh | sh
  source $HOME/.local/bin/env
  uv venv
  source .venv/bin/activate
  ```
- Install dependencies:

  ```bash
  uv pip install -r requirements.txt
  ```
- Export environment variables (the first enables vLLM's runtime LoRA load/unload endpoints; the second speeds up Hugging Face downloads):

  ```bash
  export VLLM_ALLOW_RUNTIME_LORA_UPDATING=1
  export HF_HUB_ENABLE_HF_TRANSFER=1
  ```

- Add your HF_TOKEN to .env.example and rename it to .env.
- Run the vLLM server:

  ```bash
  nohup uv run vllm serve neuralmagic/Meta-Llama-3.1-8B-Instruct-FP8 \
      --max-model-len 8192 \
      --enable-lora \
      --max-lora-rank 128 \
      --port 8000 > logs/vllm.log 2>&1 &
  ```

- After the vLLM server is up, run the proxy server:

  ```bash
  nohup uv run python -m server > proxy.log 2>&1 &
  ```
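
Once both servers are running, a request can target a specific LoRA adapter by name: vLLM's OpenAI-compatible API routes to a loaded adapter when the `model` field matches its `lora_name`. A sketch of such a request; the proxy port and adapter name here are assumptions, so check the server configuration for the actual values:

```python
import json
import urllib.request

PROXY_URL = "http://localhost:8001"  # hypothetical proxy port; check the server config

def chat_request(adapter: str, prompt: str) -> dict:
    """Build an OpenAI-style chat-completion body targeting a LoRA adapter by name."""
    return {
        "model": adapter,  # vLLM serves the loaded LoRA adapter with this lora_name
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": 128,
    }

body = chat_request("my-lora-adapter", "Hello!")
# Uncomment to send (requires both servers to be up):
# req = urllib.request.Request(
#     f"{PROXY_URL}/v1/chat/completions",
#     data=json.dumps(body).encode(),
#     headers={"Content-Type": "application/json"},
# )
# print(urllib.request.urlopen(req).read().decode())
```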