Collection of containerized AI/ML workloads optimized for deployment in Polaris Containers. Includes CPU and GPU variants for different use cases.
- Location: `/polaris-ai/cpu/torchserve`
  - Overview: Sets up TorchServe for CPU workloads. The start script downloads the model and starts the service.
- Location: `/polaris-ai/gpu/torchserve`
  - Overview: Sets up TorchServe for GPU workloads using a GPU-enabled TorchServe image.
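Once either TorchServe container is up, inference goes through TorchServe's standard REST inference API (`POST /predictions/<model>`). A minimal sketch of building such a request, assuming the default TorchServe port 8080 and a hypothetical model name (the actual host, port, and model depend on your deployment):

```python
import urllib.request


def torchserve_predict_request(model_name, data, host="http://localhost:8080"):
    """Build a request against TorchServe's inference API.

    The /predictions/<model> path is TorchServe's standard inference
    endpoint; the host and port here are illustrative defaults.
    """
    return urllib.request.Request(
        f"{host}/predictions/{model_name}",
        data=data,  # raw request body, e.g. image bytes or JSON
        method="POST",
    )
```

Sending the request (e.g. with `urllib.request.urlopen`) returns the model's prediction as the response body.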
- Location: `/polaris-llm/cpu/ollama`
  - Overview: Packages the Ollama service in a Docker container for CPU usage. Specify the model via the `POLARIS_LLM_OLLAMA_MODEL` environment variable.
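The containerized Ollama service exposes Ollama's documented HTTP API. A minimal sketch of building a non-streaming generation request, assuming Ollama's default port 11434 and with the model name left to the caller (it should match whatever `POLARIS_LLM_OLLAMA_MODEL` was set to):

```python
import json
import urllib.request


def ollama_generate_request(model, prompt, host="http://localhost:11434"):
    # Ollama's /api/generate endpoint; "stream": False asks for a single
    # JSON response instead of a stream of partial responses.
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        f"{host}/api/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
```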
- Location: `/polaris-llm/gpu/ollama`
  - Overview: GPU-accelerated version of the Ollama service. Uses NVIDIA CUDA for model inference.
  - Prerequisites:
    - `POLARIS_LLM_OLLAMA_MODEL`: The model identifier.
- Location: `/polaris-llm/gpu/vllm`
  - Overview: Sets up vLLM in a Docker container for GPU usage. The start script validates required environment variables, downloads the specified model (if needed), and launches vLLM.
  - Prerequisites:
    - `HF_TOKEN`: Your Hugging Face token.
    - `POLARIS_VLLM_MODEL`: The model identifier.
    - `POLARIS_VLLM_DIR`: (Optional) Directory for model storage (default: `/models`).
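The validation step performed by the vLLM start script can be sketched as follows. This is an illustrative reimplementation using the variable names listed above, not the actual script; the function name is hypothetical:

```python
import os

# Variables the vLLM workload requires, per the prerequisites above.
REQUIRED = ("HF_TOKEN", "POLARIS_VLLM_MODEL")


def read_vllm_config(env=os.environ):
    # Fail fast if a required variable is missing or empty,
    # mirroring the start script's validation step (sketch only).
    missing = [name for name in REQUIRED if not env.get(name)]
    if missing:
        raise RuntimeError(
            f"Missing required environment variables: {', '.join(missing)}"
        )
    return {
        "hf_token": env["HF_TOKEN"],
        "model": env["POLARIS_VLLM_MODEL"],
        # POLARIS_VLLM_DIR is optional and defaults to /models.
        "model_dir": env.get("POLARIS_VLLM_DIR", "/models"),
    }
```

Failing fast with a clear message keeps a misconfigured container from starting vLLM against a missing token or model name.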
- On first startup, if the specified model is not already present, the related workload will automatically download it (this may take some time depending on the model size and connection speed).
- Refer to each workload's README for more detailed instructions and configuration options.