feat: Add Inferless service definition to orchestration (#14)

* Add Inferless service definition to orchestration
* Refactored inferless.yaml for improved model deployment and cost optimization
unifyai · Nov 16, 2023 · 9544573
1 parent 0603b2c
Showing 1 changed file with 18 additions and 0 deletions.
diff --git a/orchestration/inferless.yaml b/orchestration/inferless.yaml
@@ -0,0 +1,18 @@
+inferless:
+  name: "Inferless"
+
+  image_url: https://assets-global.website-files.com/640eae2485ab876a9c047cb2/640eb15f24ccc7295ffa2f76_Component%20logo.svg
+
+  tags:
+    - model-endpoint
+    - orchestration
+
+  url: https://www.inferless.com/
+
+  description: "Inferless offers a serverless GPU solution for scaling machine learning inference without the complexities of server management. It enables rapid model deployment and customization, focusing on reducing infrastructure costs and enhancing user flexibility. Inferless streamlines the deployment process, handling scalability and operational challenges."
+
+  features:
+    - "Serverless GPU Inference: Deploy ML models seamlessly with serverless GPUs, offering ease of scaling and deployment."
+    - "Infrastructure Cost Optimization: Save up to 80% on infrastructure costs with transparent billing and efficient resource utilization."
+    - "Seamless Autoscaling: An in-house load balancer scales services efficiently with minimal overhead."
+    - "GPU Virtualization: Facilitates the deployment of multiple models on a single GPU instance, catering to customized requirements."
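
For reviewers, a minimal sketch of how an entry like the one added above could be sanity-checked once parsed. It assumes the six fields in this diff are all required, which the repository may not actually enforce. The `validate_service` helper and `REQUIRED_KEYS` set are hypothetical and not part of this commit, and the sample entry is hand-copied from the diff with the description and feature list abbreviated.

```python
from urllib.parse import urlparse

# Assumption: these keys mirror the fields in the diff above; the repository
# may not actually require all of them.
REQUIRED_KEYS = {"name", "image_url", "tags", "url", "description", "features"}


def validate_service(entry: dict) -> list[str]:
    """Return a list of problems found in one service entry (empty if none)."""
    problems = [f"missing key: {key}" for key in sorted(REQUIRED_KEYS - set(entry))]

    # URL fields should be absolute http(s) URLs.
    for key in ("url", "image_url"):
        if key in entry:
            value = entry[key]
            parsed = urlparse(value) if isinstance(value, str) else None
            if parsed is None or parsed.scheme not in ("http", "https") or not parsed.netloc:
                problems.append(f"{key} is not an absolute http(s) URL")

    # List fields should be lists of plain strings.
    for key in ("tags", "features"):
        if key in entry:
            value = entry[key]
            if not (isinstance(value, list) and all(isinstance(v, str) for v in value)):
                problems.append(f"{key} must be a list of strings")

    return problems


# Hand-copied from the diff; description and features are abbreviated here.
inferless = {
    "name": "Inferless",
    "image_url": "https://assets-global.website-files.com/640eae2485ab876a9c047cb2/640eb15f24ccc7295ffa2f76_Component%20logo.svg",
    "tags": ["model-endpoint", "orchestration"],
    "url": "https://www.inferless.com/",
    "description": "Inferless offers a serverless GPU solution for scaling machine learning inference without the complexities of server management.",
    "features": [
        "Serverless GPU Inference: Deploy ML models seamlessly with serverless GPUs, offering ease of scaling and deployment.",
    ],
}

if __name__ == "__main__":
    print(validate_service(inferless) or "entry looks well-formed")
```

In practice the dictionary would come from parsing `orchestration/inferless.yaml` with a YAML library and reading its top-level `inferless` key. It is written out by hand here so the sketch runs with the standard library alone.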