Skip to content

Commit

Permalink
feat: Add Inferless service definition to orchestration (#14)
Browse files Browse the repository at this point in the history
* Add Inferless service definition to orchestration

* Refactored inferless.yaml for improved model
deployment and cost optimization
  • Loading branch information
NripeshN committed Nov 16, 2023
1 parent 0603b2c commit 9544573
Showing 1 changed file with 18 additions and 0 deletions.
18 changes: 18 additions & 0 deletions orchestration/inferless.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,18 @@
inferless:
name: "Inferless"

image_url: https://assets-global.website-files.com/640eae2485ab876a9c047cb2/640eb15f24ccc7295ffa2f76_Component%20logo.svg

tags:
- model-endpoint
- orchestration

url: https://www.inferless.com/

description: "Inferless offers a serverless GPU solution for scaling machine learning inference without the complexities of server management. It enables rapid model deployment and customization, focusing on reducing infrastructure costs and enhancing user flexibility. Inferless streamlines the deployment process, handling scalability and operational challenges."

features:
- "Serverless GPU Inference: Deploy ML models seamlessly with serverless GPUs, offering ease of scaling and deployment."
- "Infrastructure Cost Optimization: Save up to 80% on infrastructure costs with transparent billing and efficient resource utilization."
- "Seamless Autoscaling: In-house built load balancer for efficient service scaling with minimal overhead."
- "GPU Virtualization: Facilitates the deployment of multiple models on a single GPU instance, catering to customized requirements."

0 comments on commit 9544573

Please sign in to comment.