Open
Description
Load Thresholds
- KVCacheThreshold
- QueueLengthThreshold
metricsEndpoint
Currently ext proc is hard-coded to scrape from [POD_IP]:[PORT]/metrics
for prometheus metrics.
Different model server may expose different endpoints for scraping metrics. We'd like to introduce config fields to capture this information.
Metadata
Metadata
Assignees
Labels
No labels