feat: make model metrics endpoints configurable #1000
✅ Deploy Preview for gateway-api-inference-extension ready!
I have some doubts about adding additional fields to InferencePool. cc @kfswain @ahg-g @robscott @danehans @elevran
/hold for others to comment.
ffad486 to f57478d
@nirrozenbaum, thanks for your advice. I think you are right.
@nayihz I would start with command-line args with default values (the existing ones). |
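A minimal sketch of what the command-line approach suggested above could look like, assuming the standard library `flag` package. The flag names `model-server-metrics-path` and `model-server-metrics-port`, the `metricsConfig` type, and the default port are hypothetical placeholders, not the names or values used in this PR.

```go
package main

import (
	"flag"
	"fmt"
)

// metricsConfig holds the scrape settings for model servers.
// The type and its field names are hypothetical.
type metricsConfig struct {
	Path string
	Port int
}

// parseMetricsFlags parses args into a metricsConfig. Any flag that is
// not supplied keeps its default, so existing deployments behave as before.
func parseMetricsFlags(args []string) (metricsConfig, error) {
	fs := flag.NewFlagSet("metrics", flag.ContinueOnError)
	cfg := metricsConfig{}
	// Defaults are placeholders standing in for "the existing ones".
	fs.StringVar(&cfg.Path, "model-server-metrics-path", "/metrics",
		"HTTP path scraped on each model server")
	fs.IntVar(&cfg.Port, "model-server-metrics-port", 8000,
		"port scraped on each model server (assumed default)")
	if err := fs.Parse(args); err != nil {
		return metricsConfig{}, err
	}
	return cfg, nil
}

func main() {
	cfg, err := parseMetricsFlags([]string{"--model-server-metrics-path=/custom/metrics"})
	if err != nil {
		panic(err)
	}
	fmt.Printf("path=%s port=%d\n", cfg.Path, cfg.Port)
}
```

Keeping the current values as flag defaults means the change is opt-in and needs no API change to InferencePool.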
f57478d to 9fa17f7
@nayihz @nirrozenbaum this introduces a fixed endpoint for all model servers in the pool.

apiVersion: v1
kind: Pod
metadata:
  name: my-app
  annotations:
    prometheus.io/scrape: "true"
    prometheus.io/path: "/metrics"
    prometheus.io/port: "8080"
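One possible reading of the annotation example above is a per-pod override: each pod advertises its own metrics path and port, and a pool-wide value applies only when the annotations are absent. The sketch below assumes that reading. The `endpoint` type and `resolveEndpoint` function are hypothetical and are not part of this PR.

```go
package main

import (
	"fmt"
	"strconv"
)

// endpoint describes where to scrape one model server (hypothetical type).
type endpoint struct {
	Path string
	Port int
}

// resolveEndpoint starts from the pool-wide default and overrides it with
// the pod's prometheus.io/path and prometheus.io/port annotations when
// they are present and valid.
func resolveEndpoint(annotations map[string]string, def endpoint) endpoint {
	ep := def
	if p, ok := annotations["prometheus.io/path"]; ok && p != "" {
		ep.Path = p
	}
	if raw, ok := annotations["prometheus.io/port"]; ok {
		// Ignore values that are not a valid TCP port.
		if port, err := strconv.Atoi(raw); err == nil && port > 0 && port <= 65535 {
			ep.Port = port
		}
	}
	return ep
}

func main() {
	def := endpoint{Path: "/metrics", Port: 8000} // assumed pool-wide default
	pod := map[string]string{
		"prometheus.io/scrape": "true",
		"prometheus.io/path":   "/metrics",
		"prometheus.io/port":   "8080",
	}
	fmt.Printf("%+v\n", resolveEndpoint(pod, def))
}
```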
@elevran we already have a fixed endpoint, so this PR is not introducing it :). The intention was to make that endpoint configurable. |
/unhold |
9fa17f7 to 7466a28
937f686 to d86effa
Looks good for the most part. I added a comment about switching to env vars so this interfaces better with our Helm chart. |
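A minimal sketch of the env-var variant mentioned above, assuming the settings are read once at startup and fall back to a default when the variable is unset. The variable name `MODEL_SERVER_METRICS_PATH` and the helper `envOrDefault` are hypothetical, not taken from this PR or the Helm chart.

```go
package main

import (
	"fmt"
	"os"
)

// envOrDefault returns the value of key from lookup, or def when the
// variable is unset or empty. lookup has the same signature as
// os.LookupEnv so it can be swapped out in tests.
func envOrDefault(lookup func(string) (string, bool), key, def string) string {
	if v, ok := lookup(key); ok && v != "" {
		return v
	}
	return def
}

func main() {
	// MODEL_SERVER_METRICS_PATH is a hypothetical variable name.
	path := envOrDefault(os.LookupEnv, "MODEL_SERVER_METRICS_PATH", "/metrics")
	fmt.Println("scraping model server metrics at", path)
}
```

A Helm chart can then set the variable through the container's `env` list without changing the container arguments.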
d86effa to 9a9affa
9a9affa to 96bee04
/lgtm leaving final stamp to the other reviewers. |
looks great. Thanks! |
[APPROVALNOTIFIER] This PR is APPROVED. This pull-request has been approved by: kfswain, nayihz.
fix: #16