Skip to content

Write a troubleshooting guide #689

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
liu-cong opened this issue Apr 14, 2025 · 1 comment
Open

Write a troubleshooting guide #689

liu-cong opened this issue Apr 14, 2025 · 1 comment
Labels
triage/accepted Indicates an issue or PR is ready to be actively worked on.

Comments

@liu-cong
Copy link
Contributor

What would you like to be added:

A troubleshooting guide with instructions and common errors to help troubleshoot. A few things on top of my mind:

  1. Start with describe the InferenceModel and InferencePool resources and how to interpret the status.
  2. How to read EPP logs, e.g., a very useful log to read is the pods and metrics that EPP sees.
  3. Permission issues.
  4. Common errors, e.g., Model-Pool mismatch, EPP-Pool mismatch.

Related issues: #476

Why is this needed:

@kfswain
Copy link
Collaborator

kfswain commented Apr 24, 2025

This issue is not quite the same, but related: #735

@kfswain kfswain added the triage/accepted Indicates an issue or PR is ready to be actively worked on. label Apr 24, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
triage/accepted Indicates an issue or PR is ready to be actively worked on.
Projects
None yet
Development

No branches or pull requests

2 participants