Add amdserver #179
Signed-off-by: Varun Sharma <varun.sharma@amd.com>
@yuzisun what do you think?
Signed-off-by: Varun Sharma <varun.sharma@amd.com>
@varunsh-xilinx Is this good to review again now?
@yuzisun I'm currently working on documentation for adding a new serving runtime, as we discussed earlier. Testing with ModelMesh will take some time, so I plan to remove the multi-model example for now and focus on the single-model case. I'm hoping it'll be ready to review this week.
Signed-off-by: Varun Sharma <varun.sharma@amd.com>
@yuzisun this is ready to review again, thanks
Looks great! Thanks @varunsh-xilinx! /lgtm
[APPROVALNOTIFIER] This PR is APPROVED. This pull-request has been approved by: varunsh-xilinx, yuzisun
Thank you for your help @yuzisun!
* Add amdserver documentation
  Signed-off-by: Varun Sharma <varun.sharma@amd.com>
* Update amdserver example
  Signed-off-by: Varun Sharma <varun.sharma@amd.com>
* Update README
  Signed-off-by: Varun Sharma <varun.sharma@amd.com>
* Update README
  Signed-off-by: Varun Sharma <varun.sharma@amd.com>
* Remove multi-model example
  Signed-off-by: Varun Sharma <varun.sharma@amd.com>
* Update README.md
  Signed-off-by: Varun Sharma <varun.sharma@amd.com>
Co-authored-by: Dan Sun <dsun20@bloomberg.net>
Signed-off-by: rachitchauhan43 <rachitchauhan43@gmail.com>

* Add amdserver documentation
  Signed-off-by: Varun Sharma <varun.sharma@amd.com>
* Update amdserver example
  Signed-off-by: Varun Sharma <varun.sharma@amd.com>
* Update README
  Signed-off-by: Varun Sharma <varun.sharma@amd.com>
* Update README
  Signed-off-by: Varun Sharma <varun.sharma@amd.com>
* Remove multi-model example
  Signed-off-by: Varun Sharma <varun.sharma@amd.com>
* Update README.md
  Signed-off-by: Varun Sharma <varun.sharma@amd.com>
Co-authored-by: Dan Sun <dsun20@bloomberg.net>
Signed-off-by: agriffith50 <agriffith50@bloomberg.net>
This adds documentation for the AMD Inference Server: #2149
Proposed Changes
Since we don't have a distributable image right now, I haven't modified servingruntimes.md or serving_runtime.md because using one of the KServe runtimes requires a predefined image in the clusterservingruntimes definition.