Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix(ray): check CDI availability for model container #538

Merged
merged 1 commit into from
Mar 19, 2024
Merged

Conversation

heiruwu
Copy link
Member

@heiruwu heiruwu commented Mar 19, 2024

Because

  • When deploying CPU model will result in Error: setting up CDI devices: unresolvable CDI devices nvidia.com/gpu=all

This commit

  • add gpu flag in model configuration for Nvidia CDI setup

Copy link

linear bot commented Mar 19, 2024

Copy link

codecov bot commented Mar 19, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 0.99%. Comparing base (388a69e) to head (f16dd90).

Additional details and impacted files
@@          Coverage Diff          @@
##            main    #538   +/-   ##
=====================================
  Coverage   0.99%   0.99%           
=====================================
  Files         15      15           
  Lines       5819    5819           
=====================================
  Hits          58      58           
  Misses      5753    5753           
  Partials       8       8           
Flag Coverage Δ
unittests 0.99% <ø> (ø)

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@heiruwu heiruwu merged commit d7f78dd into main Mar 19, 2024
14 checks passed
@heiruwu heiruwu deleted the INS-3937 branch March 19, 2024 07:32
heiruwu added a commit that referenced this pull request Mar 19, 2024
Because

- When deploying CPU model will result in `Error: setting up CDI
devices: unresolvable CDI devices nvidia.com/gpu=all`

This commit

- add `gpu` flag in model configuration for Nvidia CDI setup
heiruwu pushed a commit that referenced this pull request Jun 6, 2024
🤖 I have created a release *beep* *boop*
---


##
[0.24.0-alpha](v0.23.0-alpha...v0.24.0-alpha)
(2024-06-06)


### ⚠ BREAKING CHANGES

* **model:** adopt containerized model serving
([#542](#542))

### Features

* **handler:** implement get latest operation
([#589](#589))
([33d2395](33d2395))
* **handler:** support listing available regions for model deployment
([#561](#561))
([52c2172](52c2172))
* **handler:** support model profile image
([#566](#566))
([0c8dbba](0c8dbba))
* **model:** add permission field in model object
([#576](#576))
([2d36a58](2d36a58))
* **model:** add task schema in model struct
([#578](#578))
([647069d](647069d))
* **model:** adopt containerized model serving
([#542](#542))
([3c80f39](3c80f39))
* **model:** embed sample input/output in model proto message
([#558](#558))
([5fba538](5fba538))
* **model:** support latest model version trigger
([#580](#580))
([47cb36c](47cb36c))
* **model:** support resource spec in model definition
([#557](#557))
([fee6e4b](fee6e4b))
* **model:** support search/filter with list endpoints
([#559](#559))
([7b17393](7b17393))
* **model:** support watch latest model and `order_by` for list
endpoints
([#586](#586))
([1a5e48c](1a5e48c))
* **prediction:** implement sync/async prediction records
([#555](#555))
([8d58eda](8d58eda))
* **ray:** support containerized model deployment
([#529](#529))
([4dcab05](4dcab05))
* **ray:** support custom accelerator type
([#547](#547))
([f0cc0d7](f0cc0d7))


### Bug Fixes

* **acl:** fix wrong type name
([#560](#560))
([89d09a5](89d09a5))
* **dockerfile:** update deploy config yaml path
([#590](#590))
([ee369e0](ee369e0))
* **model:** fix missing package in test models
([#552](#552))
([a28a21b](a28a21b))
* **ray:** check CDI availability for model container
([#538](#538))
([28bad42](28bad42))
* **server:** add missing message size option
([#597](#597))
([d0a0aac](d0a0aac))
* **service:** fix list model version pagination
([#569](#569))
([d8fb04a](d8fb04a))
* **service:** fix list model version return list size
([#556](#556))
([9b69f9c](9b69f9c))

---
This PR was generated with [Release
Please](https://github.com/googleapis/release-please). See
[documentation](https://github.com/googleapis/release-please#release-please).
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Archived in project
2 participants