build: Use official nvidia-ml-py package instead of fork #4208

ecederstrand · 2023-09-25T13:16:40Z

What does this PR address?

There are two packages in PyPI installing a pynvml module - pynvml and nvidia-ml-py. The former is a forked version of the latter, which is the official package published by NVIDIA.

I stumbled upon this because I built a virtual environment that also installs gpustat, and that failed because that package pulls in nvidia-ml-py, thus overwriting the BentoML dependency.

Before submitting:

Does the Pull Request follow Conventional Commits specification naming? Here are GitHub's
guide on how to create a pull request.
Does the code follow BentoML's code style, pre-commit run -a script has passed (instructions)?
Did you read through contribution guidelines and follow development guidelines?
Did your changes require updates to the documentation? Have you updated
those accordingly? Here are documentation guidelines and tips on writting docs.
Did you write tests to cover your changes?

sauyon · 2023-10-10T21:26:09Z

Hiya, sorry about the slow response! Mind if I push to fix the CI errors?

ecederstrand · 2023-10-10T22:03:38Z

Please do 😊

sauyon · 2023-10-10T22:19:27Z

Ah, seems like there are real test failures caused by the dependency change, we'll probably have to go through and fix those.

ecederstrand · 2023-10-12T07:11:42Z

Tests are failing because they can't find the NVidia driver (cannot find libnvidia-ml.so.1). I assume the CI pipeline is missing e.g. apt install libnvidia-compute-535-server. Any suggestions on where a good place to add that would be? Also, something similar will be needed for the OS X and Windows tests.

ecederstrand · 2023-10-13T13:04:51Z

It turns out the tests don't actually test the pynvml package. They just die because pynvml now throws a different exception when being imported. Should be fixed now.

ecederstrand · 2023-11-04T11:23:05Z

Test failures are dues to a bad pdm.lock file. I'm not familiar with pdm and just wanted to swap pynvml with nvidia-ml-py in pyproject.toml. What's the correct way to do that and re-generate the lock file?

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

ecederstrand · 2023-11-07T06:13:03Z

The 3 failing checks don't look related to the changes in this MR.

aarnphm · 2023-11-07T08:55:33Z

Yep. Our tests are just very much broken.

aarnphm · 2023-11-07T08:57:45Z

Thanks.

ecederstrand requested a review from a team as a code owner September 25, 2023 13:16

ecederstrand requested review from sauyon and removed request for a team September 25, 2023 13:16

EKC (Erik Cederstrand) added 2 commits September 26, 2023 12:49

build: Use the official package instead of a forked version

08ce644

chore: update hash after pyproject.toml update

2a2e61f

ecederstrand changed the title ~~build: Use the official package instead of a forked version~~ build: Use official nvidia-ml-py package instead of fork Sep 26, 2023

relock pdm

0c0cecc

ci: pynvml raises different Exception now

26ecbce

ecederstrand and others added 5 commits October 13, 2023 15:19

Merge branch 'main' into patch-1

18dd66c

Re-generate pdm.lock

82d8c5e

chore: fix sorting

7297da4

Merge branch 'main' into patch-1

50c2ca5

Merge branch 'main' into patch-1

2bce067

aarnphm added 2 commits November 6, 2023 21:56

chore: update pdm lock

d9437c9

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

merge: branch 'main'@github.com:bentoml/BentoML -> patch-1

458598b

aarnphm approved these changes Nov 7, 2023

View reviewed changes

aarnphm merged commit a59750c into bentoml:main Nov 7, 2023
35 of 38 checks passed

ecederstrand deleted the patch-1 branch November 7, 2023 12:09

aarnphm mentioned this pull request Nov 8, 2023

infra: update to use Ruff formatter #4269

Merged

This was referenced Nov 17, 2023

chore(version): checking using importlib.metadata #4285

Closed

fix(dependencies): lock cattrs<23.2 for now #4292

Merged

This was referenced Nov 20, 2023

docs: update quickstart with OpenLLM #4295

Merged

fix(docs): correct server implementation #4297

Merged

This was referenced Dec 22, 2023

feat: support .python-version symlink #4354

Merged

fix(with_config): annotate return type #4355

Merged

chore(generated): new stubs for proto 4 #4374

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

build: Use official nvidia-ml-py package instead of fork #4208

build: Use official nvidia-ml-py package instead of fork #4208

ecederstrand commented Sep 25, 2023 •

edited

Loading

sauyon commented Oct 10, 2023

ecederstrand commented Oct 10, 2023

sauyon commented Oct 10, 2023

ecederstrand commented Oct 12, 2023 •

edited

Loading

ecederstrand commented Oct 13, 2023 •

edited

Loading

ecederstrand commented Nov 4, 2023

ecederstrand commented Nov 7, 2023

aarnphm commented Nov 7, 2023

aarnphm commented Nov 7, 2023

build: Use official nvidia-ml-py package instead of fork #4208

build: Use official nvidia-ml-py package instead of fork #4208

Conversation

ecederstrand commented Sep 25, 2023 • edited Loading

What does this PR address?

Before submitting:

sauyon commented Oct 10, 2023

ecederstrand commented Oct 10, 2023

sauyon commented Oct 10, 2023

ecederstrand commented Oct 12, 2023 • edited Loading

ecederstrand commented Oct 13, 2023 • edited Loading

ecederstrand commented Nov 4, 2023

ecederstrand commented Nov 7, 2023

aarnphm commented Nov 7, 2023

aarnphm commented Nov 7, 2023

ecederstrand commented Sep 25, 2023 •

edited

Loading

ecederstrand commented Oct 12, 2023 •

edited

Loading

ecederstrand commented Oct 13, 2023 •

edited

Loading