
improv(package): use python slim base image and let pytorch install cuda #807

Merged
merged 1 commit into bentoml:main on Jan 12, 2024

Conversation

larme (Member) commented on Dec 22, 2023

We can let PyTorch install its own CUDA runtime and save some duplicated space.
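For illustration (not part of this PR), here is a minimal sketch of what "PyTorch installs its own CUDA runtime" means in practice: on a plain `python:slim` Linux image, pip-installing a CUDA build of `torch` also pulls in `nvidia-*` runtime wheels, so the base image itself no longer needs to ship a CUDA toolkit. The filtering by package-name prefix below is an assumption about how those wheels are named on PyPI.

```python
# Sketch: list torch plus the nvidia-* runtime wheels that pip pulled in
# alongside it (assumes a CUDA build of torch installed from PyPI on Linux).
from importlib import metadata

for dist in metadata.distributions():
    name = (dist.metadata["Name"] or "").lower()
    if name == "torch" or name.startswith("nvidia-"):
        print(f"{name}=={dist.version}")
```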

larme requested a review from aarnphm as a code owner on December 22, 2023 03:02
aarnphm (Member) commented on Dec 22, 2023

I think the reason we set the base image to use CUDA is so that all bentos will be idempotent.

They won't depend on the host machine's CUDA version. The drawback is that we have to download the binary twice.

larme (Member, Author) commented on Dec 22, 2023

> I think the reason we set the base image to use CUDA is so that all bentos will be idempotent.
>
> They won't depend on the host machine's CUDA version. The drawback is that we have to download the binary twice.

They won't depend on the host machine's CUDA version, because PyTorch will download a CUDA runtime version suitable for that PyTorch version.

The host's CUDA version is really just the CUDA driver version, which is compatible with the newer CUDA runtime that PyTorch installs inside the Docker container. So we are not depending on the host's CUDA version in most cases.
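As a quick illustration of the driver-vs-runtime distinction discussed above (not from this PR; assumes a CUDA build of torch is installed), the sketch below prints the CUDA runtime bundled with the torch wheel and whether the host driver can actually run it.

```python
# Sketch: torch.version.cuda is the runtime version baked into the wheel,
# while torch.cuda.is_available() only succeeds if the host *driver* is new
# enough to run that runtime.
import torch

print("bundled CUDA runtime:", torch.version.cuda)      # e.g. "12.1", fixed at wheel build time
print("host driver usable:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("device:", torch.cuda.get_device_name(0))
```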

aarnphm (Member) commented on Dec 22, 2023

> > I think the reason we set the base image to use CUDA is so that all bentos will be idempotent.
> >
> > They won't depend on the host machine's CUDA version. The drawback is that we have to download the binary twice.
>
> They won't depend on the host machine's CUDA version, because PyTorch will download a CUDA runtime version suitable for that PyTorch version.
>
> The host's CUDA version is really just the CUDA driver version, which is compatible with the newer CUDA runtime that PyTorch installs inside the Docker container. So we are not depending on the host's CUDA version in most cases.

This is not true, because PyTorch will still pick up the host machine's CUDA nonetheless. So if the container doesn't have a CUDA runtime and the host machine has an older CUDA, it won't work.

We have seen this on BentoCloud before. Hence, this is the current fix.

larme (Member, Author) commented on Dec 22, 2023

> We have seen this on BentoCloud before. Hence, this is the current fix.

Was this the test done with xipeng? I think we (xipeng, jiangbo, and I) verified that PyTorch will install CUDA regardless of whether host CUDA is available. @yetone could you confirm?

aarnphm (Member) commented on Dec 22, 2023

> > We have seen this on BentoCloud before. Hence, this is the current fix.
>
> Was this the test done with xipeng? I think we (xipeng, jiangbo, and I) verified that PyTorch will install CUDA regardless of whether host CUDA is available. @yetone could you confirm?

Yes, we decided that we would still set CUDA as the base image so that future updates won't break old bentos.

aarnphm merged commit 8baaf12 into bentoml:main on Jan 12, 2024
24 checks passed