Skip to content

Upgrade ML GPU bot config to run TF 2.3.0rc2 and other appropriate deps (#1540).#1924

Merged
Dor1s merged 1 commit into
masterfrom
tf23_gpu
Jul 27, 2020
Merged

Upgrade ML GPU bot config to run TF 2.3.0rc2 and other appropriate deps (#1540).#1924
Dor1s merged 1 commit into
masterfrom
tf23_gpu

Conversation

@Dor1s

@Dor1s Dor1s commented Jul 24, 2020

Copy link
Copy Markdown
Contributor

I'm not quite sure how to test this. I did try to install things manually on a bot and that worked, but we have slightly different setup for deploying CF bots.

One idea might be to update Chromium's ml-with-gpu image and just re-create those bots. Even if they break, we won't lose much.

Another option is to set up a separate GCP project with a ml-with-gpu bot. It'll take a bit longer but won't be "testing in prod".

Maybe there are any other tricks that I don't remember?

@google-cla google-cla Bot added the cla: yes CLA signed. label Jul 24, 2020
@Dor1s

Dor1s commented Jul 24, 2020

Copy link
Copy Markdown
Contributor Author

/gcbrun

@Dor1s Dor1s requested a review from mbarbella-chromium July 24, 2020 23:31

@inferno-chromium inferno-chromium left a comment

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No good way to test, just can check docker build on the images itself. Since it is broken anyway, it is totally fine to try directly.


# Install TensorRT. Requires that libcudnn7 is installed above.
RUN apt-get install -y --no-install-recommends \
libnvinfer6=6.0.1-1+cuda10.1 \

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: CUDA_VERSION=6.0.1-1+cuda10.1 at start of command and then use in three places as $CUDA_VERSION ?
Same for libcudnn7 above.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks for the suggestion! I don't think that would improve readability a lot, plus it's not CUDA version (cuda is 10.1), so I'm just leaving this like suggested in the docs: https://www.tensorflow.org/install/gpu#ubuntu_1604_cuda_101

@Dor1s

Dor1s commented Jul 27, 2020

Copy link
Copy Markdown
Contributor Author

/gcbrun

@Dor1s Dor1s merged commit d6390c2 into master Jul 27, 2020
@oliverchang oliverchang deleted the tf23_gpu branch January 21, 2021 02:28
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cla: yes CLA signed.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants