Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Upgrade cuda to 9 and cudnn version to 7. #14773

Merged
merged 6 commits into from Dec 8, 2017
Merged

Upgrade cuda to 9 and cudnn version to 7. #14773

merged 6 commits into from Dec 8, 2017

Conversation

gunan
Copy link
Contributor

@gunan gunan commented Nov 21, 2017

No description provided.

@yifeif
Copy link
Contributor

yifeif commented Nov 22, 2017

@gunan looks like the jenkins GPU failures are new and might be related to the upgrade?

@yifeif yifeif self-assigned this Nov 22, 2017
@gunan gunan added the kokoro:force-run Tests on submitted change label Nov 22, 2017
@kokoro-team kokoro-team removed the kokoro:force-run Tests on submitted change label Nov 22, 2017
@gunan
Copy link
Contributor Author

gunan commented Nov 22, 2017

Looks like there was an eigen bug, we will need #14770 merged, then this change should be good to go.

@gunan
Copy link
Contributor Author

gunan commented Dec 7, 2017

@zheng-xq looks like another build issue with cuda 9 crept in. Have you seen this one before?
https://ci.tensorflow.org/job/tensorflow-pull-requests-gpu/7826/consoleFull

@gunan
Copy link
Contributor Author

gunan commented Dec 8, 2017

Jenkins, test this please.

@gunan gunan added the kokoro:force-run Tests on submitted change label Dec 8, 2017
@kokoro-team kokoro-team removed the kokoro:force-run Tests on submitted change label Dec 8, 2017
@gunan
Copy link
Contributor Author

gunan commented Dec 8, 2017

@zheng-xq now the transpose_op_test is timing out. Do you think it could be caused by the cuda9 upgrade?

@gunan
Copy link
Contributor Author

gunan commented Dec 8, 2017

Jenkins , test this please.

@zheng-xq
Copy link
Contributor

zheng-xq commented Dec 8, 2017

Feel free to disable some big ones. @yzhwang can take a look afterwards. Thanks!

@gunan
Copy link
Contributor Author

gunan commented Dec 8, 2017

Merging, since python3 tests passed on kokoro despite the failure on jenkins.

@gunan gunan merged commit b43d0f3 into tensorflow:master Dec 8, 2017
@gunan gunan deleted the cuda9 branch December 8, 2017 07:19
@flx42
Copy link
Contributor

flx42 commented Dec 8, 2017

@gunan
Copy link
Contributor Author

gunan commented Dec 8, 2017

good idea! I can send a followup PR for that.

@snnn
Copy link
Contributor

snnn commented Dec 8, 2017

Hi @gunan

After this commit, we'll use Ubuntu 16 for GPU build but Ubuntu 14 for CPU build. This will cause a lot confusions. Can you unify that?

@gunan
Copy link
Contributor Author

gunan commented Dec 8, 2017

We still need to continue testing for ubuntu 14, however nvidia docker does not have 14.04 support anymore. So the version skew is partially intentional right now.

@flx42
Copy link
Contributor

flx42 commented Dec 8, 2017

Wait, does it matter for building a Docker image anyway?
You can still use nvidia-docker 1.0 on trusty, or you can try to use the xenial packages for 2.0, but you will have to setup the new runtime manually.

@gunan
Copy link
Contributor Author

gunan commented Dec 8, 2017

All our CI machines are on ubuntu 16.04. But I think the concern is the CPU docker images are still based off of ubuntu 14.04, while the base image for CUDA enabled packages are ubuntu 16.04

@alanpurple
Copy link
Contributor

alanpurple commented Dec 22, 2017

any plan for cuda 9.1?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

8 participants