Pytorch 1.12.1 - and Disable kineto due to cupti conflict #136

conda-forge-linter · 2022-07-31T22:00:26Z

Hi! This is the friendly automated conda-forge-webservice.

I've rerendered the recipe as instructed in #135.

Here's a checklist to do before merging.

Bump the build number if needed.

conda-forge-linter · 2022-07-31T22:00:31Z

Hi! This is the friendly automated conda-forge-linting service.

I just wanted to let you know that I linted all conda-recipes in your PR (recipe) and found it was in an excellent condition.

hmaarrfk · 2022-08-01T07:31:02Z

This isn't having the desired effect, the 10.2 build shows:

2022-08-01T03:53:43.6127771Z    INFO (pytorch,lib/python3.10/site-packages/torch/lib/libtorch_cpu.so): Needed DSO lib/libcupti.so.10.2 found in conda-forge::cudatoolkit-10.2.89-h713d32c_10
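For anyone repeating this check on a downloaded CI log, a minimal sketch follows. The function name `find_cupti_links` and the log file path are hypothetical, and the pattern assumes the log uses the same "Needed DSO ... found in ..." wording as the line quoted above.

```shell
# Hypothetical helper: print every log line that reports a libcupti dependency.
# Assumes the "Needed DSO lib/libcupti.so..." wording seen in the line above.
find_cupti_links() {
    # $1: path to a saved build log
    grep -E 'Needed DSO lib/libcupti\.so' "$1"
}
```

Calling `find_cupti_links build.log` prints the matching lines; grep exits non-zero when the log reports no libcupti linkage.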

ngam · 2022-08-01T13:23:48Z

recipe/build_pytorch.sh

 # CUPTI seems to cause trouble when users install a version of
 # cudatoolkit different than the one specified at compile time.
 # https://github.com/conda-forge/pytorch-cpu-feedstock/issues/135
-export LIBKINETO_NOCUPTI=ON
+export USE_KINETO=ON


Did you forget to set it to "OFF" here?
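For reference, the change being asked about would presumably look like the sketch below. It assumes the PyTorch build picks up `USE_KINETO` from the environment like its other `USE_*` switches; the exact line that was eventually committed is not shown in this thread.

```shell
# Sketch only: turn the kineto profiler off for the build.
# Assumption: the build reads USE_KINETO from the environment.
export USE_KINETO=OFF
```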

ngam · 2022-08-01T13:24:45Z

Suggestion: limit the builds to cuda112 and only use one cuda arch (e.g. 8.0) for testing here so that the ci finishes within six hours
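A rough sketch of the single-arch part of this suggestion, assuming the build script controls the arch list through PyTorch's `TORCH_CUDA_ARCH_LIST` environment variable. How the feedstock actually sets that variable, and how the CI matrix would be limited to cuda112, is not shown in this thread.

```shell
# Sketch only: build for a single CUDA arch to shorten a diagnostic CI run.
# Assumption: the build honours TORCH_CUDA_ARCH_LIST; this is not a permanent setting.
export TORCH_CUDA_ARCH_LIST="8.0"
```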

hmaarrfk · 2022-08-01T13:33:30Z

> Suggestion: limit the builds to cuda112 and only use one cuda arch (e.g. 8.0) for testing here so that the ci finishes within six hours

Seems like that would complicate the process even more. I would rather do one thing at a time and address the cupti bugs, instead of revamping the build system at the same time.

ngam · 2022-08-01T13:36:22Z

> > Suggestion: limit the builds to cuda112 and only use one cuda arch (e.g. 8.0) for testing here so that the ci finishes within six hours
>
> Seems like that would complicate the process even more. I would rather do one thing at a time and address the cupti bugs, instead of revamping the build system at the same time.

Fine with me. My suggestion was merely to get this to finish within six hours, for diagnostics within this PR only, not as a permanent change. So once we see whether disabling kineto actually does the trick, we can go back to building all the cuda arches as before. This would remove the need to build this locally (unless you got the info below from the CI?)

2022-08-01T03:53:43.6127771Z    INFO (pytorch,lib/python3.10/site-packages/torch/lib/libtorch_cpu.so): Needed DSO lib/libcupti.so.10.2 found in conda-forge::cudatoolkit-10.2.89-h713d32c_10

hmaarrfk · 2022-08-01T13:42:56Z

The 10.2 builds finish within the allotted time (sometimes) so you can check those. That is what I did in what you quoted.

ngam · 2022-08-01T13:48:46Z

> 10.2 builds finish within the allotted time (sometimes)

Ah, didn't know that. Okay then, we can check that in the meantime!

hmaarrfk · 2022-08-01T18:25:58Z

Well kineto is properly disabled, and cupti is nowhere to be found in the logs for now. I think we are good. I'm going to build cuda locally.

ngam · 2022-08-05T17:51:23Z

Let's try to change this into 1.12.1?

hmaarrfk · 2022-08-05T19:13:27Z

I guess I had the builds ready, but I don't see the sense in having too much uploaded.

ngam · 2022-08-05T19:24:41Z

> I guess I had the builds ready, but I don't see the sense in having too much uploaded.

If you do, just upload them! Sorry, I only wrote that thinking you hadn't built them already!

Upload them and we can deal with the 1.12.1 stuff later

ngam · 2022-08-05T19:27:08Z

Also, I am not sure if this is their final 1.12.1 tag... pytorch/pytorch@v1.12.0...v1.12.1

They didn't announce it under releases, but it has already appeared on pypi, etc.

hmaarrfk · 2022-08-05T19:29:05Z

I'm pretty swamped right now with the day job. I think I can get this in by next week.

hmaarrfk · 2022-08-05T19:30:41Z

Also.

pytorch/pytorch#81680 (comment)

ngam · 2022-08-05T19:33:42Z

Take your time :)

hmaarrfk · 2022-08-10T12:53:28Z

GPU build logs
log_files.zip

hmaarrfk · 2022-08-10T12:53:42Z

GPU builds are being uploaded now.

dummy commit for rerendering

486c6fc

conda-forge-linter requested review from benjaminrwilson, hmaarrfk and sodre as code owners July 31, 2022 22:00

conda-forge-linter mentioned this pull request Jul 31, 2022

pytorch 1.12 -- libcupti.so.11.2 not found -- cudatoolkit >11.2 #135

Closed

1 task

conda-forge-webservices[bot] and others added 3 commits July 31, 2022 22:02

MNT: Re-rendered with conda-build 3.21.9, conda-smithy 3.21.0, and conda-forge-pinning 2022.07.31.08.11.05

4fecd8e

Disable CUPTI

6f6f137

Update meta.yaml

4c45910

hmaarrfk changed the title ~~MNT: rerender~~ Pytorch 1.12 - Disable cupti Jul 31, 2022

hmaarrfk changed the title ~~Pytorch 1.12 - Disable cupti~~ Pytorch 1.12 - Disable kineto due to cupti conflict Aug 1, 2022

Disable kineto alltogether

ee36b63

ngam reviewed Aug 1, 2022

disable kineto

aa111fa

Update to 1.12.1

93d901f

hmaarrfk changed the title ~~Pytorch 1.12 - Disable kineto due to cupti conflict~~ Pytorch 1.12.1 - and Disable kineto due to cupti conflict Aug 5, 2022

hmaarrfk merged commit 2a3f37b into conda-forge:main Aug 10, 2022

h-vetinari mentioned this pull request May 6, 2023

Re-enable kineto submodule #76

Open
