Compilation of <torch/extension.h> error on Windows CUDA 11.5 #69460

atalman · 2021-12-06T17:36:15Z

We get the following errors when compiling with CUDA 11.5 on Windows:

C:\actions-runner_work\pytorch\pytorch\build\win_tmp\build\torch\include\pybind11\cast.h(1429): error: too few arguments for template template parameter "Tuple"
detected during instantiation of class "pybind11::detail::tuple_caster<Tuple, Ts...> [with Tuple=std::pair, Ts=<T1, T2>]"
(1507): here

C:\actions-runner_work\pytorch\pytorch\build\win_tmp\build\torch\include\pybind11\cast.h(1503): error: too few arguments for template template parameter "Tuple"
detected during instantiation of class "pybind11::detail::tuple_caster<Tuple, Ts...> [with Tuple=std::pair, Ts=<T1, T2>]"
(1507): here

Complete failure log:
https://github.com/pytorch/pytorch/runs/4408796098?check_suite_focus=true

This looks like the same issue as this one:
facebookresearch/pytorch3d#843

Here is the workaround for this issue:
facebookresearch/pytorch3d@cb170ac

cc @ezyang @gchanan @zou3519 @peterjc123 @mszhanyi @skyline75489 @nbcsm @brianjo @mruberry @ngimel @bdhirsh @jbschlosser @malfet @seemethere @pytorch/pytorch-dev-infra

seemethere · 2021-12-06T17:54:15Z

Adding this to the 1.11.0 milestone since this will be a blocker for the 1.11.0 release

malfet · 2021-12-06T18:06:27Z

Grabbing for myself, as it probably involves tweaking pybind a bit

malfet · 2021-12-06T21:29:43Z

I can reproduce it with a hello-world example that does nothing but include pybind11, so this looks like a CUDA compiler bug (the failure happens during the invocation of cicc):

#include <stdio.h>
#include <pybind11/pybind11.h>
__global__ void kernel() {
  printf("Hello World");
}
int main(void) {
 kernel<<<1, 1>>>();
 return cudaDeviceSynchronize();
}

C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.5\bin>nvcc "c:\Users\runneruser\Documents\hello.cu" -o c:\Users\runneruser\Documents\a.exe -IC:\actions-runner\_work\pytorch\pytorch\third_party\pybind11\include -IC:\Jenkins\Miniconda3\include
hello.cu
...
#$ cicc --microsoft_version=1928 --msvc_target_version=1928 --compiler_bindir "C:/Program Files (x86)/Microsoft Visual Studio/2019/BuildTools/VC/Tools/MSVC/14.28.29333/bin/Hostx64/x64/../../../../../../.." --sdk_dir "C:/Program Files (x86)/Windows Kits/10/" --display_error_number --orig_src_file_name "c:/Users/runneruser/Documents/hello.cu" --orig_src_path_name "c:\Users\runneruser\Documents\hello.cu" --allow_managed  -arch compute_52 -m64 --no-version-ident -ftz=0 -prec_div=1 -prec_sqrt=1 -fmad=1 --include_file_name "tmpxft_00001bb8_00000000-7_hello.fatbin.c" -tused --gen_module_id_file --module_id_file_name "C:/Users/RUNNER~1/AppData/Local/Temp/2/tmpxft_00001bb8_00000000-8_hello.module_id" --gen_c_file_name "C:/Users/RUNNER~1/AppData/Local/Temp/2/tmpxft_00001bb8_00000000-10_hello.cudafe1.c" --stub_file_name "C:/Users/RUNNER~1/AppData/Local/Temp/2/tmpxft_00001bb8_00000000-10_hello.cudafe1.stub.c" --gen_device_file_name "C:/Users/RUNNER~1/AppData/Local/Temp/2/tmpxft_00001bb8_00000000-10_hello.cudafe1.gpu"  "C:/Users/RUNNER~1/AppData/Local/Temp/2/tmpxft_00001bb8_00000000-13_hello.cpp1.ii" -o "C:/Users/RUNNER~1/AppData/Local/Temp/2/tmpxft_00001bb8_00000000-10_hello.ptx"
C:\actions-runner\_work\pytorch\pytorch\third_party\pybind11\include\pybind11\detail/common.h(810): warning #1388-D: base class dllexport/dllimport specification differs from that of the derived class

C:\actions-runner\_work\pytorch\pytorch\third_party\pybind11\include\pybind11\pytypes.h(338): warning #1388-D: base class dllexport/dllimport specification differs from that of the derived class

C:\actions-runner\_work\pytorch\pytorch\third_party\pybind11\include\pybind11\pytypes.h(387): warning #1394-D: field of class type without a DLL interface used in a class with a DLL interface

C:\actions-runner\_work\pytorch\pytorch\third_party\pybind11\include\pybind11\pytypes.h(387): warning #1394-D: field of class type without a DLL interface used in a class with a DLL interface

C:\actions-runner\_work\pytorch\pytorch\third_party\pybind11\include\pybind11\pytypes.h(387): warning #1394-D: field of class type without a DLL interface used in a class with a DLL interface

C:\actions-runner\_work\pytorch\pytorch\third_party\pybind11\include\pybind11\cast.h(567): error: too few arguments for template template parameter "Tuple"
          detected during instantiation of class "pybind11::detail::tuple_caster<Tuple, Ts...> [with Tuple=std::pair, Ts=<T1, T2>]"
(648): here

C:\actions-runner\_work\pytorch\pytorch\third_party\pybind11\include\pybind11\cast.h(644): error: too few arguments for template template parameter "Tuple"
          detected during instantiation of class "pybind11::detail::tuple_caster<Tuple, Ts...> [with Tuple=std::pair, Ts=<T1, T2>]"
(648): here

2 errors detected in the compilation of "c:/Users/runneruser/Documents/hello.cu".
# --error 0x1 --

For debugging: hello.cpp1.ii

zasdfgbnm · 2021-12-10T21:51:14Z

Narrowed down further:

#include <utility>

// Base implementation for std::tuple and std::pair
template <template<typename...> class Tuple, typename... Ts> class tuple_caster {
    using type = Tuple<Ts...>;
};

template <typename T1, typename T2> class type_caster
    : public tuple_caster<std::pair, T1, T2> {};


__global__ void kernel() {
	printf("Hello World");
}
int main(void) {
	kernel <<<1, 1>>> ();
	return cudaDeviceSynchronize();
}
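
For comparison, the same template shape is valid standard C++: a variadic template template parameter is allowed to bind to a two-parameter template such as std::pair. Below is a minimal host-only sketch, assuming a conforming C++11 (or later) host compiler. The name `pair_caster` is hypothetical and stands in for pybind11's real `type_caster`; the `static_assert` checks are additions for illustration and are not part of the reproducer above.

```cpp
#include <tuple>
#include <type_traits>
#include <utility>

// Same shape as the reduced tuple_caster: a variadic template template parameter.
template <template <typename...> class Tuple, typename... Ts>
struct tuple_caster {
    using type = Tuple<Ts...>;
};

// Hypothetical stand-in for the std::pair specialization in the reproducer.
// std::pair has exactly two type parameters, yet it may bind to
// `template <typename...> class Tuple`.
template <typename T1, typename T2>
struct pair_caster : tuple_caster<std::pair, T1, T2> {};

// Compile-time checks: both std::pair and std::tuple instantiate as expected.
static_assert(std::is_same<pair_caster<int, double>::type,
                           std::pair<int, double>>::value,
              "std::pair should bind to the variadic template template parameter");
static_assert(std::is_same<tuple_caster<std::tuple, int, char, long>::type,
                           std::tuple<int, char, long>>::value,
              "std::tuple should bind to the variadic template template parameter");
```

If this translation unit compiles with the host compiler alone, that suggests the rejection comes from the CUDA front end rather than from the code itself.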

malfet · 2022-02-15T19:00:19Z

We need to document this regression in the nvcc compiler from CUDA 11.5 on Windows.

mszhanyi · 2022-02-16T09:54:22Z

@malfet @atalman
I've created a project, "GPU and CUDA regression on Windows", to collect GPU and CUDA regressions.

MinttHu · 2022-06-28T03:25:32Z

My system is Windows 10 + CUDA 11.3. My problem is that when I use "torch.utils.cpp_extension.load_inline" to build a C++/CUDA extension, it shows: "ninja: build stopped: subcommand failed."

3a1b2c3 · 2022-08-04T12:15:20Z

Any workarounds?

atalman · 2022-09-23T15:55:58Z

@malfet @ptrblck Since we now have CUDA 11.7, I will rerun the tests on it to see whether this issue is resolved there.

atalman · 2022-10-14T00:10:45Z

Looks like this issue is resolved in 11.7. I am not observing the failure anymore on #85966
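
For builds that still have to support the older toolkits, the affected compiler can be detected at compile time. Below is a minimal sketch, assuming the bug affects nvcc 11.5 and 11.6 on Windows only: 11.5 failing and 11.7 passing are reported in this thread, while 11.6 is an assumption based on the linked downstream report. `nvcc_rejects_tuple_caster`, `kOnWindows` and `kAffectedNvcc` are hypothetical names, not PyTorch or pybind11 APIs. `__CUDACC_VER_MAJOR__` and `__CUDACC_VER_MINOR__` are the version macros nvcc defines for itself.

```cpp
// Hypothetical helper: true for the nvcc versions assumed to hit this bug.
constexpr bool nvcc_rejects_tuple_caster(int major, int minor, bool on_windows) {
    return on_windows && major == 11 && (minor == 5 || minor == 6);
}

#if defined(_WIN32)
constexpr bool kOnWindows = true;
#else
constexpr bool kOnWindows = false;
#endif

#if defined(__CUDACC__)
// Compiled by nvcc: use the version macros nvcc defines for itself.
constexpr bool kAffectedNvcc = nvcc_rejects_tuple_caster(
    __CUDACC_VER_MAJOR__, __CUDACC_VER_MINOR__, kOnWindows);
#else
// Plain host compilation is not affected.
constexpr bool kAffectedNvcc = false;
#endif

// Sanity checks for the assumed version range.
static_assert(nvcc_rejects_tuple_caster(11, 5, true), "11.5 on Windows is affected");
static_assert(!nvcc_rejects_tuple_caster(11, 7, true), "11.7 is reported as fixed");
static_assert(!nvcc_rejects_tuple_caster(11, 5, false), "only Windows is assumed affected");
```

A .cu file could place `static_assert(!kAffectedNvcc, "...")` ahead of any pybind11 include, so an affected toolkit fails with a readable message instead of the tuple_caster error.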

@peterjc123

Reenable aot tests on windows for cuda 11.7 and up
Issue: #69460 seems to be mitigated in CUDA 11.7, hence re-enable this test.
cc @peterjc123 @mszhanyi @skyline75489 @nbcsm
Pull Request resolved: #87193
Approved by: https://github.com/malfet

atalman · 2022-10-21T12:55:55Z

Removing from milestones, since this issue is not critical for release 1.13 and this is resolved for CUDA 11.7

As CUDA-11.5 is no longer supported, just remove the check.
Fixes #69460

atalman · 2023-02-02T16:01:47Z

With the deprecation of CUDA 11.6, we can resolve this issue. I will post a PR once 11.6 is deprecated from CI.

atalman added module: cuda Related to torch.cuda, and CUDA support in general module: ci Related to continuous integration module: infra Relates to CI infrastructure triage review labels Dec 6, 2021

seemethere added the module: windows Windows support for PyTorch label Dec 6, 2021

seemethere added this to the 1.11.0 milestone Dec 6, 2021

malfet self-assigned this Dec 6, 2021

gchanan added module: cpp-extensions Related to torch.utils.cpp_extension triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module and removed triage review labels Dec 6, 2021

malfet added high priority and removed module: cpp-extensions Related to torch.utils.cpp_extension module: ci Related to continuous integration module: infra Relates to CI infrastructure labels Dec 6, 2021

pytorch-probot bot added the triage review label Dec 6, 2021

malfet added the module: pybind Related to our Python bindings / interactions with other Python libraries label Dec 6, 2021

albanD added module: dependency bug Problem is not caused by us, but caused by an upstream library we use and removed triage review labels Dec 13, 2021

malfet removed their assignment Dec 13, 2021

atalman mentioned this issue Dec 13, 2021

Adding windows cuda 11.5 workflows #69377

Closed

malfet added has workaround module: docs Related to our documentation, both in docs/ and docblocks labels Feb 15, 2022

malfet assigned atalman Feb 15, 2022

mszhanyi added this to In progress in GPU and CUDA regression on Windows Feb 16, 2022

atalman added this to the 1.12.1 milestone May 10, 2022

atalman mentioned this issue Jun 1, 2022

Pytorch CUDA Upgrade to 11.7 and Decommsion 11.3 and 10.2 pytorch/builder#1042

Open

34 tasks

atalman assigned malfet Jun 27, 2022

atalman modified the milestones: 1.12.1, 1.13.0 Jun 30, 2022

Ken1256 mentioned this issue Jul 8, 2022

pytorch 1.12.0 CUDA 11.6 Win10 VS2019 build error SHI-Labs/Neighborhood-Attention-Transformer#43

Closed

malfet unassigned atalman Oct 13, 2022

atalman mentioned this issue Oct 18, 2022

Reenable aot tests on windows for cuda 11.7 and up #87193

Closed

atalman mentioned this issue Oct 19, 2022

Reenable aot tests on windows for cuda 11.7 and up (#87193) #87307

Merged

atalman self-assigned this Oct 20, 2022

atalman modified the milestones: 1.13.0, 1.14.0 Oct 21, 2022

bitRAKE mentioned this issue Oct 23, 2022

Detailed Windows Install Guide? ashawkey/stable-dreamfusion#42

Open

machenmusik mentioned this issue Dec 15, 2022

Consider CUDA 11.3 --> 11.6 to support newer hardware nerfstudio-project/nerfstudio#1080

Closed

malfet added a commit that referenced this issue Jan 17, 2023

Re-enable compilation tests

8845415

As CUDA-11.5 is no longer supported, just remove the check Fixes #69460

malfet mentioned this issue Jan 17, 2023

Re-enable compilation tests #92333

Closed

pytorchmergebot pushed a commit that referenced this issue Feb 2, 2023

Re-enable compilation tests

d15fac9

As CUDA-11.5 is no longer supported, just remove the check Fixes #69460

pytorchmergebot closed this as completed in a07d129 Feb 6, 2023

ksmaze mentioned this issue Feb 20, 2023

.cu files should not include torch/extension.h NVIDIA/apex#1455

Open
