[Don't merge] Fix dynamic array build error on MSVC. #134140

xuhancn · 2024-08-21T19:37:58Z

MSVC don't support dynamic array. Please use std::vector to instead of it.
Ref: https://stackoverflow.com/questions/56555406/creating-dynamic-sized-array-using-msvc-c-compiler

Reproduce UT:

pytest test/inductor/test_cpu_repro.py -v -k test_reduction_with_dynamic_threads

Error msg:

C:/Users/Xuhan/AppData/Local/Temp/tmpncykej5v/a4/ca4534cazplidnf7vopaaxaifqkjiyhxm3h2gsylgztputbaeybx.cpp(13): error C2131: expression did not evaluate to a constant
C:/Users/Xuhan/AppData/Local/Temp/tmpncykej5v/a4/ca4534cazplidnf7vopaaxaifqkjiyhxm3h2gsylgztputbaeybx.cpp(13): note: failure was caused by a read of a variable outside its lifetime
C:/Users/Xuhan/AppData/Local/Temp/tmpncykej5v/a4/ca4534cazplidnf7vopaaxaifqkjiyhxm3h2gsylgztputbaeybx.cpp(13): note: see usage of 'max_threads'
C:/Users/Xuhan/AppData/Local/Temp/tmpncykej5v/a4/ca4534cazplidnf7vopaaxaifqkjiyhxm3h2gsylgztputbaeybx.cpp(16): error C3863: array type 'float [max_threads]' is not assignable

Genarated code:

#include "C:/Users/Xuhan/AppData/Local/Temp/tmpt6mxcjzi/j2/cj22tgrdgh42wbunl7gdptg2lintcziox2kmr7rdbcc6n2njrhgx.h"
extern "C" __declspec(dllexport) void kernel(const float* in_ptr0,
                       const float* in_ptr1,
                       float* out_ptr0,
                       float* out_ptr1)
{
    {
        {
            float tmp_acc0 = 0;
            at::vec::Vectorized<float> tmp_acc0_vec = at::vec::Vectorized<float>(0);
            int max_threads = omp_get_max_threads();
            float tmp_acc0_arr[max_threads];
            for (int tid = 0; tid < max_threads; tid++)
            {
                tmp_acc0_arr[tid] = 0;
            }
            at::vec::Vectorized<float> tmp_acc0_vec_arr[max_threads];
            for (int tid = 0; tid < max_threads; tid++)
            {
                tmp_acc0_vec_arr[tid] = at::vec::Vectorized<float>(0);
            }
            #pragma omp parallel

Fixed by this PR and tested on local machine:

cc @peterjc123 @mszhanyi @skyline75489 @nbcsm @iremyux @Blackhex @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10 @voznesenskym @penguinwu @EikanWang @Guobing-Chen @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang

pytorch-bot · 2024-08-21T19:38:01Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/134140

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❌ 14 New Failures

As of commit c692068 with merge base 1da3a04 ():

NEW FAILURES - The following jobs have failed:

inductor / linux-jammy-cpu-py3.8-gcc11-inductor / test (inductor_avx2, 1, 2, amz2023.linux.10xlarge.avx2) (gh)
inductor/test_torchinductor.py::CpuTests::test_any_cpu
pull / linux-focal-cuda12.1-py3.10-gcc9 / test (default, 3, 5, amz2023.linux.4xlarge.nvidia.gpu) (gh)
inductor/test_cpu_repro.py::CPUReproTests::test_bool_reduction_vec
pull / linux-focal-cuda12.1-py3.10-gcc9-sm86 / test (default, 3, 5, amz2023.linux.g5.4xlarge.nvidia.gpu) (gh)
inductor/test_cpu_repro.py::CPUReproTests::test_bool_reduction_vec
pull / linux-focal-py3.11-clang10 / test (default, 3, 4, amz2023.linux.2xlarge) (gh)
inductor/test_cpu_repro.py::CPUReproTests::test_bool_reduction_vec
pull / linux-focal-py3.12-clang10 / test (default, 3, 4, amz2023.linux.2xlarge) (gh)
inductor/test_cpu_repro.py::CPUReproTests::test_bool_reduction_vec
pull / linux-focal-py3.12-clang10-experimental-split-build / test (default, 3, 3, amz2023.linux.2xlarge) (gh)
inductor/test_cpu_repro.py::CPUReproTests::test_bool_reduction_vec
pull / linux-focal-py3.8-clang10 / test (default, 3, 4, amz2023.linux.2xlarge) (gh)
inductor/test_cpu_repro.py::CPUReproTests::test_bool_reduction_vec
pull / linux-jammy-py3.10-clang15-asan / test (default, 4, 6, amz2023.linux.4xlarge) (gh)
inductor/test_cpu_repro.py::CPUReproTests::test_bool_reduction_vec
pull / linux-jammy-py3.10-clang15-asan / test (default, 5, 6, amz2023.linux.4xlarge) (gh)
inductor/test_torchinductor.py::CpuTests::test_multilayer_any_cpu
pull / linux-jammy-py3.8-gcc11 / test (default, 3, 4, amz2023.linux.2xlarge) (gh)
inductor/test_cpu_repro.py::CPUReproTests::test_bool_reduction_vec
trunk / linux-focal-cuda12.4-py3.10-gcc9-experimental-split-build-test / test (default, 3, 5, amz2023.linux.4xlarge.nvidia.gpu) (gh)
inductor/test_cpu_repro.py::CPUReproTests::test_bool_reduction_vec
trunk / linux-focal-cuda12.4-py3.10-gcc9-experimental-split-build-test / test (nogpu_AVX512, 1, 1, amz2023.linux.2xlarge) (gh)
inductor/test_cpu_repro.py::CPUReproTests::test_bool_reduction_vec
trunk / linux-focal-cuda12.4-py3.10-gcc9-experimental-split-build-test / test (nogpu_NO_AVX2, 1, 1, amz2023.linux.2xlarge) (gh)
inductor/test_cpu_repro.py::CPUReproTests::test_bool_reduction_vec
trunk / linux-focal-cuda12.4-py3.10-gcc9-sm86 / test (default, 3, 5, amz2023.linux.g5.4xlarge.nvidia.gpu) (gh)
inductor/test_cpu_repro.py::CPUReproTests::test_bool_reduction_vec

This comment was automatically generated by Dr. CI and updates every 15 minutes.

xuhancn · 2024-08-21T21:52:37Z

Try to fix it by unique_ptr: #134156

@jansel

MSVC don't support dynamic array. Ref: https://stackoverflow.com/questions/56555406/creating-dynamic-sized-array-using-msvc-c-compiler We tried to solutions: 1. use std::vector to instead of it in previous PR: #134140, but it changed variable's type and failed at UTs. 2. Use `std::unique_ptr` to instead of it in PR: #134156, @jansel reviewed and give comments: #134156 (review). It is make sense, allocation memory maybe make code run slower. 3. Use fixed size array to instead of it in PR: #134210, fixed size is hard to process the situlation, reserved size if small than CPU number. > a. Use min() function limited is local test failed: #134210 (comment) > b. Dynamic select fixed size or dynamic array: #134210 (comment) . It makes code too complex to maintains. Discussed with origin PR(#115620) author @zhuhaozhe, we think: 1. MSVC it the only one compiler, which not support VLA. 2. MSVC it worse performance than other compilers, use `std::unique_ptr` for MSVC and make it works. 3. For other compilers, keep using current `VLA` code. 4. For Windows users, they can use `clang-cl` or `icx` to get better performance than MSVC. 5. Discussed with @jansel , we need to move compiler check to python side, and make output code cleaner. Reproduce UT: ```cmd pytest test/inductor/test_cpu_repro.py -v -k test_reduction_with_dynamic_threads ``` Error msg: ```cmd C:/Users/Xuhan/AppData/Local/Temp/tmpncykej5v/a4/ca4534cazplidnf7vopaaxaifqkjiyhxm3h2gsylgztputbaeybx.cpp(13): error C2131: expression did not evaluate to a constant C:/Users/Xuhan/AppData/Local/Temp/tmpncykej5v/a4/ca4534cazplidnf7vopaaxaifqkjiyhxm3h2gsylgztputbaeybx.cpp(13): note: failure was caused by a read of a variable outside its lifetime C:/Users/Xuhan/AppData/Local/Temp/tmpncykej5v/a4/ca4534cazplidnf7vopaaxaifqkjiyhxm3h2gsylgztputbaeybx.cpp(13): note: see usage of 'max_threads' C:/Users/Xuhan/AppData/Local/Temp/tmpncykej5v/a4/ca4534cazplidnf7vopaaxaifqkjiyhxm3h2gsylgztputbaeybx.cpp(16): error C3863: array type 'float [max_threads]' is not assignable ``` Genarated code: ```c++ #include "C:/Users/Xuhan/AppData/Local/Temp/tmpt6mxcjzi/j2/cj22tgrdgh42wbunl7gdptg2lintcziox2kmr7rdbcc6n2njrhgx.h" extern "C" __declspec(dllexport) void kernel(const float* in_ptr0, const float* in_ptr1, float* out_ptr0, float* out_ptr1) { { { float tmp_acc0 = 0; at::vec::Vectorized<float> tmp_acc0_vec = at::vec::Vectorized<float>(0); int max_threads = omp_get_max_threads(); float tmp_acc0_arr[max_threads]; for (int tid = 0; tid < max_threads; tid++) { tmp_acc0_arr[tid] = 0; } at::vec::Vectorized<float> tmp_acc0_vec_arr[max_threads]; for (int tid = 0; tid < max_threads; tid++) { tmp_acc0_vec_arr[tid] = at::vec::Vectorized<float>(0); } #pragma omp parallel ``` Pull Request resolved: #134221 Approved by: https://github.com/zhuhaozhe, https://github.com/jansel

fix dynamic array error on MSVC.

957b69e

pytorch-bot bot added ciflow/inductor module: inductor labels Aug 21, 2024

xuhancn added module: windows Windows support for PyTorch ciflow/trunk Trigger trunk jobs on your pull request topic: not user facing topic category intel This tag is for PR from Intel labels Aug 21, 2024

xuhancn mentioned this pull request Aug 21, 2024

[RFC] Add new CPP builder for inductor on pytorch Windows #124245

Open

xuhancn marked this pull request as ready for review August 21, 2024 19:45

xuhancn requested review from desertfire, ezyang, jansel, jgong5 and malfet August 21, 2024 19:45

pytorchbot added the open source label Aug 21, 2024

rename var name for easy understand.

c692068

xuhancn changed the title ~~[inductor] Fix dynamic array error on MSVC.~~ [inductor] Fix dynamic build array error on MSVC. Aug 21, 2024

xuhancn changed the title ~~[inductor] Fix dynamic build array error on MSVC.~~ [inductor] Fix dynamic array build error on MSVC. Aug 21, 2024

xuhancn removed request for desertfire, ezyang, jansel, jgong5 and malfet August 21, 2024 20:54

xuhancn marked this pull request as draft August 21, 2024 20:54

xuhancn mentioned this pull request Aug 21, 2024

[Don't merge] fix dynamic size array(vla) build error on msvc v2 #134156

Closed

xuhancn changed the title ~~[inductor] Fix dynamic array build error on MSVC.~~ [Don't merge] Fix dynamic array build error on MSVC. Aug 21, 2024

This was referenced Aug 22, 2024

[Don't merge] fix dynamic size array(vla) build error on msvc v3 #134210

Closed

[inductor] fix dynamic size array(vla) build error on msvc v4 #134221

Closed

xuhancn closed this Aug 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Don't merge] Fix dynamic array build error on MSVC. #134140

[Don't merge] Fix dynamic array build error on MSVC. #134140

Uh oh!

xuhancn commented Aug 21, 2024 •

edited

Loading

Uh oh!

pytorch-bot bot commented Aug 21, 2024 •

edited

Loading

Uh oh!

xuhancn commented Aug 21, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[Don't merge] Fix dynamic array build error on MSVC. #134140

[Don't merge] Fix dynamic array build error on MSVC. #134140

Uh oh!

Conversation

xuhancn commented Aug 21, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Aug 21, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/134140

❌ 14 New Failures

Uh oh!

xuhancn commented Aug 21, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

xuhancn commented Aug 21, 2024 •

edited

Loading

pytorch-bot bot commented Aug 21, 2024 •

edited

Loading