NVTX windows include and link fixes by gedoensmax · Pull Request #16831 · microsoft/onnxruntime

gedoensmax · 2023-07-24T11:43:08Z

Description

On Windows, the NVTX headers are not duplicated into the regular CUDA include directory. On Linux they are:

(base) maximilianm@maximilianm-dt-linux:~$ ls /usr/local/cuda/include/nvtx3 | grep nvTool
nvToolsExt.h
nvToolsExtCuda.h
nvToolsExtCudaRt.h
nvToolsExtOpenCL.h
nvToolsExtSync.h
(base) maximilianm@maximilianm-dt-linux:~$ ls /usr/local/cuda/include | grep nvTool
nvToolsExt.h
nvToolsExtCuda.h
nvToolsExtCudaRt.h
nvToolsExtOpenCL.h
nvToolsExtSync.h

Would you prefer handling this via the added defines, or should the include simply be changed to nvtx3/ ?

Also, no library linking is needed on Windows, and the library is not even present there.
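For readers following along, here is a minimal sketch of the second option from the question above (including from nvtx3/ directly). It is not the code from this PR: `SKETCH_HAS_NVTX` and `ScopedRange` are hypothetical names chosen for illustration, and the sketch assumes a C++17 compiler so that `__has_include` is available. It only uses the header-only nvtx3 location, which is why no library needs to be linked; when the header is missing it falls back to a no-op.

```cpp
// Sketch only: SKETCH_HAS_NVTX and ScopedRange are hypothetical names,
// not identifiers from onnxruntime.
#if defined(__has_include)
#  if __has_include(<nvtx3/nvToolsExt.h>)
#    include <nvtx3/nvToolsExt.h>  // header-only location, present on Windows and Linux
#    define SKETCH_HAS_NVTX 1
#  endif
#endif
#ifndef SKETCH_HAS_NVTX
#  define SKETCH_HAS_NVTX 0  // header not found: profiling ranges become no-ops
#endif

// RAII helper: pushes an NVTX range on construction and pops it on destruction.
class ScopedRange {
 public:
  explicit ScopedRange(const char* name) {
#if SKETCH_HAS_NVTX
    nvtxRangePushA(name);
#else
    (void)name;  // nothing to do without NVTX
#endif
  }

  ~ScopedRange() {
#if SKETCH_HAS_NVTX
    nvtxRangePop();
#endif
  }

  // A range marks one scope, so copying it would unbalance push/pop.
  ScopedRange(const ScopedRange&) = delete;
  ScopedRange& operator=(const ScopedRange&) = delete;
};
```

Usage would look like `{ ScopedRange r("MatMul"); /* work to profile */ }`. The alternative discussed in the description (a define that selects the include path from the build system) keeps the source unchanged on Linux at the cost of one more build flag.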

gedoensmax · 2023-07-24T11:45:05Z

@chilo-ms and @hariharans29 for review.

hariharans29 · 2023-07-31T21:01:08Z

@wschin - Can you please take a look ?

chilo-ms · 2023-07-31T21:08:22Z

/azp run Windows GPU TensorRT CI Pipeline, onnxruntime-binary-size-checks-ci-pipeline, onnxruntime-python-checks-ci-pipeline, orttraining-linux-ci-pipeline, orttraining-linux-gpu-ci-pipeline, orttraining-ortmodule-distributed, Linux QNN CI Pipeline, Windows ARM64 QNN CI Pipeline

azure-pipelines · 2023-07-31T21:08:50Z

Azure Pipelines successfully started running 7 pipeline(s).

chilo-ms · 2023-07-31T21:12:20Z

/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux Nuphar CI Pipeline, Linux OpenVINO CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, Windows CPU CI Pipeline, Windows GPU CI Pipeline

azure-pipelines · 2023-07-31T21:12:57Z

Azure Pipelines successfully started running 9 pipeline(s).

gedoensmax · 2023-08-14T13:06:13Z

@chilo-ms Can we merge this ?

chilo-ms · 2023-08-15T17:26:20Z

I've asked @wschin to help review it.
Also, I saw ORT Web CI failures. Could you merge main and try again?

gedoensmax · 2023-08-15T20:42:11Z

/* Arithmetic FP16 operations only supported on arch >= 5.3 */
#if !defined(__CUDA_ARCH__) || (__CUDA_ARCH__ >= 530) || defined(_NVHPC_CUDA)

These lines were removed in CUDA 12.2, which is why I also added another commit fixing that in common.cuh. Hope that's fine; otherwise just let me know.
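To make the reasoning concrete, the sketch below models the decision as a plain function rather than as preprocessor code. It is an illustration under stated assumptions, not the change made in common.cuh: the function name `needs_fallback_half_operators` is hypothetical, `cuda_arch == 0` stands in for "`__CUDA_ARCH__` is not defined" (the host compilation pass), the `_NVHPC_CUDA` case from the quoted guard is ignored, and the version number uses the `CUDA_VERSION` encoding (major * 1000 + minor * 10, so 12.2 is 12020).

```cpp
// Hypothetical model of when a project must supply its own `half`
// arithmetic operators instead of relying on the toolkit's.
//
// cuda_version: value of CUDA_VERSION (e.g. 12020 for CUDA 12.2).
// cuda_arch:    value of __CUDA_ARCH__, or 0 when it is undefined (host pass).
constexpr bool needs_fallback_half_operators(int cuda_version, int cuda_arch) {
  // From CUDA 12.2 on, the guard quoted above is gone, so the toolkit
  // defines the operators for every architecture. A second definition
  // would be ambiguous.
  if (cuda_version >= 12020) {
    return false;
  }
  // Before 12.2, the toolkit operators exist on the host pass and for
  // arch >= 5.3; only older device architectures need a fallback.
  const bool toolkit_provides = (cuda_arch == 0) || (cuda_arch >= 530);
  return !toolkit_provides;
}

// Compile-time checks of the cases discussed in this thread.
static_assert(needs_fallback_half_operators(12010, 520), "pre-12.2, sm_52");
static_assert(!needs_fallback_half_operators(12010, 530), "pre-12.2, sm_53");
static_assert(!needs_fallback_half_operators(12020, 520), "12.2, sm_52");
```

In real code the same condition would be written with `#if` on `CUDA_VERSION` and `__CUDA_ARCH__`. Defining the fallback operators when the toolkit already provides them is the kind of duplication that yields errors such as "more than one operator "+" matches these operands".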

chilo-ms · 2023-08-15T22:02:44Z

/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux Nuphar CI Pipeline, Linux OpenVINO CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, Windows CPU CI Pipeline, Windows GPU CI Pipeline

azure-pipelines · 2023-08-15T22:03:28Z

Azure Pipelines successfully started running 9 pipeline(s).

snnn · 2023-08-16T04:22:23Z

/azp run Linux QNN CI Pipeline, Windows ARM64 QNN CI Pipeline, Windows GPU TensorRT CI Pipeline, onnxruntime-binary-size-checks-ci-pipeline, orttraining-linux-ci-pipeline, orttraining-linux-gpu-ci-pipeline, orttraining-ortmodule-distributed

azure-pipelines · 2023-08-16T04:22:53Z

Azure Pipelines successfully started running 7 pipeline(s).

snnn · 2023-08-16T19:01:24Z

I tried it locally. It works very well.

hariharans29 requested a review from wschin July 31, 2023 21:00

microsoft deleted a comment from azure-pipelines Bot Jul 31, 2023

nvtx include and link switch for windows

0f232c5

gedoensmax force-pushed the nvtx_win branch from 3462515 to 0f232c5 Compare August 15, 2023 20:23

fix: CUDA 12.2 defines half operators for all arches

cee050f

chilo-ms requested a review from snnn August 15, 2023 22:05

snnn approved these changes Aug 16, 2023

snnn merged commit 7b9d1f1 into microsoft:main Aug 16, 2023

snnn mentioned this pull request Aug 16, 2023

[Build] OnnxRuntime execution provider treat warning as error for CUDA 12.2.1 build on Windows #16942

Closed

This was referenced Jan 22, 2024

[Build] CUDA EP compile errors with Cuda 12.2 #16713

Closed

[Build] error: more than one operator "+" matches these operands #16870

Closed

[Build] build errors with 1.15.1 and CUDA enabled #17531

Closed


4 participants