
[Bug][Feature] Added more missing FP16 specializations #4140

Merged: 8 commits into dmlc:master on Jun 27, 2022

Conversation

@ndickson-nvidia (Contributor) commented Jun 17, 2022

Description

  • Continuation of adding more missing FP16 specializations, following PR [Bug][Feature] Added cublasGemm<__half> specialization (#3988) #4029, which addressed the specific case from issue Add FP16 support for GatherMM kernel #3988
  • Added missing `__half` specializations of `DLDataTypeTraits`, `IndexSelect`, `Full`, `Scatter_`, `CSRGetData`, `CSRMM`, and `CSRSum` (a minimal sketch of this kind of specialization follows this list)
  • Fixed a casting issue in `_LinearSearchKernel` that was preventing it from supporting `__half`
  • Added more specific error messages for the unimplemented FP16 specializations of `Xgeam`, `CSRGEMM`, and `CSRGEAM`, which would require functions that cuBLAS does not currently provide
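
For context, a minimal sketch of the kind of `__half` specialization described in the second bullet, assuming a trait template shaped like DLPack's `DLDataType` (the template and field names here are illustrative, not DGL's exact definitions):

```cpp
#include <cstdint>

#include <cuda_fp16.h>  // defines __half

// Hypothetical trait template mirroring the DLDataTypeTraits pattern;
// fields follow DLPack's DLDataType convention.
template <typename T>
struct DLDataTypeTraits;

// The kind of specialization this PR adds: map __half to the DLPack
// 16-bit float dtype.
template <>
struct DLDataTypeTraits<__half> {
  static constexpr uint8_t code = 2U;    // kDLFloat in DLPack's type enum
  static constexpr uint8_t bits = 16U;   // binary16 width
  static constexpr uint16_t lanes = 1U;  // scalar, not a vector type
};
```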

Checklist

  • The PR title starts with [$CATEGORY] (such as [NN], [Model], [Doc], [Feature])
  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage
  • Code is well-documented
  • To the best of my knowledge, examples are either not affected by this change,
    or have been fixed to be compatible with this change
  • Related issue is referred to in this PR

@dgl-bot (Collaborator) commented Jun 17, 2022

To trigger regression tests:

  • @dgl-bot run [instance-type] [which tests] [compare-with-branch];
    For example: @dgl-bot run g4dn.4xlarge all dmlc/master or @dgl-bot run c5.9xlarge kernel,api dmlc/master

@dgl-bot (Collaborator) commented Jun 17, 2022

Commit ID: 1751ca3

Build ID: 1

Status: ❌ CI test failed in Stage [Lint Check].

Report path: link

Full logs path: link

@yaox12 yaox12 requested a review from nv-dlasalle June 20, 2022 06:25
@mufeili mufeili requested a review from isratnisa June 20, 2022 06:28
@dgl-bot (Collaborator) commented Jun 20, 2022

Commit ID: 9f63277538d3f54bc2f79004984c07efd7dafac1

Build ID: 2

Status: ❌ CI test failed in Stage [GPU Build].

Report path: link

Full logs path: link

@dgl-bot (Collaborator) commented Jun 20, 2022

Commit ID: d0bc60a19bae0d11db5214fe3527abcf1e9094b3

Build ID: 3

Status: ❌ CI test failed in Stage [Lint Check].

Report path: link

Full logs path: link

@dgl-bot (Collaborator) commented Jun 20, 2022

Commit ID: ad8972c

Build ID: 4

Status: ❌ CI test failed in Stage [Lint Check].

Report path: link

Full logs path: link

@dgl-bot (Collaborator) commented Jun 20, 2022

Commit ID: 29f464e

Build ID: 5

Status: ✅ CI test succeeded

Report path: link

Full logs path: link

Resolved review threads (outdated):
  • src/array/cuda/spmm.cuh
  • src/array/cuda/cusparse_dispatcher.cuh (two threads)

The next thread concerns this code comment:
#ifdef USE_FP16
// The initialization constructor for __half is apparently a device-
// only function in some setups, but the current function isn't run
// on the device.
A collaborator commented:

I am not clear which 'current' function we are talking about here.

@ndickson-nvidia (Contributor, Author) replied:

"the current function" here refers to the function containing this comment, the function that is currently being run (on the host) as this comment is passed. If I referred to it by name instead, i.e. IndexSelect, it might sound like the comment is referring to the other function named IndexSelect, above. Maybe it would be clear if I included both, though. 🤔 I'll try something when I add the "TODO"s.

@ndickson-nvidia (Contributor, Author) replied:

I updated the comment. Hopefully it's a bit clearer now. Thanks!
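
To make the thread concrete: on some toolchains the `__half(float)` conversion constructor is compiled as a device-only function, so a host-side function like `IndexSelect` cannot call it. One host-safe workaround is to assemble the binary16 bit pattern directly; this is a hypothetical sketch for illustration, not necessarily the approach taken in the PR:

```cpp
#include <cstdint>
#include <cstring>

#include <cuda_fp16.h>

// Hypothetical helper: produce a __half on the host without invoking the
// (possibly device-only) conversion constructor, by copying the raw
// binary16 bit pattern into the object.
inline __half HalfFromBits(uint16_t bits) {
  __half h;                           // default-construct, then overwrite bits
  std::memcpy(&h, &bits, sizeof(h));  // __half is exactly 16 bits wide
  return h;
}

// Example: 0x3C00 is 1.0 in IEEE 754 binary16.
static const __half kHalfOne = HalfFromBits(0x3C00);
```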

@dgl-bot (Collaborator) commented Jun 23, 2022

Commit ID: 10c371f85996c031b10a7153b517e3cb149ae956

Build ID: 6

Status: ❌ CI test failed in Stage [Torch CPU (Win64) Unit test].

Report path: link

Full logs path: link

@dgl-bot (Collaborator) commented Jun 23, 2022

Commit ID: c425866

Build ID: 7

Status: ✅ CI test succeeded

Report path: link

Full logs path: link

@nv-dlasalle (Collaborator) left a review:

Functionally it looks good--just the way some of the errors are reported needs updating.

Review threads:
  • src/array/cuda/cusparse_dispatcher.cuh (two threads, outdated, resolved)
  • src/array/cuda/spmm.cuh (resolved)
…IndexSelect`, `Full`, `Scatter_`, `CSRGetData`, `CSRMM`, `CSRSum`, `IndexSelectCPUFromGPU`

* Fixed casting issue in `_LinearSearchKernel` that was preventing it from supporting `__half`
* Added `#if`'d out specializations of `CSRGEMM`, `CSRGEAM`, and `Xgeam`, which would require functions that aren't currently provided by cuBLAS
* Added a clearer comment explaining why the cast to long long is necessary
* Also changed the existing `Xgeam` fallback for unsupported data types from `LOG(INFO)` to `LOG(FATAL)` (a sketch of this pattern follows below)
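
A sketch of the `LOG(FATAL)` pattern mentioned in the last commit above. cuBLAS only provides `geam` for float, double, and the complex types, so there is no FP16 routine to dispatch to; the generic fallback should therefore abort rather than log at INFO level. The signature below is simplified and illustrative, not DGL's exact code:

```cpp
#include <dmlc/logging.h>  // DGL's LOG macros come from dmlc-core

// Illustrative generic fallback for Xgeam: previously an unsupported data
// type only emitted LOG(INFO) and execution continued; LOG(FATAL) aborts,
// so the unsupported-dtype case cannot be silently ignored.
template <typename DType>
void Xgeam(const DType* A, const DType* B, DType* C, int m, int n) {
  LOG(FATAL) << "Xgeam does not support this data type: cuBLAS provides "
                "geam only for float, double, and complex types.";
}
```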
@dgl-bot (Collaborator) commented Jun 27, 2022

Commit ID: 83d41e9

Build ID: 9

Status: ✅ CI test succeeded

Report path: link

Full logs path: link

@nv-dlasalle nv-dlasalle merged commit a5d8460 into dmlc:master Jun 27, 2022