
Implement Tanh Gelu #988

Merged
jjsjann123 merged 8 commits into 20_12_3_devel from rds_fast_gelu on Jul 16, 2021

Conversation

@rdspring1 (Collaborator) commented on Jul 8, 2021:

  • Double backward support for Gelu - [Original and Approximate] (see the sketch after this list)
  • Add approximate boolean flag to Gelu
  • Fast tanh gelu to eager mode - [CPU and CUDA, skip MKLDNN]
  • Fast Tanh Gelu to NvFuser composite ops
  • Pass PyTorch CI
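
For reference, a minimal sketch (not this PR's code, just stock PyTorch) of the two variants being wired up here: the exact erf-based GELU and its tanh approximation, plus a gradgradcheck call that exercises double backward.

```python
import math
import torch

def gelu_exact(x):
    # Exact GELU: x * Phi(x), where Phi is the standard normal CDF.
    return 0.5 * x * (1.0 + torch.erf(x / math.sqrt(2.0)))

def gelu_tanh(x):
    # Tanh approximation: 0.5 * x * (1 + tanh(sqrt(2/pi) * (x + 0.044715 * x^3))).
    return 0.5 * x * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (x + 0.044715 * x.pow(3))))

x = torch.randn(1024, dtype=torch.double)
print((gelu_exact(x) - gelu_tanh(x)).abs().max())  # small, but not zero

# Double backward can be checked numerically with gradgradcheck.
xg = torch.randn(8, dtype=torch.double, requires_grad=True)
torch.autograd.gradgradcheck(gelu_tanh, (xg,))
```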

@jjsjann123 (Collaborator):

Is this one ready for review yet? It seems to be still in draft state.

@rdspring1 (Collaborator, Author):

I'm still working on passing the PyTorch CI.

Would adding a separate TanhGelu op be easier to upstream than adding an approximation flag to Gelu?

@jjsjann123 (Collaborator):

> Would adding a separate TanhGelu op be easier to upstream than adding an approximation flag to Gelu?

Is this because of the CI failures?
I don't think adding a separate TanhGelu would make it easier. If for some reason a new TanhGeluApprox gets CI green while the corresponding changes to TanhGelu don't, that means we are missing some CI test and will likely have to patch that later as well.

@jjsjann123 (Collaborator):

Linking the upstream PR: pytorch#61439

Commits:
  • Update Gelu documentation
  • Update test_nn with tanh gelu
  • Fix test_cpp_api_parity
  • Update torch overrides
  • Enable Gelu for Tensor Expressions
  • Tensor Expressions ignores the approximate flag
  • Add Gelu to UNTRACEABLE_FUNCTIONALS - test/test_fx
  • Update torch/onnx/symbolic_opset

@jjsjann123 (Collaborator) left a review comment:

LGTM.

aten/src/ATen/native/native_functions.yaml
@@ -3497,22 +3497,22 @@
     CPU: gelu_out_cpu
     CUDA: gelu_out_cuda

-- func: gelu(Tensor self) -> Tensor
+- func: gelu(Tensor self, bool approximate) -> Tensor

Inline review comment (Collaborator):

nitpick, should we default approximate to False just so that we don't have to update tests?
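
For illustration, a sketch of what defaulting the flag would mean for callers, assuming this PR's proposed gelu(Tensor self, bool approximate) signature; the bool flag is the proposal in this diff, not the released torch.nn.functional.gelu API.

```python
import torch
import torch.nn.functional as F

x = torch.randn(4)

# With approximate defaulting to False, existing call sites and tests
# keep the exact erf-based GELU without any changes:
y_exact = F.gelu(x)

# Opting in to the tanh approximation would then be explicit
# (hypothetical call under this PR's proposed bool flag):
# y_tanh = F.gelu(x, True)
```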

torch/csrc/jit/runtime/symbolic_script.cpp (outdated review comment, resolved)
@jjsjann123 (Collaborator):

Don't worry too much about the upstream XLA test. We can deal with that when we are actually upstreaming.

@rdspring1 (Collaborator, Author) commented on Jul 14, 2021:

I created an XLA PR - pytorch/xla#3039

@rdspring1 marked this pull request as ready for review on July 16, 2021 at 15:36.
@jjsjann123 (Collaborator):

Tests are passing. I think we are missing some BC allow-list changes, but those shouldn't matter. I'm merging this one.

@jjsjann123 merged commit 765276e into 20_12_3_devel on Jul 16, 2021.
@jjsjann123 deleted the rds_fast_gelu branch on July 16, 2021 at 20:03.