
Packing for Gelu backwards #306

Merged 6 commits into karpathy:master on May 2, 2024
Conversation

JaneIllario (Contributor)

Update the gelu backward kernel to pack loads and stores into 128 bits, and create a gelu_backward CUDA file.

Previous kernel:
block_size 32 | time 0.1498 ms | bandwidth 503.99 GB/s
block_size 64 | time 0.0760 ms | bandwidth 993.32 GB/s
block_size 128 | time 0.0490 ms | bandwidth 1540.78 GB/s
block_size 256 | time 0.0487 ms | bandwidth 1548.88 GB/s
block_size 512 | time 0.0487 ms | bandwidth 1548.88 GB/s
block_size 1024 | time 0.0497 ms | bandwidth 1518.38 GB/s

total average iteration time: 39.030942 ms

New kernel:

block_size 32 | time 0.0328 ms | bandwidth 1535.18 GB/s
block_size 64 | time 0.0319 ms | bandwidth 1575.59 GB/s
block_size 128 | time 0.0333 ms | bandwidth 1509.35 GB/s
block_size 256 | time 0.0337 ms | bandwidth 1491.94 GB/s
block_size 512 | time 0.0340 ms | bandwidth 1478.92 GB/s
block_size 1024 | time 0.0352 ms | bandwidth 1430.92 GB/s

total average iteration time: 38.145030 ms
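For reference, the element-wise math the kernel computes (before any packing) follows the tanh-approximation GELU derivative. Below is a hedged CPU sketch in the style of llm.c's CPU reference functions; the names (`gelu_backward_cpu`, `GELU_SCALING_FACTOR`) mirror the repo's conventions, but this standalone version is illustrative, not the PR's exact code:

```cpp
#include <cassert>
#include <cmath>

// sqrt(2/pi), the scaling factor in the tanh approximation of GELU
#define GELU_SCALING_FACTOR 0.7978845608028654f

// CPU reference of the GELU backward pass (sketch, not the PR's exact code).
// gelu(x) = 0.5 * x * (1 + tanh(s * (x + 0.044715 * x^3))), s = sqrt(2/pi)
void gelu_backward_cpu(float* dinp, const float* inp, const float* dout, int N) {
    for (int i = 0; i < N; i++) {
        float x = inp[i];
        float cube = 0.044715f * x * x * x;
        float tanh_arg = GELU_SCALING_FACTOR * (x + cube);
        float tanh_out = tanhf(tanh_arg);
        float cosh_out = coshf(tanh_arg);
        float sech2 = 1.0f / (cosh_out * cosh_out); // sech^2 = d(tanh)/d(arg)
        // product rule + chain rule on 0.5 * x * (1 + tanh(tanh_arg))
        float local_grad = 0.5f * (1.0f + tanh_out)
            + x * 0.5f * sech2 * GELU_SCALING_FACTOR * (1.0f + 3.0f * 0.044715f * x * x);
        dinp[i] = local_grad * dout[i];
    }
}
```

The packed kernel applies exactly this scalar math, just to `x128::size` elements per thread so that each global load/store moves a full 128 bits.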

@karpathy (Owner)

With this PR we can't compare the previous kernel against the new one, and isn't there also a compile bug? The x128 typedef doesn't exist.

@JaneIllario (Contributor, Author)

Sorry, I must've deleted the definition for x128 while cleaning up my branch. Pushed the correction now.
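For context, a 128-bit pack type in the spirit of llm.c's `Packed128` / `x128` can be sketched as below. This is an illustrative host-side version under the assumption that `floatX` is `float`; the real one in the repo adds `__device__` qualifiers and reinterprets the payload as `int4` for vectorized loads and stores:

```cpp
#include <cassert>
#include <cstddef>

// Sketch of a 128-bit packed vector type (illustrative, not the repo's exact code).
template<class ElementType>
struct alignas(16) Packed128 {
    // number of elements that fit in one 128-bit (16-byte) load/store
    static constexpr size_t size = 16 / sizeof(ElementType);
    ElementType payload[size];
    ElementType& operator[](int index) { return payload[index]; }
    const ElementType& operator[](int index) const { return payload[index]; }
};

typedef float floatX;           // llm.c switches this between float / half / bf16
typedef Packed128<floatX> x128; // 4 floats, or 8 half-precision values, per pack
```

With `floatX = float`, `x128::size` is 4, so each thread processes 4 elements per 128-bit transaction.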

@JaneIllario (Contributor, Author)

I updated gelu_backward.cu to match the most recent changes to the other .cu files on master -- updating the other PR now.

Resolved review threads on train_gpt2.cu and dev/cuda/gelu_backward.cu. One comment on dev/cuda/gelu_backward.cu:
}

void gelu_backward2(floatX* dinp, const floatX* inp, const floatX* dout, int N, const int block_size) {
const int grid_size = ceil_div(N, block_size * x128::size);
Contributor:
Maybe add an assert(N % x128::size == 0) here? It documents the assumption, and we may get a better error message in case we call the kernel wrongly later.
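The suggested guard can be sketched host-side as follows. This is a minimal illustration assuming `floatX = float` (so the pack width is 4); `grid_size_for` is a hypothetical helper name, and `ceil_div` matches the rounding-up division helper used throughout llm.c's dev/cuda files:

```cpp
#include <cassert>

constexpr int x128_size = 4; // stand-in for x128::size with floatX = float

// round-up integer division, as used for CUDA grid sizing
int ceil_div(int dividend, int divisor) {
    return (dividend + divisor - 1) / divisor;
}

// hypothetical wrapper showing where the reviewer's assert would go
int grid_size_for(int N, int block_size) {
    // documents the assumption that N is a multiple of the pack width,
    // and fails loudly if the kernel is ever called with a ragged N
    assert(N % x128_size == 0);
    return ceil_div(N, block_size * x128_size);
}
```

Without the assert, a ragged `N` would silently make the last pack read and write past the end of the buffers; the assert turns that into an immediate, diagnosable failure.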

@karpathy karpathy merged commit 99f51ba into karpathy:master May 2, 2024
3 participants