Pull out conv2d kernels from op #320

lockshaw · 2022-09-29T23:01:54Z

By pulling out the kernel functions into their own files, we should decrease build time by reducing the number of times the kernels (which rarely change) need to be recompiled

src/ops/conv_2d.cc

reyna-abhyankar

Overall, it looks good to me. I have one question that's related to the codebase. The .cpp file seem to mimic its corresponding .cu file. Is it auto-generated? If not, should that also be modified for this PR?

Also, what is the purpose of the kernel wrappers? If there's a forward and a backward and then also have forward_kernel and backward_kernel, I'm wondering if the wrapper is really necessary because it mostly seems to be doing some profiling.

src/ops/kernels/conv_2d_kernels.cu

lockshaw · 2022-10-13T06:04:13Z

Blocked on #345 as currently I can't check that the AMD kernel changes are correct

reyna-abhyankar

What is the purpose of the change from typename to class in model.h? Otherwise LGTM

lockshaw · 2022-12-12T06:13:30Z

What is the purpose of the change from typename to class in model.h? Otherwise LGTM

This is a C++17 feature apparently 🤷, whereas class is C++11 compliant. C++11 compliance checking is being added #271

lockshaw added 2 commits September 27, 2022 19:28

Pull out Conv2D kernels into separate functions

e7fe5bc

Format

519a7e5

lockshaw requested a review from reyna-abhyankar September 29, 2022 23:01

lockshaw added 2 commits September 29, 2022 16:02

Merge branch 'master' into re/conv2d-kernels

4b28b22

Merge branch 'master' into re/conv2d-kernels

2cdadac

reyna-abhyankar reviewed Oct 1, 2022

View reviewed changes

src/ops/conv_2d.cc Outdated Show resolved Hide resolved

reyna-abhyankar reviewed Oct 1, 2022

View reviewed changes

src/ops/kernels/conv_2d_kernels.cu Show resolved Hide resolved

Start to add AMD support

22818f9

lockshaw mentioned this pull request Oct 18, 2022

Pull kernel functions out of operator classes #303

Open

lockshaw linked an issue Nov 8, 2022 that may be closed by this pull request

Kernel refactor for Conv2D #434

Closed

lockshaw added 7 commits December 9, 2022 14:05

Merge remote-tracking branch 'origin/master' into re/conv2d-kernels

9e2349a

Update hip files

fe344a6

Fix hip stream and kernel issue

49a3ba0

Change to use ffStream_t

98b316d

Small bug fix

118f956

Merge remote-tracking branch 'origin/master' into re/conv2d-kernels

a97fe4c

Fix merge issue and include

cf61de2

lockshaw requested a review from reyna-abhyankar December 11, 2022 08:44

Fix small namespacing issue

b683ac9

lockshaw enabled auto-merge (squash) December 11, 2022 10:00

lockshaw and others added 2 commits December 11, 2022 02:02

Fix include in simulator.cpp

811da0b

Merge branch 'master' into re/conv2d-kernels

8b66325

goliaro approved these changes Dec 12, 2022

View reviewed changes

lockshaw merged commit ebe4488 into flexflow:master Dec 12, 2022

reyna-abhyankar reviewed Dec 12, 2022

View reviewed changes

lockshaw deleted the re/conv2d-kernels branch July 17, 2023 17:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pull out conv2d kernels from op #320

Pull out conv2d kernels from op #320

lockshaw commented Sep 29, 2022

reyna-abhyankar left a comment

lockshaw commented Oct 13, 2022

reyna-abhyankar left a comment

lockshaw commented Dec 12, 2022

Pull out conv2d kernels from op #320

Pull out conv2d kernels from op #320

Conversation

lockshaw commented Sep 29, 2022

reyna-abhyankar left a comment

Choose a reason for hiding this comment

lockshaw commented Oct 13, 2022

reyna-abhyankar left a comment

Choose a reason for hiding this comment

lockshaw commented Dec 12, 2022