Add RAJA view performance test to benchmark #1728

artv3 · 2024-09-02T15:54:22Z

After resolving issue #1718, this PR now adds the performance test to the benchmark folder.

//------------

This PR adds the code provided in issue #1718 in an effort to reproduce the slowdown.
I don't have access to Pascal, but on Lassen I see comparable performance:

Elapsed time with RAJA view : 0.0951086
Elapsed time with NO RAJA view : 0.0952884

To avoid measuring stream initialization, I added a basic forall at the start of the program.
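
As a rough illustration of the warm-up-then-time pattern described above, here is a minimal host-only sketch in plain C++. It is not the benchmark in this PR and does not use RAJA: `View2D`, `sum_with_view`, `sum_raw`, and `time_after_warmup` are hypothetical names made up for this example, and the warm-up is just an untimed callable standing in for the basic forall.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a 2D view: maps (row, col) onto a flat array.
// This is NOT RAJA::View; it only mimics the indexing pattern.
struct View2D {
  const double* data;
  std::size_t cols;
  double operator()(std::size_t r, std::size_t c) const {
    return data[r * cols + c];
  }
};

// Sums every element by indexing through the view wrapper.
double sum_with_view(const std::vector<double>& a, std::size_t rows, std::size_t cols) {
  View2D v{a.data(), cols};
  double s = 0.0;
  for (std::size_t r = 0; r < rows; ++r)
    for (std::size_t c = 0; c < cols; ++c)
      s += v(r, c);
  return s;
}

// Sums every element with hand-written flat indexing.
double sum_raw(const std::vector<double>& a, std::size_t rows, std::size_t cols) {
  const double* p = a.data();
  double s = 0.0;
  for (std::size_t r = 0; r < rows; ++r)
    for (std::size_t c = 0; c < cols; ++c)
      s += p[r * cols + c];
  return s;
}

// Runs `warmup` once without timing it, then returns the elapsed
// wall-clock time of a single `kernel` call as a double.
template <typename Warmup, typename Kernel>
double time_after_warmup(Warmup&& warmup, Kernel&& kernel) {
  warmup();  // one-time setup cost is paid here, outside the timed region
  auto t0 = std::chrono::steady_clock::now();
  kernel();
  auto t1 = std::chrono::steady_clock::now();
  return std::chrono::duration<double>(t1 - t0).count();
}
```

A caller would pass a trivial callable as `warmup` and the view or no-view loop as `kernel`, accumulating the result into a variable that is read afterwards so the compiler cannot discard the work.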

Compiler setup:
nvcc: nvcc11.2.0, cuda_arch=70, gcc8.3.1

I also tried: nvcc11.8.0, cuda_arch=70, gcc8.3.1
Elapsed time with RAJA view : 0.0949394
Elapsed time with NO RAJA view : 0.0949237

artv3 · 2024-09-03T16:19:09Z

@artv3 move to benchmark folder.

rahulb1218 · 2024-09-03T17:59:26Z

I ran the code on Pascal and got the following results:

Elapsed time with RAJA view : 16.6409
Elapsed time with NO RAJA view : 2.26529
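
For reference, the slowdown factors implied by these numbers can be worked out with a few divisions. The snippet below is only a convenience sketch: `slowdown` and `print_ratios` are hypothetical helper names, and the Lassen timings are taken from the first (nvcc 11.2.0) run reported in the PR description.

```cpp
#include <cassert>
#include <cmath>
#include <cstdio>

// Ratio of two elapsed times; values above 1 mean `slow` took longer.
double slowdown(double slow, double fast) { return slow / fast; }

// Prints the ratios between the timings quoted in this thread.
void print_ratios() {
  const double lassen_view = 0.0951086;     // Lassen, with RAJA view
  const double lassen_no_view = 0.0952884;  // Lassen, no RAJA view
  const double pascal_view = 16.6409;       // Pascal, with RAJA view
  const double pascal_no_view = 2.26529;    // Pascal, no RAJA view

  std::printf("Pascal vs Lassen, no view : %.2fx\n", slowdown(pascal_no_view, lassen_no_view));
  std::printf("Pascal vs Lassen, view    : %.2fx\n", slowdown(pascal_view, lassen_view));
  std::printf("Pascal, view vs no view   : %.2fx\n", slowdown(pascal_view, pascal_no_view));
}
```

With these inputs the no-view case is roughly 24x slower on Pascal than on Lassen, the view case roughly 175x slower, and on Pascal the view variant is roughly 7.3x slower than the no-view variant.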

MrBurmark · 2024-09-03T18:04:24Z

What GPUs are on Pascal? It seems strange that they are ~24x slower than the V100s on Lassen.

rahulb1218 · 2024-09-03T18:21:32Z

Tesla P100-PCIE-16GB, I believe.

MrBurmark · 2024-09-03T18:24:54Z

Are we running the same code in the same way on both of these platforms? How did you build?

rahulb1218 · 2024-09-03T18:49:52Z

We found the issue: my code was built in debug mode, which was causing the slowdown.

johnbowen42

LGTM, just some small nits about making sure the benchmark makes sense for all backends

benchmark/raja_view_blur.cpp

benchmark/CMakeLists.txt

artv3 · 2024-09-27T01:09:53Z

@johnbowen42 @rhornung67 can I get another review? I just pushed the changes I thought I had already pushed.

artv3/raja-view-slowdown

f5bb9b8

artv3 mentioned this pull request Sep 2, 2024

Slowdown observed with Raja View #1718

Closed

artv3 added 3 commits September 16, 2024 11:36

Merge branch 'develop' into artv3/raja-view-slowdown

17a2b04

move raja_view perf test to benchmark folder

f54bcc1

Merge branch 'develop' into artv3/raja-view-slowdown

6d3a12e

artv3 changed the title ~~Reproducer for issue: 1718~~ Add RAJA view performance test to benchmark Sep 17, 2024

artv3 marked this pull request as ready for review September 17, 2024 20:25

artv3 requested review from MrBurmark, rhornung67, johnbowen42 and rchen20 September 17, 2024 20:26

johnbowen42 approved these changes Sep 17, 2024

artv3 added 2 commits September 19, 2024 12:16

Merge branch 'develop' into artv3/raja-view-slowdown

5360476

clean up pass, add other variants

c4ddb9e

artv3 requested a review from johnbowen42 September 27, 2024 01:09

rhornung67 approved these changes Sep 27, 2024

Merge branch 'develop' into artv3/raja-view-slowdown

8199dfa

artv3 enabled auto-merge September 27, 2024 20:52

adayton1 approved these changes Sep 27, 2024

artv3 merged commit 1ddae3d into develop Sep 27, 2024
16 of 26 checks passed
