Create benchmarks directory and move babelstream into it #2237

mehmetyusufoglu · 2024-01-31T09:48:59Z

A simple PR. A directory called "benchmarks" is created and the babelstream example is copied into it. There is a new CMake flag, alpaka_BUILD_BENCHMARKS. If this flag is ON, then alpaka_ACC_CPU_B_SEQ_T_SEQ_ENABLE is turned ON (as with the alpaka_BUILD_EXAMPLES flag).

The code under the benchmarks directory is compiled, but the babelstream example is not run in the CI, as it was in the examples directory before.
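A minimal sketch of how the new flag might be passed when configuring a local build. Only alpaka_BUILD_BENCHMARKS comes from this PR; the `ALPAKA_SRC` and `BUILD_DIR` variables, their defaults, and the `configure_cmd` helper are hypothetical names used for illustration.

```shell
# Hypothetical locations; adjust them to your own checkout and build directory.
ALPAKA_SRC="${ALPAKA_SRC:-.}"
BUILD_DIR="${BUILD_DIR:-build-benchmarks}"

# Compose the configure command with the benchmark flag from this PR.
# The command is only printed, so the sketch can be inspected without running CMake.
configure_cmd() {
    echo cmake -S "$ALPAKA_SRC" -B "$BUILD_DIR" -Dalpaka_BUILD_BENCHMARKS=ON
}

configure_cmd
```

To actually configure, remove the `echo` inside `configure_cmd`; a following `cmake --build "$BUILD_DIR"` would then compile whatever the configuration enabled.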

SimeonEhrig · 2024-01-31T13:15:41Z

Maybe we should add a CMake target all_benchmarks where we can register and execute all benchmarks.

The benchmarks need to be built in the CI. I see no reason why we should not enable the benchmarks for all builds. Therefore, we can add the CMake argument here:

alpaka/script/run_generate.sh

Lines 80 to 94 in dcc87b3

    
    "${ALPAKA_CI_CMAKE_EXECUTABLE}" --log-level=VERBOSE -G "${ALPAKA_CI_CMAKE_GENERATOR}" ${ALPAKA_CI_CMAKE_GENERATOR_PLATFORM}\
        -Dalpaka_BUILD_EXAMPLES=ON -DBUILD_TESTING=ON "$(env2cmake alpaka_ENABLE_WERROR)" \
        "$(env2cmake BOOST_ROOT)" -DBOOST_LIBRARYDIR="${ALPAKA_CI_BOOST_LIB_DIR}/lib" -DBoost_USE_STATIC_LIBS=ON -DBoost_USE_MULTITHREADED=ON -DBoost_USE_STATIC_RUNTIME=OFF -DBoost_ARCHITECTURE="-x64" \
        "$(env2cmake CMAKE_BUILD_TYPE)" "$(env2cmake CMAKE_CXX_FLAGS)" "$(env2cmake CMAKE_C_COMPILER)" "$(env2cmake CMAKE_CXX_COMPILER)" "$(env2cmake CMAKE_EXE_LINKER_FLAGS)" "$(env2cmake CMAKE_CXX_EXTENSIONS)"\
        "$(env2cmake alpaka_ACC_CPU_B_SEQ_T_SEQ_ENABLE)" "$(env2cmake alpaka_ACC_CPU_B_SEQ_T_THREADS_ENABLE)" \
        "$(env2cmake alpaka_ACC_CPU_B_TBB_T_SEQ_ENABLE)" \
        "$(env2cmake alpaka_ACC_CPU_B_OMP2_T_SEQ_ENABLE)" "$(env2cmake alpaka_ACC_CPU_B_SEQ_T_OMP2_ENABLE)" \
        "$(env2cmake TBB_DIR)" \
        "$(env2cmake alpaka_RELOCATABLE_DEVICE_CODE)" \
        "$(env2cmake alpaka_ACC_GPU_CUDA_ENABLE)" "$(env2cmake alpaka_ACC_GPU_CUDA_ONLY_MODE)" "$(env2cmake CMAKE_CUDA_ARCHITECTURES)" "$(env2cmake CMAKE_CUDA_COMPILER)" "$(env2cmake CMAKE_CUDA_FLAGS)" \
        "$(env2cmake alpaka_CUDA_FAST_MATH)" "$(env2cmake alpaka_CUDA_FTZ)" "$(env2cmake alpaka_CUDA_SHOW_REGISTER)" "$(env2cmake alpaka_CUDA_KEEP_FILES)" "$(env2cmake alpaka_CUDA_EXPT_EXTENDED_LAMBDA)" \
        "$(env2cmake alpaka_ACC_GPU_HIP_ENABLE)" "$(env2cmake alpaka_ACC_GPU_HIP_ONLY_MODE)" "$(env2cmake CMAKE_HIP_ARCHITECTURES)" "$(env2cmake CMAKE_HIP_COMPILER)" "$(env2cmake CMAKE_HIP_FLAGS)" \
        "$(env2cmake alpaka_ACC_SYCL_ENABLE)" "$(env2cmake alpaka_SYCL_ONEAPI_CPU)" "$(env2cmake alpaka_SYCL_ONEAPI_CPU_ISA)" \
        "$(env2cmake alpaka_DEBUG)" "$(env2cmake alpaka_CI)" "$(env2cmake alpaka_CHECK_HEADERS)" "$(env2cmake alpaka_CXX_STANDARD)" "$(env2cmake alpaka_USE_MDSPAN)" "$(env2cmake CMAKE_INSTALL_PREFIX)" \
        ".."
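As a rough illustration of the suggestion, the fixed flags at the start of that configure call could gain one more entry. This is a sketch only: `ci_fixed_flags` is a hypothetical helper that does not exist in run_generate.sh, and the position of the new flag next to `-Dalpaka_BUILD_EXAMPLES=ON` is an assumption.

```shell
# Hypothetical helper: lists the flags that are passed unconditionally,
# with -Dalpaka_BUILD_BENCHMARKS=ON as the proposed addition.
ci_fixed_flags() {
    printf '%s\n' \
        "-Dalpaka_BUILD_EXAMPLES=ON" \
        "-Dalpaka_BUILD_BENCHMARKS=ON" \
        "-DBUILD_TESTING=ON"
}

# Print one flag per line to inspect what would be handed to CMake.
ci_fixed_flags
```

Hard-coding the flag, rather than routing it through an environment variable, would match the intent of enabling the benchmarks for all builds.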

benchmarks/CMakeLists.txt

bernhardmgruber · 2024-02-01T10:40:26Z

I generally like the idea of separating benchmarks and examples, but could you please elaborate a bit on your motivation for doing this? Specifically, are you going to add more benchmarks? Are you planning to build the benchmarks differently than examples? Thx!

mehmetyusufoglu · 2024-02-01T11:02:50Z

In my opinion, examples can be designed for any reason: pedagogical, showing the implementation of a new feature, etc. Benchmarks will mainly focus on performance, and visualising its change through time with CI will show the general performance effects of each merged PR.

bernhardmgruber · 2024-02-01T12:06:14Z

> Benchmarks will mainly focus on performance, and visualising its change through time with CI will show the general performance effects of each merged PR.

Alright, so you are preparing for some kind of performance CI? Here is a ticket for that: #1264

SimeonEhrig · 2024-02-05T08:47:54Z

> > Benchmarks will mainly focus on performance, and visualising its change through time with CI will show the general performance effects of each merged PR.
>
> Alright, so you are preparing for some kind of performance CI? Here is a ticket for that: #1264

We discussed it last week. At the moment, a performance CI is not possible due to a lack of resources. But we want to have benchmarks so that we can run regression benchmarks locally on laptops, workstations or servers.

For example, we thought about using mdspan for tensors in kernels. This makes usage easier than with raw pointers. But maybe the performance overhead is too high, which would mean we also need to implement an interface with raw pointers.

sliwowitz · 2024-02-05T15:19:42Z

There's also #1723, which I've just rebased on top of the current develop. It uses Catch2 for the benchmarking infrastructure (and is thus integrated with e.g. ctest). I tried to implement a generic fixture for benchmarking kernels that would allow us to write simple benchmarks for basic features, but I didn't implement any use case other than the random generator, so I didn't know what sensible requirements for such a fixture would actually be.

Using Catch2 to handle the benchmarks is IMHO still a good idea since we're already using it to handle tests.

mehmetyusufoglu marked this pull request as draft January 31, 2024 09:50

mehmetyusufoglu force-pushed the benchmarkDir branch 2 times, most recently from d1769ae to ac3b661 Compare January 31, 2024 10:04

mehmetyusufoglu marked this pull request as ready for review January 31, 2024 10:10

SimeonEhrig reviewed Jan 31, 2024

benchmarks/CMakeLists.txt (outdated, resolved)

mehmetyusufoglu force-pushed the benchmarkDir branch 2 times, most recently from 8d30818 to 1e4d0ae Compare February 1, 2024 08:22

mehmetyusufoglu marked this pull request as draft February 2, 2024 09:51

mehmetyusufoglu force-pushed the benchmarkDir branch from bb04e0d to 614fb7f Compare February 5, 2024 09:51

mehmetyusufoglu marked this pull request as ready for review February 5, 2024 10:12

mehmetyusufoglu force-pushed the benchmarkDir branch from 614fb7f to 8cc55d6 Compare February 5, 2024 12:30

psychocoderHPC added this to the 1.2.0 milestone Feb 27, 2024

psychocoderHPC added Type:Refactoring Type:Testing labels Feb 27, 2024

mehmetyusufoglu force-pushed the benchmarkDir branch from a4317ef to 6ecabf2 Compare February 27, 2024 09:53

Create benchmarks directory and move babelstream into it

91f04ab

mehmetyusufoglu force-pushed the benchmarkDir branch from 6ecabf2 to 91f04ab Compare March 8, 2024 15:17

psychocoderHPC approved these changes Mar 20, 2024

psychocoderHPC merged commit 7a8b205 into alpaka-group:develop Mar 20, 2024
22 checks passed
