-
Notifications
You must be signed in to change notification settings - Fork 407
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add a more comprehensive kokkos_{malloc, free}
perf_test
#6377
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Generally agree with Damien's comments. Also lets change the FOM to a simple rate. I.e. inverse of time per try.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good after you move the stride definition next to the parallel_for as Daniel suggested.
71e845c
to
35840c7
Compare
I made the change and rebased to two commits |
Jenkins failure are unrelated ( |
Command and sample output
$ ./Kokkos_PerformanceTest_Benchmark --benchmark_filter="Malloc"
@simongdg