The OpenMP GPU offloading version is written from scratch, using the official OpenMP and CUDA versions as references. The OpenACC version is then derived from the OpenMP GPU offloading version.
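To illustrate how closely the two offloading versions can track each other, here is a minimal sketch of one loop carrying both directive sets. The kernel `saxpy_offload` and its parameters are hypothetical and are not taken from NeoRodinia; the sketch only assumes the standard `_OPENACC` macro, which OpenACC compilers define.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical kernel (not from NeoRodinia): y[i] = a * x[i] + y[i]. */
void saxpy_offload(size_t n, float a, const float *x, float *y)
{
    assert(x != NULL && y != NULL); /* host-side precondition check */

#ifdef _OPENACC
    /* OpenACC form: x is read-only on the device, y is copied in and out. */
    #pragma acc parallel loop copyin(x[0:n]) copy(y[0:n])
#else
    /* OpenMP GPU offloading form with the equivalent data mapping. */
    #pragma omp target teams distribute parallel for map(to: x[0:n]) map(tofrom: y[0:n])
#endif
    for (size_t i = 0; i < n; ++i) {
        y[i] = a * x[i] + y[i];
    }
}
```

A compiler built without either model ignores the pragmas and runs the loop serially on the host, so the function can be checked on a CPU first.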
The original OpenMP CPU version uses omp_get_thread_num() to split the work into chunks across threads. OpenACC has no equivalent API, so the chunk ID has to be computed manually (see commit passlab/NeoRodinia@3d88f2a).
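One way to do manual chunking is to make the chunk ID the loop index and derive each chunk's range from it. The sketch below is only an illustration under that assumption; it is not the code from the linked commit, and `chunked_partial_sums`, `num_chunks`, and `partial` are hypothetical names.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical example: each chunk sums its own slice of data into
   partial[chunk_id]. The chunk ID is the loop index itself, so no
   thread-ID query is needed. */
void chunked_partial_sums(const float *data, size_t n,
                          float *partial, size_t num_chunks)
{
    assert(num_chunks > 0); /* host-side precondition check */

    #pragma acc parallel loop copyin(data[0:n]) copyout(partial[0:num_chunks])
    for (size_t chunk_id = 0; chunk_id < num_chunks; ++chunk_id) {
        /* Split n items as evenly as possible: the first `rem` chunks
           each take one extra item. */
        size_t base  = n / num_chunks;
        size_t rem   = n % num_chunks;
        size_t begin = chunk_id * base + (chunk_id < rem ? chunk_id : rem);
        size_t end   = begin + base + (chunk_id < rem ? 1 : 0);

        float sum = 0.0f;
        #pragma acc loop seq
        for (size_t i = begin; i < end; ++i) {
            sum += data[i];
        }
        partial[chunk_id] = sum;
    }
}
```

With `n = 10` and `num_chunks = 3`, the chunks cover the index ranges [0, 4), [4, 7), and [7, 10). Computing the bounds from the loop index keeps the chunking independent of how the runtime schedules gangs and workers.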
The original OpenACC version doesn't work at all, so it's abandoned.
The OpenACC directives seem fine and can be mapped to OpenMP code. However, there are many errors in the source.
For example:
- `cluster()`, `allocateMemory()`, `deallocateMemory()`, etc.
- `float* features` vs `float** features`.
- `allocateMemory()` and `deallocateMemory()`
- `tex1Dfetch()`.

We can either fix them all or start with the official OpenMP CPU version.