Skip to content

Commit

Permalink
[OpenMP] Teams reduction on the NVPTX device.
Browse files Browse the repository at this point in the history
This patch implements codegen for the reduction clause on
any teams construct for elementary data types.  It builds
on parallel reductions on the GPU.  Subsequently,
the team master writes to a unique location in a global
memory scratchpad.  The last team to do so loads and
reduces this array to calculate the final result.

This patch emits two helper functions that are used by
the OpenMP runtime on the GPU to perform reductions across
teams.

Patch by Tian Jin in collaboration with Arpith Jacob

Reviewers: ABataev
Differential Revision: https://reviews.llvm.org/D29879

llvm-svn: 295335
  • Loading branch information
arpith-jacob committed Feb 16, 2017
1 parent 101e8fb commit fc711b1
Show file tree
Hide file tree
Showing 3 changed files with 1,590 additions and 7 deletions.

0 comments on commit fc711b1

Please sign in to comment.