Skip to content

Commit

Permalink
Merge pull request #15 from stoneyang/hl_device_func
Browse files Browse the repository at this point in the history
fixed build issue of double definition of atomicAdd on modern GPUs
  • Loading branch information
gangliao committed Sep 1, 2016
2 parents df82689 + f2aa274 commit f2b84d5
Showing 1 changed file with 4 additions and 0 deletions.
4 changes: 4 additions & 0 deletions paddle/cuda/include/hl_device_functions.cuh
100644 → 100755
Original file line number Diff line number Diff line change
Expand Up @@ -16,6 +16,8 @@ limitations under the License. */
#ifndef HL_DEVICE_FUNCTIONS_CUH_
#define HL_DEVICE_FUNCTIONS_CUH_

#if defined(__CUDA_ARCH__) && __CUDA_ARCH__ < 600

namespace hppl {

static __inline__ __device__ double atomicAdd(double* address, double val) {
Expand All @@ -38,4 +40,6 @@ static __inline__ __device__ double atomicAdd(double* address, double val) {

using hppl::atomicAdd;

#endif

#endif /* HL_DEVICE_FUNCTIONS_CUH_ */

0 comments on commit f2b84d5

Please sign in to comment.