Skip to content

mdsmith/oclCopyBench

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

oclCopyBench

Compile:

make

OR, for the simplified version:

make simple

Usage:

./copy

OR, for the simplified version:

./simpleCopy

Current Tests:

Far more have been added, and there are too many variables to document from memory. This should be updated soon.

  • Float + Exponent Arrays
  • Double + Exponent Arrays
  • Struct with three elements Array
  • Struct with two elements Array

Whether or not it makes sense to replace mantissa & exponent arrays with a single struct array. This goes for both copying the data to the GPU and accessing it once it is already there.

Current Results:

Currently the struct arrays are about twice as fast. More testing is needed to isolate whether or not this is from the number of arrays that need to be copied (my hypothesis). This would suggest that the time to access the data from memory is trivial if the memory is only being accessed once.

If this is the case the next test will be to experiment with the different methods of moving memory to and from the GPU, which may at some point involve an APU and zero copy transfers (perhaps even a unified memory space).

About

A suite for testing and benchmarking various methods of getting data from host to device. I'm going to go crazy and put reading and writing from device memory in here too. Wow!

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors