Version 0.5.5: Minor changes

@eyalroz eyalroz released this 10 Sep 18:43
· 289 commits to development since this release

Changes since v0.5.4:

Run-time compilation functionality

  • #397 : The NVRTC compilation options class now supports passing extra options to PTXAS, and also supports --dopt.
  • #403 : The program builder class can now accept named header additions using std::string values for the name and/or the header source (rather than only C-style const char* strings).

Bug fixes

  • #396 : scoped_existence_ensurer_t, the gadget for ensuring that some context is current (regardless of which), now also makes sure the driver has been initialized.
  • #395 : Can now start profiling with our nvtx component even if the driver has not yet been initialized.

Other changes

  • #400 : Added an alias for waiting/synchronizing on an event: you can now execute cuda::wait(my_event), not just cuda::synchronize(my_event).
  • #399 : time_elapsed_between() can now accept a std::pair of events.
  • #398 : Added another example program, the CUDA sample bandwidthtest.
  • #401 : Made all stream enqueuing methods const (so you can now enqueue on a stream passed by const-reference).
  • #404 : Can now construct grid::overall_dimensions_t from a dim3 object, making it more interoperable with CUDA-related values you obtained elsewhere.