Skip to content
Charles Determan edited this page Jun 20, 2018 · 2 revisions

crossprod(x) slower than t(x) %*% x

See issue #85. This is only has been shown for NVIDIA devices. It appears that either kernel or the OpenCL backend maintained by NVIDIA makes this operation less performant.

pocl backend

  1. Negative base to fractional powers (e.g. (-2)^0.5).

    • This should return nan values but there is a bug in the pocl platform issue 518
    • Fixed in version >= 0.15
  2. Logs of negatives

    • This should return nan values but there is a bug in the pocl platform issue 528
    • Fixed in version >= 0.15