-
Notifications
You must be signed in to change notification settings - Fork 0
Future Improvements
Nick Georgakopoulos edited this page Oct 3, 2021
·
5 revisions
-
transferdifferential
andrankmult
are the only functions that don't use preallocation or vectorization. The problem is that computing the sizes to be preallocated/vectorized incurs a performance penalty that is more than the speed advantage gained. Or so I think, there might be a better way to do it. Ultimately the code here could be improved.
- Investigate GPU-compute more thoroughly.