-
Notifications
You must be signed in to change notification settings - Fork 89
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Allow multiple inner products with the same vector in one kernel #32
Comments
This shouldn't be a problem. However, we need to fully specify an upper 2013/5/31 Karl Rupp notifications@github.com
|
Some time ago I implemented this for up to 8 different vectors (outside ViennaCL). Higher counts just repeatedly call these kernels then. Works like a charm :-) |
I prepared the field to handle dual-kernels with multiple operations. However, I am realizing packing multiple inner products together can only be done if there is no dependency. How/Where to check the dependency between operations? This should actually be a distinct tool, almost independent from the generator (since it may also be used in the scheduler or multi-devices)... |
I'd prefer to check this in the scheduler logic. Thus, I'd like to have a 'no aliasing can be assumed' policy for the generator, because this makes this way the generator can really focus on performance without having to do too much bookkeeping. The alias checking is much better done by a scheduler logic on top of the generator. |
Awesome ! This makes my life much much easier if independent operations are in independent custom operations! |
Implementation completed here: |
Required for some flavors of GMRES.
The text was updated successfully, but these errors were encountered: