Persistent thread operator implementation #602

jowens · 2019-09-16T20:37:12Z

@neoblizz suggests we do a persistent-thread version of operators. CUDA has increasing support for persistent thread programming and cooperative-groups has good synchronization capabilities, and we can reasonably expect this support will improve in future hw/sw. The specific benefit is that if we use a PT model, we can achieve kernel fusion within a PT kernel between two operators.

@neoblizz notes that we should consider implementing alternate operators for our current operators that are implemented as PT operators at the device level. Programs would not need to change; instead a command-line switch could decide between a PT operator and the current non-PT operator.

porumbes · 2019-10-08T20:19:12Z

@YuxinxinChen take a look and discuss some of your current work with @neoblizz when you get a chance.

jowens added the 🐲 enhancement Add or request enhancements to existing functionalities within gunrock. label Sep 16, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Persistent thread operator implementation #602

Persistent thread operator implementation #602

jowens commented Sep 16, 2019

porumbes commented Oct 8, 2019

Persistent thread operator implementation #602

Persistent thread operator implementation #602

Comments

jowens commented Sep 16, 2019

porumbes commented Oct 8, 2019