Join GitHub today
GitHub is home to over 40 million developers working together to host and review code, manage projects, and build software together.Sign up
Use faster lud_diagonal #11
This version of lud_diagonal uses intra-group parallelism and is faster
In comparison, lud.fut performs about similar to the old lud-clean.fut:
whereas the Rodinia implementation is a lot faster than ours:
I'm not sure there's much point in keeping both lud.fut and lud-clean.fut, as both seem to perform about the same (before this PR). I believe the original intent was to have a nice implementation that was easy (or easier) to understand and modify and one that ran fast, but that doesn't seem necessary any more.
lud-clean was originally created as a nicer but slower implementation of lud. However, it is not actually any slower any more, so we should replace lud with lud-clean. This commits does so.