Put in new opencl kernel for testing. #16

Merged (1 commit) on Jul 13, 2016

Conversation

jcvernaleo
Member

We need to do some more testing (not just in benchmark mode). @jolan, maybe at some point when you get a chance.
For the new kernel:

gominer -B

For the old one:

gominer -B -k ./blake256-old.cl

In benchmark mode I'm getting worse performance with the new kernel, but I didn't try to tune the intensity and I'm not really on hardware that would be useful in production.


jolan commented Jul 12, 2016

I remember a performance regression on Intel hardware but gains on AMD/Nvidia so I think that's expected. Will test soon.

@jcvernaleo
Member Author

In that case, if your testing (when you get to it) goes well, I may keep the old OpenCL kernel around and just rename it in a slightly less bad way (maybe in a subdirectory or something).


jolan commented Jul 13, 2016

With an Nvidia 750 Ti, my testing matches what the comment at the top of the optimized kernel says (~440Mh/s to ~520Mh/s).
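
For scale, those two figures correspond to a gain of roughly 18%. A minimal Go sketch of that arithmetic, assuming the approximate rates quoted above; `percentGain` is a hypothetical helper for illustration and not part of gominer:

```go
package main

import "fmt"

// percentGain is a hypothetical helper (not part of gominer). It returns the
// relative change from an old hash rate to a new one, expressed in percent.
func percentGain(oldRate, newRate float64) float64 {
	return (newRate - oldRate) / oldRate * 100
}

func main() {
	// Approximate rates reported in this thread for an Nvidia 750 Ti, in Mh/s.
	oldKernel, newKernel := 440.0, 520.0
	fmt.Printf("new kernel: %.1f%% faster\n", percentGain(oldKernel, newKernel))
}
```

Since both inputs are approximate, the result is only a ballpark figure.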

AMD was pretty inconclusive. It's not slower but not much faster either. I did try adjusting the intensity of both kernels but performance was still pretty close. I don't have a wide range of AMD cards to test with, but since I don't see any regression, I think this can go in.

OK by me since #17 was merged and I successfully solo mined a few blocks on testnet.

Minor modifications of the return format were needed to work with
gominer.

Keep old blake256 kernel as it performs better on Intel hardware but
not others.

The improvement is mainly for Nvidia cards.  AMD cards seem to have
little difference.  For testing with Nvidia 750 Ti it went from
~440Mh/s to ~520Mh/s.
jcvernaleo merged commit 7931e7d into decred:master on Jul 13, 2016
jcvernaleo deleted the jcv_cl branch on July 13, 2016 15:48
jcvernaleo added this to the v0.2.0 milestone on Jul 27, 2016