[WIP] allow one thread for vega gpus #624
base: dev
With PR637 I got some results, but with this PR624 Windows crashes instantly at the generated intensity of 1536 (2x Vega64). Here is the log
I also tried intensity 3816, and Windows crashes too.
At intensity 1008 I got some results, but anything above it crashes.
|
@Njeroe big thanks for your test. It could be that Windows kills the miner if one thread uses the GPU for too long. Nevertheless, your feedback is very helpful. |
@psychocrypt I know this was already tested but I also gave it a try and I can confirm the results... After compiling and running with the auto generated amd.txt file (with intensity at 1562) Windows 10 seems to lock up and crash... BSOD ends with different reasons for the crash each time. Mining Rig on a Xeon E2650 v2 Server with 5x Vega 56(bios modded to 64) and 3x Vega 64 and optimized Power Play table. |
Thanks for the feedback. I will work on this next year. I tested it on
NVIDIA and it works, but I currently have no AMD card.
|
Using those 8GBs in one alloc might necessitate setting the 64 bit device flag. "set GPU_FORCE_64BIT_PTR=1" |
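A rough sketch of the arithmetic behind this suggestion. It assumes that each unit of intensity is one parallel hash and that each hash needs one 2 MiB CryptoNight scratchpad; smaller per-hash buffers are ignored, and the actual limit a driver enforces (for example the OpenCL maximum allocation size) may differ. The function names are made up for illustration.

```python
# Assumption: one unit of intensity = one parallel hash = one 2 MiB scratchpad.
SCRATCHPAD_BYTES = 2 * 1024 * 1024
# Number of bytes that can be addressed with a 32-bit pointer.
ADDRESSABLE_32BIT_BYTES = 2**32


def scratchpad_bytes(intensity: int) -> int:
    """Total scratchpad memory for one GPU thread at the given intensity."""
    return intensity * SCRATCHPAD_BYTES


def exceeds_32bit_addressing(intensity: int) -> bool:
    """True if one buffer of this size is larger than 32-bit pointers can address."""
    return scratchpad_bytes(intensity) > ADDRESSABLE_32BIT_BYTES


for label, intensity in (("one of two threads", 1932), ("single thread", 3864)):
    mib = scratchpad_bytes(intensity) // (1024 * 1024)
    print(f"{label}: intensity {intensity} -> {mib} MiB, "
          f"beyond 32-bit addressing: {exceeds_32bit_addressing(intensity)}")
```

Under these assumptions, two threads at 1932 each stay below 4 GiB per allocation, while one thread at 3864 needs roughly 7.5 GiB in a single allocation.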
Tested this PR with one RX 580 and two RX 480 GPUs in Linux with AMD drivers 17.50 + ROCm kernel 1.6. Result without any modification of |
There are still bugs in this PR. I need to fix them first.
I am currently setting up my Linux system so that I can test it on my own too.
|
I can get as high as intensity: 4016 on a Vega FE but only if I disable strided_index. With it enabled I get instant BSOD |
There is a bug in my implementation, which is why the miner crashes. I think I
will close this PR because I expect no benefit from it.
Two independent threads per GPU will have more advantages.
|
What would be the benefit of running 1 thread instead of 2? |
The benefit could have been having to optimize the intensity of just one thread instead of two. But in this context we need to know what advantages a multi-thread setup gives us. |
Even with 2 threads it is sometimes beneficial to be able to set an intensity higher than 2024. |
Instant crash for me, regardless of intensity or strided_index. Testing on 6x vega 56. |
This pull request allows using more than 2k threads within one GPU thread. With this PR there should be no need to spawn two CPU threads per Vega GPU.
f0979ec to 675134c
I fixed the bugs and tested this PR on my RX570: I got a small hash rate increase (~2% more hashes). Download: https://github.com/psychocrypt/xmr-stak/archive/topic-vegaOneThread.zip @Njeroe @JoKeRz42o @taisel @uentity @Nuke33 could you please test whether this single thread per GPU has an advantage over two threads? Please try to increase the intensity up to the maximum allowed by your device. |
Thanks. Could you please also test the hash rate of the new PR with the old config
(2x1932)?
|
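For anyone comparing the two setups, here is a minimal sketch that generates the per-GPU thread entries for both variants. The field names follow the `gpu_threads_conf` entries I know from `amd.txt`, but treat them, the `worksize` value of 8, and the helper name `gpu_thread_entries` as assumptions and compare against the file your build actually generates.

```python
import json


def gpu_thread_entries(gpu_index, total_intensity, threads, strided_index=True):
    """Split a total intensity evenly across `threads` entries for one GPU.

    Assumed field names; worksize 8 is only a placeholder value.
    """
    if threads < 1:
        raise ValueError("threads must be at least 1")
    per_thread = total_intensity // threads  # any remainder is dropped
    return [
        {
            "index": gpu_index,
            "intensity": per_thread,
            "worksize": 8,
            "affine_to_cpu": False,
            "strided_index": strided_index,
        }
        for _ in range(threads)
    ]


# Old config: two threads at 1932 each. New config: one thread at 3864.
old_cfg = gpu_thread_entries(0, 3864, threads=2)
new_cfg = gpu_thread_entries(0, 3864, threads=1)
print(json.dumps(old_cfg, indent=2))
print(json.dumps(new_cfg, indent=2))
```

Both variants request the same total intensity, so any hash rate difference should come from how the work is split rather than from the amount of work.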
This PR is definitely worth merging. For RX 480 I found the sweet spot at Actually my Vega 64 also mines successfully with this PR, but the setup is far from optimal and thus the absolute hashrate value is somewhat meaningless (I'm getting ~1208 H/s at best). This is because my CPU (AMD FX) doesn't support the PCIe atomics required for AMD Vega 64 TL;DR: best performance is ~1208 H/s with |
I can confirm that the last PR is increasing hashrates by around 5% on Vega GPUs. Interestingly I noticed that you can now set much higher intensities if HBCC can utilize more RAM. For example I could only set the Vega64 to intensity 1982 for 2 threads with 8GB RAM, but with 32GB RAM and maxed out HBCC slider it was possible to set intensity to 2012. |
I built this PR, but I am unable to get it to work properly, with one Vega 64 Liquid. On one thread with intensity 3864 the miner hangs. Let me know if you wish me to retest at any point. |
Here is one user who has just tested xmr-stak with PR 624 on the dev branch. With two threads at intensity=500 each, this is my hashrate report with a single GPU (AMD ATI HD 6990):
Is this the maximum expected? |
For the 6990, please check thread #472.
You need to use the correct driver.
|
@psychocrypt when will we finally see this PR merged? :-) |
I think never. It shows no effect and is counterproductive to the strided
index option.
Do you need it?
|
Hmm.. It was effective on my setup (see my comment above from 14 Feb). But I didn't track recent development and just discovered the new I think I should wait for the upcoming AMDGPU-PRO 18.10 release and check if it would finally allow me to run two threads effectively on Vega64 (as everybody does). |
This pull request allows using more than 2k threads within one GPU thread. With this PR there should be no need to spawn two CPU threads per Vega GPU.
Note: This is only a hacked version, please do not look at the implementation.
HowTo
amd.txt
is created
@JerichoJones @davidpesce Could you please report the hash rate, e.g. with intensity 3864 or higher?