Memory leak P-1 #40
Comments
That should happen sooner on the RX580 which has 8GB RAM. |
Doing the pm1 test right now on the RX580; let's see how soon it occurs. |
Does this concern GPU memory or main system memory? |
OK. Could it be that there was not enough memory on the GPU, for example because something else (or another instance) was running at the same time and taking up some GPU memory? By default, P-1 allocates almost all GPU memory at the start of stage 2. It checks the available memory at the very beginning of the test (when it computes the plan), but only allocates it at the start of stage 2. So if something reduces the available GPU memory in between, the allocation in stage 2 will fail with a bad_alloc. |
To identify a memory leak, you should observe a gradual reduction in available memory over time, e.g. every time a new P-1 test starts, the available GPU memory drops by, say, 100MB. |
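As a rough illustration of the "gradual reduction" criterion above, here is a minimal sketch that takes one memory-usage reading per test and checks whether usage only ever grows. The function names, the sample values, and the 1 MB threshold are assumptions made for illustration; none of this comes from gpuowl itself.

```python
def growth_per_test(used_mb):
    """Average growth in MB between consecutive per-test readings."""
    if len(used_mb) < 2:
        return 0.0
    return (used_mb[-1] - used_mb[0]) / (len(used_mb) - 1)


def looks_like_leak(used_mb, min_growth_mb=1.0):
    """True if usage never drops between tests and grows on average.

    used_mb: one memory-usage reading (in MB) taken at the start of each test.
    min_growth_mb: hypothetical threshold below which growth is treated as noise.
    """
    deltas = [b - a for a, b in zip(used_mb, used_mb[1:])]
    if not deltas:
        return False
    return all(d >= 0 for d in deltas) and growth_per_test(used_mb) >= min_growth_mb


# Hypothetical readings: about 100 MB more in use at the start of each test.
leaky = [500, 600, 700, 800]
# Hypothetical readings: usage fluctuates but does not trend upward.
steady = [500, 520, 480, 510]
```

A single failed allocation would not show this pattern; a leak shows up as steady growth across many tests.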
On the failed P-1, what was the "buffers" message at the start of test? |
|
{"exponent":"86100473", "worktype":"PM1", "status":"NF", "program":{"name":"gpuowl", "version":"6.4-f6d3153"}, "timestamp":"2019-04-11 15:04:44 UTC", "user":"selroc", "computer":"RX580-9", "fft-length":4718592, "B1":20000, "B2":600000} verified with: No factor found. |
@valeriob01: That's fine. Here's why: the key is the exponent of 223. To cover 223^2 in stage 1, we must have B1 >= 223^2 = 49729, which is larger than 20000. Try it with B1=50000 or larger. Thanks for reporting it; luckily it was a bug in the test, not in the program :) |
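The arithmetic behind that explanation can be checked with a small sketch. It assumes, as the comment above states, that stage 1 covers a prime power q^e only when q^e <= B1; the helper name is hypothetical and this is not gpuowl's code.

```python
def stage1_covers(prime, power, b1):
    """True if prime**power is within the stage 1 bound B1.

    Assumption (taken from the comment above): a prime power q**e is
    covered by stage 1 only when q**e <= B1.
    """
    return prime ** power <= b1


needed = 223 ** 2  # smallest B1 that covers 223^2
covered_at_20000 = stage1_covers(223, 2, 20000)
covered_at_50000 = stage1_covers(223, 2, 50000)
```

Under that assumption, B1=20000 misses 223^2 while B1=50000 covers it, which matches the two reported results.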
OK. ./openowl -pm1 86100473 -B1 50000 -user selroc -cpu RX580-9 -device 0 ... 2019-04-12 06:09:36 RX580-9 {"exponent":"86100473", "worktype":"PM1", "status":"F", "program":{"name":"gpuowl", "version":"6.4-f6d3153"}, "timestamp":"2019-04-12 04:09:36 UTC", "user":"selroc", "computer":"RX580-9", "fft-length":4718592, "B1":50000, "B2":1500000, "factors":["15290240534639630110561"]} |
Sorry I didn't reply sooner; 4G signal is hard to come by. I meant system memory, not GPU memory; this PC just happens to have 16GB of DDR4. Each time a test starts, the memory usage of openowl (as shown by top) increases, until it fails to allocate memory for the next test and quits. GPU RAM is unaffected. |
I think some part of the P-1 computation is done on the CPU when a test starts or finishes, because the CPU becomes hot. |
There might be a leak related to GMP, which is used in the GCD computation on the CPU. Investigating. |
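For anyone wanting to log this instead of watching top, a minimal Linux-only sketch is to read the `VmRSS` field from `/proc/<pid>/status` once per test. The helper names are hypothetical, and the process ID has to be looked up separately (for example with `pidof`).

```python
def parse_vmrss_kb(status_text):
    """Extract the resident set size in kB from /proc/<pid>/status text.

    Returns None if there is no VmRSS line (e.g. for kernel threads).
    """
    for line in status_text.splitlines():
        if line.startswith("VmRSS:"):
            # The line looks like "VmRSS:     123456 kB".
            return int(line.split()[1])
    return None


def read_rss_kb(pid):
    """Read the current resident set size of a process in kB (Linux only)."""
    with open(f"/proc/{pid}/status") as f:
        return parse_vmrss_kb(f.read())


# Usage (the PID is whatever the running process has on your machine):
#   print(read_rss_kb(12345))
```

Recording one value per test makes it easy to see whether system memory grows steadily while GPU memory stays flat.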
Thanks! fixed |
Wow that was quick. Nice one. |
Please make sure you're running at least v6.5 for P-1.
|
Each time a new test starts it uses a little system memory and doesn't seem to free it afterwards. I encountered this when doing many small P-1 tests; it took ~150 tests to fill 16GB of memory, so it's unlikely to be encountered under normal use.