New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
core dump runtime - fopen failed #925
Comments
ADD: RAM is server ECC, not overclocked, tested with memtest86 burnin - RAM is fine... |
that's gotta be a drive issue |
I doubt it's that easy: eraraid from raidix - Right now I am testing the eraraid for some time. Experience so far:
Various suggestions assuming you took a standard approach in file handling:
|
deleted since replacement test disk was full (OS Samsung 980 Pro) |
so problem solved? |
No. What I did: Test run with a physical NVME (Samsung 980 Pro). Result: completed successfully: Overall too slow, System load dropped below 20% by times. So yes, I want to use the eraraid, which should be possible. Its tested, works perfectly fine without any hickup - even when creating 50+ plots simultaneously the classic way. What I did: checked with several options by removing any limitations (which shouldnt be a problem; still removed them nonetheless) - problem persists on eraraid. Opened a call with raidix. What I ask from you if possible: see, where this issue can result from and if possible point out, where this may come from (support me in my raidix call). I am aware this is a rather uncommon issue by using a software raid from HPC solutions. Yet when talking raw speed, I figure you may be really interested to make it work. |
Add - remark: As long as your chia-plotter is running on eraraid, I saw the physical write limit cap at ~15+GB/s. Still there is room for more since that high write load only happens app 1-2 seconds during 10 seconds. Comparing the raid with the slow 980 Pro almost hurts... when being unable to use it. |
From raidix support:
Any idea? |
@madMAx43v3r any chance you support me in my call with raidix? In case I didn't state it clearly: the eraraid is fine on anything else. Even the most heavy load over prolonged time (10h+). Verified it again myself. Cooling is fine too (way below 60°C on heavy load). |
More information: usually the fopen error comes up, once a phase is finished. Sometimes after phase 1, last time after phase 2:
|
Working Directory: /srv/tmp/
Working Directory 2: /srv/tmp/
Plot Name: plot-k32-2021-08-30-09-33-20e24798982d5b5e8cb2777fcddc1eeb470ae05eca66e2212342398e1480fe44
[P1] Table 1 took 25.1035 sec
[P1] Table 2 took 192.124 sec, found 4295041098 matches
[P1] Table 3 took 200.674 sec, found 4295058971 matches
terminate called after throwing an instance of 'std::runtime_error'
what(): thread failed with: fopen() failed
Aborted (core dumped)
In previous issues the trailing slash was the issue for that error. Yet here the trailing slash is set. Things to note in my system:
The text was updated successfully, but these errors were encountered: