WARNING: All miners crashed. Immediately restarting loop. #1389

Closed
minergenon opened this issue Feb 8, 2021 · 28 comments
@minergenon

minergenon commented Feb 8, 2021

After starting RM, miners run on both the GPU and the CPU, but once the benchmark time has passed, they crash. The miners find shares normally and display hash rates normally, so why does RM stop them?
On Windows 7 everything works fine on the same machine, so why did it stop working on Manjaro (it worked until about 4.6.7.0)?

@RainbowMiner
Owner

RainbowMiner commented Feb 8, 2021

I will check this - could you please upload a Benchmark file?

Question: is that Manjaro a Win7 machine as well? Which version of PowerShell is currently in use?

@RainbowMiner RainbowMiner self-assigned this Feb 8, 2021
@minergenon
Author

minergenon commented Feb 8, 2021

Linux sergey-pc 5.10.7-3-MANJARO #1 SMP PREEMPT Fri Jan 15 21:11:34 UTC 2021 x86_64 GNU/Linux

PowerShell 7.1.1
Logs.tar.gz

@RainbowMiner
Owner

Ah, ok, it's a linux machine. I'll look through your logs.

@RainbowMiner
Owner

RainbowMiner commented Feb 8, 2021

Weird. OpenCL seems to think there are two different AMD platforms, and thus tries to use two GPUs with index 0 ("0,0").
There is a file ./Data/gpu-count.txt - could you please upload it, too?
It almost looks as if one of the AMDs doesn't have a PCI port number. Is there something special about your setup? Any multiplexers?

EDIT: For my info: this is connected to #1352

@minergenon
Author

minergenon commented Feb 8, 2021

Hold
gpu-count.txt
lscpu.txt

@RainbowMiner
Owner

There is only one GPU, not two, right?

@minergenon
Author

That's right, one

@RainbowMiner
Owner

Could you please run ./gputest.sh again and upload the ./gputestresult.txt file?

Also, please try the following:

  • go to a shell window
  • enter pwsh + Return
  • enter Get-Command "screen" + Return
  • what does it say?
  • now exit pwsh by entering exit + Return

@minergenon
Author

GNU Screen version 4.08.00 (GNU) 05-Feb-20

Copyright (c) 2018-2020 Alexander Naumov, Amadeusz Slawinski
Copyright (c) 2015-2017 Juergen Weigert, Alexander Naumov, Amadeusz Slawinski
Copyright (c) 2010-2014 Juergen Weigert, Sadrul Habib Chowdhury
Copyright (c) 2008-2009 Juergen Weigert, Michael Schroeder, Micah Cowan, Sadrul Habib Chowdhury
Copyright (c) 1993-2007 Juergen Weigert, Michael Schroeder
Copyright (c) 1987 Oliver Laumann

This program is free software; you can redistribute it and/or modify it under the terms of the GNU
General Public License as published by the Free Software Foundation; either version 3, or (at your
option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even
the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General
Public License for more details.

You should have received a copy of the GNU General Public License along with this program (see the
file COPYING); if not, see https://www.gnu.org/licenses/, or contact Free Software Foundation, Inc.,
51 Franklin Street, Fifth Floor, Boston, MA 02111-1301 USA.

Send bugreports, fixes, enhancements, t-shirts, money, beer & pizza to screen-devel@gnu.org

Capabilities:
+copy +remote-detach +power-detach +multi-attach +multi-user +font +color-256 +utf8 +rxvt
+builtin-telnet

(base) [sergey@sergey-pc ~]$ pwsh
PowerShell 7.1.1
Copyright (c) Microsoft Corporation.

https://aka.ms/powershell
Type 'help' to get help.

PS /home/sergey> screen
[screen is terminating]
PS /home/sergey>

gputestresult.txt

@minergenon
Author

Here's more
clinfo result.txt

@RainbowMiner
Owner

At least clinfo and gputest.sh return the same result :) But it seems like you have two drivers running.
One is OpenCL 1.1 Mesa 21.1.0-devel (git-766538f83c), the other is OpenCL 1.2 AMD-APP (3188.4). The active one seems to be 3188.4, since it returns a PCI address. Do you remember installing the 21.1.0-devel driver?

I could add a quick fix that checks for missing PCI bus addresses and skips inactive GPUs. But that does not solve the startup problems. I will look further.
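
A rough sketch of the idea of skipping devices without a PCI bus address, written as a shell filter. This is not RainbowMiner's actual code. The input format (one device per line, fields separated by ';', third field holding the PCI bus id) and the function name filter_active_gpus are assumptions made only for this illustration.

```shell
# Hypothetical input, one device per line:
#   <OpenCL platform version>;<device name>;<PCI bus id, may be empty>
# filter_active_gpus reads such records on stdin and prints only those
# whose third field contains at least one non-blank character.
filter_active_gpus() {
    awk -F';' '$3 ~ /[^ \t]/ { print }'
}
```

Fed one record per OpenCL platform, this would keep only the entry that reports a PCI address, so a duplicate platform without one would no longer produce a second device with index 0.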

@RainbowMiner
Owner

RainbowMiner commented Feb 8, 2021

Ok,
try that:

  • open a new shell
  • enter cd /home/sergey/RainbowMiner/Bin/ANY-SRBMinerMulti
  • look for files with ls start_cpu00_*sh - are there any?
  • if yes, try to start one: ./start_cpu00_zergpool_cpupower.sh - what does it say? Does it run, or are there any errors?
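
The steps above can be sketched as a small shell helper that lists the generated start scripts in a miner directory. The function name list_start_scripts and the default prefix argument are assumptions for illustration; only the start_cpu00_*sh pattern comes from the thread.

```shell
# list_start_scripts DIR [PREFIX]
# Prints the names of files in DIR matching PREFIX*sh, one per line.
# Returns 0 if at least one script exists, 1 otherwise.
list_start_scripts() {
    lss_dir="$1"
    lss_prefix="${2:-start_cpu00_}"
    lss_status=1
    for lss_file in "$lss_dir"/"$lss_prefix"*sh; do
        # An unmatched glob stays literal, so skip anything that does not exist.
        [ -e "$lss_file" ] || continue
        printf '%s\n' "${lss_file##*/}"
        lss_status=0
    done
    return "$lss_status"
}
```

For example, list_start_scripts /home/sergey/RainbowMiner/Bin/ANY-SRBMinerMulti would print the script names if any exist; one of them can then be started by hand to see whether the miner itself runs.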

@minergenon
Author

Mesa 21.1.0-devel was added via pamac-manager.

(base) [sergey@sergey-pc ~]$ cd /home/sergey/RainbowMiner/Bin/ANY-SRBMinerMulti
(base) [sergey@sergey-pc ANY-SRBMinerMulti]$ ls start_cpu00_*sh
start_cpu00_zergpool_cpupower.sh start_cpu00_zergpool_yespower2b.sh
(base) [sergey@sergey-pc ANY-SRBMinerMulti]$ ./start_cpu00_zergpool_cpupower.sh

CPU0 : AMD FX(tm)-8350 Eight-Core Processor [L3: 8192 KB][L2: 8192 KB][L1: 128 KB][PU: 8]

======================================================================
SRBMiner-MULTI 0.6.5

Press 's' to display stats
Press 'h' to display hashrate
Press 'p' to switch to the next pool
Press 'o' to switch to the previous pool

Algorithm/s : cpupower [0.85% fee]
Gpu mining : disabled
Cpu mining : enabled
Huge-pages : enabled
HW-Aes : available

[2021-02-09 18:45:41] 8 CPU core/s - 8 thread/s detected
[2021-02-09 18:45:41] Using 7 thread/s for CPU mining
[2021-02-09 18:45:43] Connected to cpupower.mine.zergpool.com:4250 [cpupower]
[2021-02-09 18:45:43] Difficulty 0.400000 [cpupower][0]
[2021-02-09 18:45:43] Job received [fb] block height 835661 [cpupower][0]
[2021-02-09 18:45:44] Job received [fb] block height 835661 [cpupower][0]
[2021-02-09 18:46:13] Job received [fc] block height 835662 [cpupower][0]
[2021-02-09 18:46:42] CPU result accepted [ 310ms] [cpupower][0]
[2021-02-09 18:46:57] Difficulty 0.200000 [cpupower][0]
[2021-02-09 18:46:57] Job received [fd] block height 835663 [cpupower][0]
[2021-02-09 18:47:15] Difficulty 0.100000 [cpupower][0]
[2021-02-09 18:47:15] Job received [fe] block height 835664 [cpupower][0]
[2021-02-09 18:47:15] CPU result accepted [ 361ms] [cpupower][0]
[2021-02-09 18:47:15] Job received [ff] block height 835665 [cpupower][0]
[2021-02-09 18:47:17] CPU result accepted [ 286ms] [cpupower][0]
[2021-02-09 18:47:41] ==========================================================
[2021-02-09 18:47:41] [0] [cpupower]
[2021-02-09 18:47:41] CPU : 581.34 H/s
[2021-02-09 18:47:41] Total: 581.34 H/s
[2021-02-09 18:47:41] ==========================================================
[2021-02-09 18:48:06] Job received [100] block height 835665 [cpupower][0]
[2021-02-09 18:48:07] CPU result accepted [ 192ms] [cpupower][0]
[2021-02-09 18:48:13] Job received [101] block height 835666 [cpupower][0]
[2021-02-09 18:48:18] CPU result accepted [ 397ms] [cpupower][0]
[2021-02-09 18:48:30] Job received [102] block height 835667 [cpupower][0]
[2021-02-09 18:48:30] CPU result accepted [ 378ms] [cpupower][0]
[2021-02-09 18:48:46] Job received [103] block height 835668 [cpupower][0]
[2021-02-09 18:48:51] Job received [104] block height 835669 [cpupower][0]
[2021-02-09 18:48:55] CPU result accepted [ 530ms] [cpupower][0]
[2021-02-09 18:49:06] Job received [105] block height 835670 [cpupower][0]
[2021-02-09 18:49:10] CPU result accepted [ 302ms] [cpupower][0]
[2021-02-09 18:49:18] CPU result accepted [ 598ms] [cpupower][0]
[2021-02-09 18:49:40] Job received [106] block height 835671 [cpupower][0]
[2021-02-09 18:49:41] ==========================================================
[2021-02-09 18:49:41] [0] [cpupower]
[2021-02-09 18:49:41] CPU : 620.74 H/s
[2021-02-09 18:49:41] Total: 620.74 H/s
[2021-02-09 18:49:41] ==========================================================
[2021-02-09 18:49:49] Job received [107] block height 835672 [cpupower][0]
[2021-02-09 18:49:53] CPU result accepted [ 554ms] [cpupower][0]
[2021-02-09 18:49:56] CPU result accepted [ 403ms] [cpupower][0]
[2021-02-09 18:50:11] CPU result accepted [ 438ms] [cpupower][0]
[2021-02-09 18:50:25] CPU result accepted [ 598ms] [cpupower][0]
[2021-02-09 18:50:25] CPU result accepted [ 501ms] [cpupower][0]
[2021-02-09 18:50:43] Job received [108] block height 835673 [cpupower][0]
[2021-02-09 18:50:50] CPU result accepted [ 274ms] [cpupower][0]
[2021-02-09 18:50:55] CPU result accepted [ 319ms] [cpupower][0]
[2021-02-09 18:50:59] CPU result accepted [ 511ms] [cpupower][0]
[2021-02-09 18:51:02] CPU result accepted [ 349ms] [cpupower][0]
[2021-02-09 18:51:04] CPU result accepted [ 331ms] [cpupower][0]
[2021-02-09 18:51:06] CPU result accepted [ 583ms] [cpupower][0]
[2021-02-09 18:51:16] CPU result accepted [ 329ms] [cpupower][0]
[2021-02-09 18:51:20] CPU result accepted [ 418ms] [cpupower][0]
[2021-02-09 18:51:22] CPU result accepted [ 180ms] [cpupower][0]
[2021-02-09 18:51:23] CPU result accepted [ 614ms] [cpupower][0]
[2021-02-09 18:51:26] CPU result accepted [ 179ms] [cpupower][0]
[2021-02-09 18:51:29] Job received [109] block height 835674 [cpupower][0]
[2021-02-09 18:51:41] ==========================================================
[2021-02-09 18:51:41] [0] [cpupower]
[2021-02-09 18:51:41] CPU : 635.66 H/s
[2021-02-09 18:51:41] Total: 635.66 H/s
[2021-02-09 18:51:41] ==========================================================
[2021-02-09 18:51:46] Worker thread 0 stopped
[2021-02-09 18:51:46] Worker thread 1 stopped
[2021-02-09 18:51:47] Worker thread 2 stopped
[2021-02-09 18:51:47] Worker thread 3 stopped
[2021-02-09 18:51:47] Worker thread 4 stopped
[2021-02-09 18:51:47] Worker thread 5 stopped
[2021-02-09 18:51:47] Worker thread 6 stopped
(base) [sergey@sergey-pc ANY-SRBMinerMulti]$ ./start_cpu00_zergpool_yespower2b.sh

CPU0 : AMD FX(tm)-8350 Eight-Core Processor [L3: 8192 KB][L2: 8192 KB][L1: 128 KB][PU: 8]

======================================================================
SRBMiner-MULTI 0.6.5

Press 's' to display stats
Press 'h' to display hashrate
Press 'p' to switch to the next pool
Press 'o' to switch to the previous pool

Algorithm/s : yespower2b [0.85% fee]
Gpu mining : disabled
Cpu mining : enabled
Huge-pages : enabled
HW-Aes : available

[2021-02-09 18:52:28] 8 CPU core/s - 8 thread/s detected
[2021-02-09 18:52:28] Using 7 thread/s for CPU mining
[2021-02-09 18:52:30] Connected to power2b.mine.zergpool.com:7445 [yespower2b]
[2021-02-09 18:52:30] Difficulty 0.800000 [yespower2b][0]
[2021-02-09 18:52:30] Job received [103] block height 699232 [yespower2b][0]
[2021-02-09 18:52:31] Job received [103] block height 699232 [yespower2b][0]
[2021-02-09 18:52:44] Job received [104] block height 699232 [yespower2b][0]
[2021-02-09 18:52:48] Job received [105] block height 699233 [yespower2b][0]
[2021-02-09 18:52:48] CPU result accepted [ 207ms] [yespower2b][0]
[2021-02-09 18:52:57] Job received [106] block height 699234 [yespower2b][0]
[2021-02-09 18:53:30] Job received [107] block height 699235 [yespower2b][0]
[2021-02-09 18:54:28] ==========================================================
[2021-02-09 18:54:28] [0] [yespower2b]
[2021-02-09 18:54:28] CPU : 585.19 H/s
[2021-02-09 18:54:28] Total: 585.19 H/s
[2021-02-09 18:54:28] ==========================================================
[2021-02-09 18:54:30] Job received [108] block height 699235 [yespower2b][0]
[2021-02-09 18:54:57] Job received [109] block height 699236 [yespower2b][0]
[2021-02-09 18:55:12] Job received [10a] block height 699237 [yespower2b][0]
[2021-02-09 18:55:22] Job received [10b] block height 699238 [yespower2b][0]
[2021-02-09 18:56:28] ==========================================================
[2021-02-09 18:56:28] [0] [yespower2b]
[2021-02-09 18:56:28] CPU : 561.31 H/s
[2021-02-09 18:56:28] Total: 561.31 H/s
[2021-02-09 18:56:28] ==========================================================
[2021-02-09 18:56:38] Job received [10c] block height 699238 [yespower2b][0]
[2021-02-09 18:57:00] Job received [10d] block height 699239 [yespower2b][0]
[2021-02-09 18:58:03] Socket error : 104 [yespower2b]
[2021-02-09 18:58:03] Reconnecting to power2b.mine.zergpool.com:7445 in 3 seconds [yespower2b]
[2021-02-09 18:58:06] Connected to power2b.mine.zergpool.com:7445 [yespower2b]
[2021-02-09 18:58:06] Difficulty 0.800000 [yespower2b][0]
[2021-02-09 18:58:06] Job received [10f] block height 699241 [yespower2b][0]
[2021-02-09 18:58:07] Job received [10f] block height 699241 [yespower2b][0]
[2021-02-09 18:58:28] ==========================================================
[2021-02-09 18:58:28] [0] [yespower2b]
[2021-02-09 18:58:28] CPU : 551.40 H/s
[2021-02-09 18:58:28] Total: 551.40 H/s
[2021-02-09 18:58:28] ==========================================================
[2021-02-09 18:59:07] Job received [110] block height 699241 [yespower2b][0]
[2021-02-09 18:59:20] CPU result accepted [ 136ms] [yespower2b][0]
[2021-02-09 18:59:37] Difficulty 0.400000 [yespower2b][0]
[2021-02-09 18:59:37] Job received [111] block height 699242 [yespower2b][0]
[2021-02-09 18:59:54] CPU result accepted [ 267ms] [yespower2b][0]
[2021-02-09 18:59:57] CPU result accepted [ 263ms] [yespower2b][0]
[2021-02-09 19:00:10] Difficulty 0.200000 [yespower2b][0]
[2021-02-09 19:00:10] Job received [112] block height 699243 [yespower2b][0]
[2021-02-09 19:00:17] Job received [113] block height 699244 [yespower2b][0]
[2021-02-09 19:00:28] ==========================================================
[2021-02-09 19:00:28] [0] [yespower2b]
[2021-02-09 19:00:28] CPU : 606.09 H/s
[2021-02-09 19:00:28] Total: 606.09 H/s
[2021-02-09 19:00:28] ==========================================================
[2021-02-09 19:00:33] Worker thread 0 stopped
[2021-02-09 19:00:33] Worker thread 1 stopped
[2021-02-09 19:00:34] Worker thread 2 stopped
[2021-02-09 19:00:34] Worker thread 3 stopped
[2021-02-09 19:00:34] Worker thread 4 stopped
[2021-02-09 19:00:34] Worker thread 5 stopped
[2021-02-09 19:00:34] Worker thread 6 stopped
(base) [sergey@sergey-pc ANY-SRBMinerMulti]$
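
To compare standalone runs like the two above, the miner output can be condensed with a short awk filter. This is only a sketch that assumes the line layout shown in this log ("CPU result accepted" and "Total: <number> H/s"); the function name summarize_miner_log is made up for the example.

```shell
# summarize_miner_log: reads miner log lines on stdin and prints the number
# of accepted results, the number of hashrate reports, and the mean of the
# reported totals (taken from the field just before the trailing "H/s").
summarize_miner_log() {
    awk '
        /CPU result accepted/ { accepted++ }
        /Total:/              { sum += $(NF - 1); reports++ }
        END {
            printf "accepted=%d reports=%d", accepted, reports
            if (reports > 0) printf " mean_total=%.2f H/s", sum / reports
            printf "\n"
        }'
}
```

A run that reports hashrates and accepted results here, yet is flagged as crashed inside RM, points at how the miner is started or monitored rather than at the miner binary.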

@minergenon
Author

The whole point is that after the benchmark completes successfully, the result is not saved; instead the miner breaks in RM.

@RainbowMiner
Owner

Ah! Ok! That makes things a bit easier to research.

@RainbowMiner
Owner

Could you please check whether start-stop-daemon is installed? It doesn't matter if it's not, but I need to know:

  • open a new shell
  • enter start-stop-daemon --version
  • is it there?
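
The same kind of check can be done for several tools at once with the POSIX command -v builtin. A minimal sketch; the function name report_tools is an assumption for illustration.

```shell
# report_tools NAME...
# For each name, prints "NAME: found at PATH" if the shell can resolve it,
# or "NAME: not installed" otherwise. For shell builtins and aliases,
# command -v prints the name or alias definition instead of a file path.
report_tools() {
    for rt_tool in "$@"; do
        if rt_path="$(command -v "$rt_tool" 2>/dev/null)"; then
            printf '%s: found at %s\n' "$rt_tool" "$rt_path"
        else
            printf '%s: not installed\n' "$rt_tool"
        fi
    done
}
```

For example, report_tools screen start-stop-daemon covers both tools asked about in this thread.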

@minergenon
Author

(base) [sergey@sergey-pc RainbowMiner]$ start-stop-daemon --version
bash: start-stop-daemon: command not found
(base) [sergey@sergey-pc RainbowMiner]$

Not installed

@RainbowMiner
Owner

Ok, thank you! I'll dig on.

RainbowMiner added a commit that referenced this issue Feb 9, 2021
- add debugging messages to StartInScreen (issue #1389)
- give a screen 5 seconds to appear (issue #1389)
@RainbowMiner
Owner

I have changed a timer during the startup of miners and added more debug information. Could you download the newest master.zip and extract it into your RainbowMiner folder and then start RainbowMiner again?
If it still fails - please send me the Logs folder again.

@minergenon
Author

Nothing changed

Logs.tar.gz

@RainbowMiner
Owner

Thank you for trying - please look into your Config/config.txt. Is "EnableMinersAsRoot" set to "1"?
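
The setting can also be read from a shell without opening the file. This is a sketch only: it assumes the key sits on its own line in the form "Key": "Value", and the function name config_value is hypothetical. Keys containing regex metacharacters are not handled.

```shell
# config_value FILE KEY
# Prints the string value of the first line in FILE that looks like
#   "KEY": "VALUE"
# and prints nothing if no such line exists.
config_value() {
    sed -n 's/^[[:space:]]*"'"$2"'"[[:space:]]*:[[:space:]]*"\([^"]*\)".*$/\1/p' "$1" | head -n 1
}
```

For example, config_value Config/config.txt EnableMinersAsRoot should print 1 when the option is enabled.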

@minergenon
Author

Yes.
"EnableMinersAsRoot": "1",

@minergenon minergenon mentioned this issue Feb 11, 2021
RainbowMiner added a commit that referenced this issue Feb 12, 2021
- finally fix issue #1396, issue #1389
- add proper screen kill at the end of core
@RainbowMiner
Owner

I have now reworked the whole start/stop process for Linux machines that run RainbowMiner under a non-root account (commit 752d6ae). The problem also relates to issues #1396, #1389 and #1371.

Please try git clone or download the current master.zip and try again. Make sure "EnableMinersAsRoot" is set to "1" in config.txt and the ocdaemon is up and running (enter ocdaemon status).

If you still run into trouble, please let RainbowMiner run and open http://localhost:4000 in your browser, then click "Debug file" and upload the zip file here.

Thank you for your help!

@TheeeX

TheeeX commented Jan 2, 2022

Getting this error on RainbowMiner v4.7.9.7

[Client-Mode] Connected to localhost:8000
 
Waiting 35s until next run: E[x]it, [R]estart, [B]alance update, [S]kip SP, [W]D reset, [C]onfiguration, [V]erbose, [P]ause, [L]ock
WARNING: All miners crashed. Immediately restarting loop. 
Finished waiting - starting next run           

I have Manjaro OS and PowerShell 7.2.1.

@RainbowMiner
Owner

Always, or only sometimes?

@TheeeX

TheeeX commented Jan 2, 2022

I don't remember seeing it before; it started appearing recently. I tried running 4.7.7.1, the last RainbowMiner version I had been using for some time, and it also prints the same warning.

At first I thought it might be due to a version change in RainbowMiner, so I cloned the new version v.7.9.7, but it still prints the warning.

It has stopped showing the warning now!

It appears whenever it starts benchmarking a miner.

And now I noticed: WARNING: No miners available: disabled by Watchdog

@TheeeX

TheeeX commented Jan 3, 2022

@RainbowMiner I tried running the miners individually, using the commands from the benchmarking log. I noticed the user parameter was being passed as xxx and the miner could not run successfully; when I changed it to the BTC address from my old logs, it started running.

Any thoughts on why this happened? Was the missing user the reason for all the crashes? And what should I check?

@RainbowMiner
Owner

Because user was not set, it was the reason for all the crash?

Yes, most probably. I cannot tell exactly, though, because I would need some more insight. It would be best if you could provide me with a "Debug file": open http://localhost:4000 in your browser, then click "Debug file" on the left-hand side (pretty far down the page). It will create a zip file that has all sensitive data x-ed out, so you can safely upload it here.
