-
Notifications
You must be signed in to change notification settings - Fork 4.2k
-
Notifications
You must be signed in to change notification settings - Fork 4.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Mainnet stuck on 2020-12-04 #13958
Comments
current most suscipus log:
|
Pulled from
EDIT: expanded logs
|
5XKJwdKB2Hs7pkEXzifAysjSk6q7Rt6k5KfHwmAMPtoQ is Moonlet.io |
|
HoXANZnWTGeePertqWkMEnnhgXjTjzpfWaT2kja2ZgVU is GRom81. Perhaps the leader change never showed up in the log because of the dead slot? |
Graph from one our nodes that marked 53180936 as dead: 89% stake on 53180935 |
I decided to investigate slot Is it normal to send repair requests for your own slot?
|
This issue has now run its course, with the resulting fixes and improvements tracked elsewhere. |
Are there any infos about what the bug was? |
Hi @mankinskin! The postmortem blog post is here, https://medium.com/solana-labs/mainnet-beta-stall-postmortem-ba0c6064e3 |
Thanks! |
Edit: People are linking to this issue claiming that "Solana is not BFT" and a single faulty leader can take it down, or that this is somehow a design flaw. This is false - Solana has leaders, but 1/3+ of them have to be faulty to halt the network. People have even been running malicious fuzzer nodes on the Tour de SOL network that flipped random bits in blocks they produced. What happened here is that a single misbehaving leader triggered a bug that broke consensus, nothing more, nothing less.
(Not an official statement, and I'm not a Solana team member - Certus One runs a Solana validator and we were part of this incident investigation - yay decentralization!)
Mainnet is stuck on root slot 53180903, at 2020-12-04 13:45:45 UTC:
Snapshot of delinquency graph: https://snapshot.raintank.io/dashboard/snapshot/KqeO9cF1pcC5VAfutkGrE4oJ2rroDKx7?orgId=2
Please upload your logs to help with the investigation:
For plain logfiles:
Validator snapshot: https://gist.github.com/leoluk/9bd2a3a2eb2ec9f0ceb9ba3476ada154
Please do not copy the commands to Discord - refer to this issue instead.
Upload destination: http://logbin.certus.one:8080/Qui9iehu/solana-github-13958
(no need to reply on this issue once you uploaded logs)
The text was updated successfully, but these errors were encountered: