Bws internal error fix #6806

Merged
merged 7 commits into from Apr 4, 2024

Conversation

jframe
Contributor

@jframe jframe commented Mar 26, 2024

PR description

Fixes a backward sync (BWS) issue where we are unable to import blocks for some hashes. We currently get this log at the end of the BWS:

{"@timestamp":"2024-01-31T10:58:00,982","level":"WARN","thread":"ForkJoinPool.commonPool-worker-374","class":"MergeCoordinator","message":"Internal error, backward sync completed but failed to import finalized block 0xd12ec91829da5af65ba80696d035bac054b8aac7cfcf60c983c12a5bcb1b926e","throwable":""}

This is caused by adding hashes before we have finished the initial sync and are ready to do a BWS.

Also moved the log message "Appending new head block hash {} to backward sync" into BackwardSyncContext so that it is only logged when we are actually going to add the new block hash to the backward sync, since we may not kick off the BWS if the initial sync has not finished.
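
For readers skimming the change, here is a minimal sketch of the guard described above. It is not the code from this PR: the class, method, and field names are hypothetical stand-ins, and only the log message text comes from the description. It assumes the context can tell whether the initial sync has finished.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

// Hypothetical stand-in for illustration only; not Besu's actual BackwardSyncContext.
class BackwardSyncContextSketch {
  private final Deque<String> pendingHashes = new ArrayDeque<>();
  // Stands in for a real logger so the behaviour is easy to inspect.
  private final List<String> logLines = new ArrayList<>();
  private boolean initialSyncDone = false;

  // Assumed hook: called once the initial sync has finished.
  void markInitialSyncDone() {
    initialSyncDone = true;
  }

  // Queues the hash only when a backward sync can actually start.
  // Returns true if the hash was queued.
  boolean maybeAppendHeadHash(final String blockHash) {
    if (!initialSyncDone) {
      // Not ready for a BWS yet: skip the hash and do not log.
      return false;
    }
    pendingHashes.add(blockHash);
    logLines.add("Appending new head block hash " + blockHash + " to backward sync");
    return true;
  }

  List<String> pendingHashes() {
    return List.copyOf(pendingHashes);
  }

  List<String> logLines() {
    return List.copyOf(logLines);
  }
}
```

In this sketch, a hash offered before `markInitialSyncDone()` is dropped silently, so nothing stale is left queued for the later BWS and the log line only appears when the hash is really appended.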

Fixed Issue(s)

fixes #6527

Thanks for sending a pull request! Have you done the following?

  • Checked out our contribution guidelines?
  • Considered documentation and added the doc-change-required label to this PR if updates are required.
  • Considered the changelog and included an update if required.
  • For database changes (e.g. KeyValueSegmentIdentifier) considered compatibility and performed forwards and backwards compatibility tests

Locally, you can run these tests to catch failures early:

  • unit tests: ./gradlew build
  • acceptance tests: ./gradlew acceptanceTest
  • integration tests: ./gradlew integrationTest
  • reference tests: ./gradlew ethereum:referenceTests:referenceTests

Signed-off-by: Jason Frame <jason.frame@consensys.net>
This reverts commit bf07f43.

Signed-off-by: Jason Frame <jason.frame@consensys.net>
@jframe jframe requested a review from garyschulte March 26, 2024 01:51
…t is only logged when are actually adding it

Signed-off-by: Jason Frame <jason.frame@consensys.net>
Contributor

@siladu siladu left a comment

Was adding these hashes during sync providing any benefit? (if it was, it was probably before this change... ed1480b)

When and how Besu receives FCU either during sync or during node recovery (e.g. after some besu downtime) varies between every CL.
Has sync and recovery been tested for each CL?
I think holesky would suffice if you haven't and decided to.

Happy to be convinced it doesn't need all this testing if you're confident it won't break anything :)

@jframe
Contributor Author

jframe commented Apr 3, 2024

Was adding these hashes during sync providing any benefit? (if it was, it was probably before this change... ed1480b)

Don't think it was. Can't think of any benefit as the hashes would be out of date before we finished the sync.

When and how Besu receives FCU either during sync or during node recovery (e.g. after some besu downtime) varies between every CL. Has sync and recovery been tested for each CL? I think holesky would suffice if you haven't and decided to.

Haven't done that. That's a good idea. Will test some syncs with different CLs on Holesky.

Happy to be convinced it doesn't need all this testing if you're confident it won't break anything :)

@jframe
Contributor Author

jframe commented Apr 4, 2024

Was adding these hashes during sync providing any benefit? (if it was, it was probably before this change... ed1480b)

Don't think it was. Can't think of any benefit as the hashes would be out of date before we finished the sync.

When and how Besu receives FCU either during sync or during node recovery (e.g. after some besu downtime) varies between every CL. Has sync and recovery been tested for each CL? I think holesky would suffice if you haven't and decided to.

Haven't done that. That's a good idea. Will test some syncs with different CLs on Holesky.

Happy to be convinced it doesn't need all this testing if you're confident it won't break anything :)

Tested Besu/Lighthouse, Besu/Lodestar, Besu/Nimbus, Besu/Prysm and Besu/Teku on Holesky.

@jframe jframe merged commit a7b10db into hyperledger:main Apr 4, 2024
42 checks passed
@jframe jframe deleted the bws_internal_error_fix branch April 4, 2024 07:33
amsmota pushed a commit to Citi/besu that referenced this pull request Apr 16, 2024
Signed-off-by: Jason Frame <jason.frame@consensys.net>
Signed-off-by: amsmota <antonio.mota@citi.com>
macfarla pushed a commit to macfarla/besu that referenced this pull request Apr 26, 2024
Signed-off-by: Jason Frame <jason.frame@consensys.net>
@macfarla macfarla mentioned this pull request Apr 26, 2024
8 tasks
Development

Successfully merging this pull request may close these issues.

Internal error, backward sync completed but failed to import finalized block
3 participants