Node does not return to a healthy state after the database is temporarily unreachable #1279

@fordN

Do you want to request a feature or report a bug?
bug

What is the current behavior?
The graph-node is unable to recover on its own after the database has been down for several minutes, for example during unexpected database maintenance.

If the current behavior is a bug, please provide the steps to reproduce and if possible a minimal demo of the problem.
To reproduce the issue, shut down the database or otherwise make it unreachable to the graph-node for 10 minutes.
When the database comes back online, the graph-node may continue to emit warnings from the ChainHeadUpdateListener (see below) and not receive new blocks, though it does not seem to happen every time.
The bad state can be resolved by restarting the graph-node.

WARN Error receiving message, error: IO error: communication with the server has desynchronized due to an earlier IO error, channel: chain_head_updates, component: Store > ChainHeadUpdateListener > NotificationListener

What is the expected behavior?
Successfully reestablish a connection with the database when it comes back online.
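
To illustrate the expected behavior, here is a minimal sketch of a reconnect loop with capped exponential backoff. It is not graph-node's actual code: `backoff_delay`, `connect_with_retry`, and the `connect` closure are hypothetical names, and the sketch assumes the listener discards a desynchronized connection and opens a fresh one instead of reusing it.

```rust
use std::thread::sleep;
use std::time::Duration;

/// Delay before the next attempt: `base * 2^attempt`, capped at `max`.
/// (Hypothetical helper, not part of graph-node.)
fn backoff_delay(attempt: u32, base: Duration, max: Duration) -> Duration {
    let factor = 2u32.saturating_pow(attempt);
    // `checked_mul` returns None on overflow; treat that as "use the cap".
    base.checked_mul(factor).map_or(max, |d| d.min(max))
}

/// Calls `connect` until it succeeds or `max_attempts` attempts have failed.
/// `connect` stands in for whatever opens a new database connection;
/// it is always called at least once.
fn connect_with_retry<C, E, F>(
    mut connect: F,
    max_attempts: u32,
    base: Duration,
    max: Duration,
) -> Result<C, E>
where
    F: FnMut() -> Result<C, E>,
{
    let mut attempt = 0;
    loop {
        match connect() {
            Ok(conn) => return Ok(conn),
            // Out of attempts: surface the last error to the caller.
            Err(e) if attempt + 1 >= max_attempts => return Err(e),
            Err(_) => {
                sleep(backoff_delay(attempt, base, max));
                attempt += 1;
            }
        }
    }
}
```

In a listener loop, a receive error like the one in the warning above would drop the current connection and call `connect_with_retry` again, so that a database outage of a few minutes ends in a new connection without a process restart.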
