Make `graph-node` tolerate chains not being available during startup #3937

lutter · 2022-09-14T20:33:41Z

Right now, when graph-node starts, it does some checks against each RPC endpoint; if those checks fail or time out, graph-node will mark the chain as not working and not use it anymore. That will also make all subgraphs that use that chain fail at startup. If the problem with the endpoint is transient, the only way to get graph-node to use it again is to restart it (at the danger that now some other chain has a transient issue)

The code needs to be changed such that graph-node is much more tolerant to such transient issues and automatically retries using a chain that caused trouble during startup. As part of solving the issue, we should also produce documentation that describes what is expected of an endpoint before we will use it, and a graphman command that allows checking any given endpoint by going through its startup sequence. Additionally, there needs to be some way to figure out which of the configured endpoints graph-node considers usable/not usable at any given point in time.

The text was updated successfully, but these errors were encountered:

matthewdarwin · 2022-09-14T23:20:15Z

Sounds like a great improvement.

Please also expose this knowledge via Prometheus monitoring wherever possible/reasonable.

paymog · 2022-10-27T12:53:21Z

I just filed #4115 which seems possibly related to this. Does graph-node mark a chain as not working if a firehose provider goes down after successful startup?

paymog · 2022-10-31T13:54:17Z

In addition to toleration of chains not being available during startup, it would be amazing to tolerate chains which go down during while the graph-node is running. Even better I think would be tolerating individual RPC/Firehose providers not being available instead of marking a chain as dead if a single provider is down.

github-actions · 2023-04-30T00:19:47Z

Looks like this issue has been open for 6 months with no activity. Is it still relevant? If not, please remember to close it.

leoyvens · 2023-07-13T09:32:31Z

#4754 should help with firehose providers, by allowing them to retry for 30 secs before giving up.

paymog · 2023-07-13T10:59:45Z

@leoyvens does #4754 also help with #4323?

EDIT: and if so, can/should we make the 30 seconds configurable?

azf20 · 2023-08-03T15:32:18Z

@paymog I think it will help, but this is more targeted #4778

#3937

azf20 mentioned this issue Oct 30, 2022

Indexing completely fails if firehose goes down #4115

Open

github-actions bot added the Stale label Apr 30, 2023

azf20 removed the Stale label May 10, 2023

azf20 mentioned this issue May 10, 2023

Intermittent failure to detect a provider with trace support #3204

Closed

fordN assigned mangas Apr 9, 2024

mangas added a commit that referenced this issue Apr 11, 2024

Remove provider checks at startup

55671da

#3937

mangas linked a pull request Apr 11, 2024 that will close this issue

Remove provider checks at startup #5337

Open

mangas added a commit that referenced this issue Apr 11, 2024

Remove provider checks at startup

eda609d

#3937

mangas added a commit that referenced this issue Apr 19, 2024

Remove provider checks at startup

bf5aa3c

#3937

mangas added a commit that referenced this issue Apr 22, 2024

Remove provider checks at startup

1c43770

#3937

mangas added a commit that referenced this issue May 8, 2024

Remove provider checks at startup

6f964f0

#3937

mangas added a commit that referenced this issue May 10, 2024

Remove provider checks at startup

3643fda

#3937

mangas added a commit that referenced this issue May 13, 2024

Remove provider checks at startup

a72212a

#3937

mangas added a commit that referenced this issue May 17, 2024

Remove provider checks at startup

b9dbf1c

#3937

mangas added a commit that referenced this issue May 20, 2024

Remove provider checks at startup

9ab52dc

#3937

mangas added a commit that referenced this issue May 22, 2024

Remove provider checks at startup

b97ffec

#3937

mangas added a commit that referenced this issue May 24, 2024

Remove provider checks at startup

b69e2b0

#3937

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make `graph-node` tolerate chains not being available during startup #3937

Make `graph-node` tolerate chains not being available during startup #3937

lutter commented Sep 14, 2022

matthewdarwin commented Sep 14, 2022

paymog commented Oct 27, 2022

paymog commented Oct 31, 2022

github-actions bot commented Apr 30, 2023

leoyvens commented Jul 13, 2023 •

edited

paymog commented Jul 13, 2023 •

edited

azf20 commented Aug 3, 2023

Make graph-node tolerate chains not being available during startup #3937

Make graph-node tolerate chains not being available during startup #3937

Comments

lutter commented Sep 14, 2022

matthewdarwin commented Sep 14, 2022

paymog commented Oct 27, 2022

paymog commented Oct 31, 2022

github-actions bot commented Apr 30, 2023

leoyvens commented Jul 13, 2023 • edited

paymog commented Jul 13, 2023 • edited

azf20 commented Aug 3, 2023

Make `graph-node` tolerate chains not being available during startup #3937

Make `graph-node` tolerate chains not being available during startup #3937

leoyvens commented Jul 13, 2023 •

edited

paymog commented Jul 13, 2023 •

edited