[YAML, or no] pdr bots shouldn't stop if RPC goes down briefly #465

Closed
trentmc opened this issue Dec 20, 2023 · 0 comments · Fixed by #491
Labels
Type: Bug Something isn't working

trentmc commented Dec 20, 2023

Background

Reported by @idiom-bytes in slack:

Hi, our web3 middleware [web3_config.py::Web3Config] that talks to the Sapphire RPC is raising exceptions while trying to get blocks. The exception is raised once the target number of retries has been used up.

This causes the pdr bots to throw an exception and stop running, so users have to maintain and operate the bots by hand. (See image below)

DoD

When we hit a known exception (e.g. a subgraph or RPC outage that might self-heal in 1-2 mins), I think we could wrap the call in an outer wait loop so that the bot doesn't crash and botters aren't forced to monitor and operate the bots.

I.e. "just set it, and forget it"
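
A minimal sketch of the outer-loop wait described above, not the fix that landed in #491. Everything here is an assumption for illustration: the helper name `call_with_outer_wait`, its parameters, the default wait times, and the choice of `ConnectionError`/`TimeoutError` as the "known" exceptions. The real set of exception types would depend on what the RPC client actually raises.

```python
import time
from typing import Callable, Tuple, Type, TypeVar

T = TypeVar("T")


def call_with_outer_wait(
    fn: Callable[[], T],
    transient: Tuple[Type[Exception], ...] = (ConnectionError, TimeoutError),
    wait_s: float = 30.0,
    max_wait_s: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fn() until it succeeds, sleeping between failed attempts.

    Hypothetical helper: only the exception types listed in `transient`
    are retried; any other exception propagates and stops the bot as before.
    """
    delay = wait_s
    while True:
        try:
            return fn()
        except transient as exc:
            # Known, possibly self-healing failure: log it, wait, try again.
            print(f"transient error ({exc!r}); retrying in {delay:.0f}s")
            sleep(delay)
            # Back off up to a ceiling so a long outage is not hammered.
            delay = min(delay * 2, max_wait_s)
```

A bot's main loop could wrap its block-fetching call in this helper so that a brief RPC outage delays an iteration instead of ending the process. Retrying only a fixed list of exception types keeps genuine bugs visible, since they still raise.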

Image from Web3Config


@trentmc trentmc added the Type: Bug Something isn't working label Dec 20, 2023
@trentmc trentmc changed the title [YAML] pdr bots shouldn't stop if RPC goes down briefly [YAML, or no] pdr bots shouldn't stop if RPC goes down briefly Dec 20, 2023