Add Fallback Option for Eth1 Nodes #8062

nisdas · 2020-12-07T11:49:11Z

What type of PR is this?

Feature Addition

What does this PR do? Why is it needed?

Adds the ability to have fallbacks for our eth1 endpoints.
Fix references across the repo.
Performs regular health checks to switch over to primary node in the event it comes back up.
Add unit test to test this feature.

Which issues(s) does this PR fix?

Fixes #7969

Other notes for review

beacon-chain/flags/base.go

beacon-chain/powchain/service.go

shayzluf · 2020-12-07T13:46:54Z

shared/cmd/helpers.go

-		}
-		if err := ctx.Set(flag.Name, web3endpoint); err != nil {
-			return errors.Wrapf(err, "could not set %s to %s", flag.Name, web3endpoint)
+	rawFlags := sliceutil.SplitCommaSeparated(ctx.StringSlice(flags.Name))


i think bootstrap nodes example uses a multiple flag and doesn't use SplitCommaSeparated c.BootstrapNodes = cliCtx.StringSlice(cmd.BootstrapNode.Name)

shayzluf · 2020-12-07T13:49:53Z

shared/cmd/helpers.go

+		case strings.HasPrefix(rawValue, "ws://"):
+		case strings.HasPrefix(rawValue, "wss://"):
+		default:
+			web3endpoint, err := fileutil.ExpandPath(rawValue)


not sure this code handles yaml file input
good example for this is in function registerP2P using readbootNodes

prysm/beacon-chain/node/node.go

Line 360 in 14e1f08

func (b *BeaconNode) registerP2P(cliCtx *cli.Context) error {

https://github.com/prysmaticlabs/prysm/blob/14e1f08208553292d0015f960e6821729a444cf0/beacon-chain/node/node.go#L347:6

mohamedmansour · 2020-12-07T16:04:22Z

@nisdas Thanks for doing this! The main issue with this design is that all the http endpoints are treated equally. We just want to provide a fallback endpoint in case the primary endpoint stopped working. This design will only change current endpoint if a failure occurs.

Would be nice if there was a health check on the endpoints so that if the primary endpoint is back up, it will automatically switch back to that. So instead of doing that check if the call failed, maybe another worker should be maintaining the health status of every endpoint and setting its current url.

Another design issue is that what if we have different authentication for each eth1 node, this doesn’t doesnt work here.

One of the main usecase for this feature is that we are running a local unauthenticated geth node and a remote infura authenticated node.

nisdas · 2020-12-07T17:09:55Z

Hey @mohamedmansour ,

Thanks for giving your thoughts on this issue, performing a continous health check will require quite a bit of shifting around of our code. For obvious reasons the service as of this moment is only meant to be running a single persistent eth1 connection. I will think a bit more on if its feasible to be running continous health check on the primary eth1 node without messing around too much with our other core eth1 code.

As for the latter issue, it isnt relevant to the main purpose of the PR as of the current moment prysm only supports unauthenticated connections to eth1 nodes so it is out of scope for the PR. However support for JWT auth can be added in with a follow up PR.

mohamedmansour · 2020-12-07T17:50:58Z

@nisdas prysm currently supports cert based eth1 nodes. With the current design of the PR if we supply two different eth1 endpoints, we have to use the same cert which most likely isn’t the case.

Regarding my first question, it is very important for the fallback url not to become primary eth1 node for a long time if the primary eth1 node recovered. Otherwise this solution would only be valuable for those who only have strictly remote eth1 services like alchemy (which should be just a few).

The idea of introducing a failover is to be meerly be just temporal, but in this case it will be permanent (which is not what the issue on github is about).

Thanks for replying :)

nisdas · 2020-12-08T03:00:38Z

prysm currently supports cert based eth1 nodes. With the current design of the PR if we supply two different eth1 endpoints, we have to use the same cert which most likely isn’t the case.

Currently prysm doesn't have the ability to provide a custom cert for an eth1 endpoint, we use the default certificate authorities to authenticate any upgraded https connections. From the view of the service there is no ability to provide a custom cert to an eth1 endpoint. This will also have to be designed along with JWT auth.

Regarding my first question, it is very important for the fallback url not to become primary eth1 node for a long time if the primary eth1 node recovered. Otherwise this solution would only be valuable for those who only have strictly remote eth1 services like alchemy (which should be just a few).

Understood :) , will take a look at this a bit more deeply then 👍

into addMultipleEndpoints

…bs/geth-sharding into addMultipleEndpoints

Co-authored-by: Shay Zluf <thezluf@gmail.com>

rkapka

I don't think changing the HTTPWeb3ProviderFlag to a slice is the best design. The problem is that if you specify the flag multiple times, it's not clear which endpoint is the primary one, and this is very important. Does the CLI library provide an ordering guarantee when using a slice flag?

A clearer approach IMO would be to keep the existing flag as it is, using it for the primary endpoint, and add a new FallbackHTTPWeb3Provider(s) flag for fallback endpoints, which can either be specified multiple times or as a comma-separated list of endpoints (adding s to the flag name in that case).

beacon-chain/powchain/service.go

rkapka · 2020-12-10T12:04:20Z

Another option would be to allow specifying a primary suffix e.g. http-web3provider=blah,primary and default to httpEndpoints[0] in case no endpoint is marked as primary, maybe also issuing a warning in this case. We should also handle the scenario where multiple endpoints are marked as primary (error? choose the first one?).

This solution will interfere with @shayzluf's authorization work, though.

shared/cmd/helpers.go

shared/cmd/helpers_test.go

rkapka · 2020-12-10T16:25:09Z

What about #8062 (comment)? Do you think it's worthwhile to unify this?

…bs/geth-sharding into addMultipleEndpoints

fix

6e2b6ef

nisdas requested a review from a team as a code owner December 7, 2020 11:49

nisdas requested review from farazdagi, shayzluf and rkapka December 7, 2020 11:49

nisdas added the Ready For Review A pull request ready for code review label Dec 7, 2020

nisdas added this to the v1.0.5 milestone Dec 7, 2020

nisdas added the Enhancement New feature or request label Dec 7, 2020

nisdas mentioned this pull request Dec 7, 2020

Multiple ETH1 endpoints #7969

Closed

shayzluf reviewed Dec 7, 2020

View reviewed changes

beacon-chain/flags/base.go Outdated Show resolved Hide resolved

shayzluf reviewed Dec 7, 2020

View reviewed changes

beacon-chain/powchain/service.go Show resolved Hide resolved

shayzluf reviewed Dec 7, 2020

View reviewed changes

nisdas added 2 commits December 7, 2020 22:22

fix tests and change back

14f3743

gaz

7b82726

nisdas and others added 6 commits December 10, 2020 00:43

change

bc8b5df

Merge branch 'develop' of https://github.com/prysmaticlabs/geth-sharding

2e99d48

into addMultipleEndpoints

Merge branch 'develop' into addMultipleEndpoints

43c2398

ready again

a070e34

Merge branch 'addMultipleEndpoints' of https://github.com/prysmaticla…

b842810

…bs/geth-sharding into addMultipleEndpoints

Update beacon-chain/flags/base.go

ea9a7e3

Co-authored-by: Shay Zluf <thezluf@gmail.com>

nisdas mentioned this pull request Dec 10, 2020

Support authorised access to web 3 providers #8075

Merged

rkapka requested changes Dec 10, 2020

View reviewed changes

beacon-chain/powchain/service.go Outdated Show resolved Hide resolved

radek's review

55c6417

rkapka reviewed Dec 10, 2020

View reviewed changes

shared/cmd/helpers.go Outdated Show resolved Hide resolved

rkapka reviewed Dec 10, 2020

View reviewed changes

shared/cmd/helpers_test.go Outdated Show resolved Hide resolved

rkapka reviewed Dec 10, 2020

View reviewed changes

shared/cmd/helpers_test.go Outdated Show resolved Hide resolved

rkapka and others added 6 commits December 10, 2020 17:27

Update shared/cmd/helpers.go

6ec3760

Update shared/cmd/helpers_test.go

2d4a138

Update shared/cmd/helpers_test.go

ec03fc8

Endpoint/endpoint

ee74194

Merge branch 'addMultipleEndpoints' of https://github.com/prysmaticla…

3a63c1f

…bs/geth-sharding into addMultipleEndpoints

Merge branch 'develop' into addMultipleEndpoints

4ef30d2

rkapka approved these changes Dec 11, 2020

View reviewed changes

Merge branch 'develop' into addMultipleEndpoints

aff2160

nisdas added the OK to merge label Dec 11, 2020

nisdas merged commit 99b3835 into develop Dec 11, 2020

delete-merged-branch bot deleted the addMultipleEndpoints branch December 11, 2020 11:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Fallback Option for Eth1 Nodes #8062

Add Fallback Option for Eth1 Nodes #8062

nisdas commented Dec 7, 2020 •

edited

shayzluf Dec 7, 2020

shayzluf Dec 7, 2020 •

edited

mohamedmansour commented Dec 7, 2020 •

edited

nisdas commented Dec 7, 2020

mohamedmansour commented Dec 7, 2020

nisdas commented Dec 8, 2020

rkapka left a comment

rkapka commented Dec 10, 2020 •

edited

rkapka commented Dec 10, 2020

Add Fallback Option for Eth1 Nodes #8062

Add Fallback Option for Eth1 Nodes #8062

Conversation

nisdas commented Dec 7, 2020 • edited

shayzluf Dec 7, 2020

Choose a reason for hiding this comment

shayzluf Dec 7, 2020 • edited

Choose a reason for hiding this comment

mohamedmansour commented Dec 7, 2020 • edited

nisdas commented Dec 7, 2020

mohamedmansour commented Dec 7, 2020

nisdas commented Dec 8, 2020

rkapka left a comment

Choose a reason for hiding this comment

rkapka commented Dec 10, 2020 • edited

rkapka commented Dec 10, 2020

nisdas commented Dec 7, 2020 •

edited

shayzluf Dec 7, 2020 •

edited

mohamedmansour commented Dec 7, 2020 •

edited

rkapka commented Dec 10, 2020 •

edited