Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

identify/: Don't fail on unknown multiaddr protocol #3244

Closed
mxinden opened this issue Dec 14, 2022 · 21 comments
Closed

identify/: Don't fail on unknown multiaddr protocol #3244

mxinden opened this issue Dec 14, 2022 · 21 comments
Labels
bug difficulty:easy priority:important The changes needed are critical for libp2p, or are blocking another project

Comments

@mxinden
Copy link
Member

mxinden commented Dec 14, 2022

Summary

libp2p-identify currently discards the entire identify payload of a remote peer, if that payload contains a multiaddr it can not parse.

Expected behaviour

When parsing an identify payload of a remote peer with an unknown multiaddr protocol (e.g. quic-v1/ or webrtc/) log the parsing error, but don't discard the entire payload.

Actual behaviour

In libp2p-identify's parsing logic, it returns an error for the entire payload, in case it can not parse a multiaddr.

let listen_addrs = {
let mut addrs = Vec::new();
for addr in msg.listen_addrs.into_iter() {
addrs.push(parse_multiaddr(addr)?);
}
addrs
};

This is especially relevant when rolling out a new protocol to a live network. Say that most nodes of a network run on an implementation version v1. Say that the multiaddr implementation is not aware of the webrtc/ protocol. Say that a new version (v2) is rolled out to the network with support for the webrtc/ protocol, listening via webrtc/ by default. In such case all v1 nodes would discard all identify payloads of v2 nodes, given that the v2 identify payloads would contain the webrtc/ protocol in their listen_addr addresses.

//CC @dignifiedquire for the role out of quic-v1/ and webtransport/ on IPFS
//CC @melekes for the role out of webrtc/ and quic-v1/ on Polkadot
//CC @divagant-martian and @AgeManning in case you experiment with quic-v1/ on lighthouse

Possible Solution

Release patch versions of libp2p-identify which log the parsing failure, but don't discard the entire identify payload.

Version

libp2p <=v0.50.0

Misc

Would you like to work on fixing this bug?

Yes, unless someone else has capacity to do so in the next couple of days.

@mxinden mxinden added bug priority:important The changes needed are critical for libp2p, or are blocking another project difficulty:easy labels Dec 14, 2022
@rkuhn
Copy link
Contributor

rkuhn commented Dec 14, 2022

In addition to this fix — which I fully agree with — I usually add a FutureCompat variant to all enums representing protocol elements that may be extended in the future (which is basically every enum that gets (de)serialised, unless Moses literally brought the protocol with him from Mt Sinaï). In this particular case, that variant could even contain the string value naming the locally unknown protocol.

The current approach in multiaddr does not foresee such a feature, do you think it makes sense to add a parsing facility that accepts syntactically valid but semantically unknown strings?

EDIT: adding a use-case: forwarding multiaddrs should be fine as long as they’re syntactically valid, even when we don’t (yet) understand them.

mxinden added a commit to mxinden/rust-libp2p that referenced this issue Dec 14, 2022
With this commit `libp2p-identify` no longer discards the whole identify payload in case a listen
addr of the remote node is invalid, but instead logs the failure, skips the invalid multiaddr and
parses the remaining identify payload.

> This is especially relevant when rolling out a new protocol to a live network. Say that most nodes
> of a network run on an implementation version v1. Say that the `multiaddr` implementation is not
> aware of the `webrtc/` protocol. Say that a new version (v2) is rolled out to the network with
> support for the `webrtc/` protocol, listening via `webrtc/` by default. In such case all v1 nodes
> would discard all identify payloads of v2 nodes, given that the v2 identify payloads would contain
> the `webrtc/` protocol in their `listen_addr` addresses.

See libp2p#3244 for details.
@thomaseizinger
Copy link
Contributor

In addition to this fix — which I fully agree with — I usually add a FutureCompat variant to all enums representing protocol elements that may be extended in the future (which is basically every enum that gets (de)serialised, unless Moses literally brought the protocol with him from Mt Sinaï). In this particular case, that variant could even contain the string value naming the locally unknown protocol.

The current approach in multiaddr does not foresee such a feature, do you think it makes sense to add a parsing facility that accepts syntactically valid but semantically unknown strings?

I would have to check but I think we don't have a generic TLV encoding here so we don't know, how long the unknown protocol is.

For example, assume tcp is unknown. TCP looks like this: /tcp/1234. Would this parse as two unknown protocols?

In any case, I think this warrants opening an issue over at multiaddr.

@altonen
Copy link

altonen commented Dec 15, 2022

Is there any workaround for this, like a possibility to retroactively attempt to upgrade the connection with these possibly unsupported protocols without causing the connection to be closed?

@rkuhn
Copy link
Contributor

rkuhn commented Dec 16, 2022

In any case, I think this warrants opening an issue over at multiaddr.

done: multiformats/rust-multiaddr#74

mxinden added a commit that referenced this issue Dec 17, 2022
With this commit `libp2p-identify` no longer discards the whole identify payload in case a listen
addr of the remote node is invalid, but instead logs the failure, skips the invalid multiaddr and
parses the remaining identify payload.

> This is especially relevant when rolling out a new protocol to a live network. Say that most nodes
> of a network run on an implementation version v1. Say that the `multiaddr` implementation is not
> aware of the `webrtc/` protocol. Say that a new version (v2) is rolled out to the network with
> support for the `webrtc/` protocol, listening via `webrtc/` by default. In such case all v1 nodes
> would discard all identify payloads of v2 nodes, given that the v2 identify payloads would contain
> the `webrtc/` protocol in their `listen_addr` addresses.

See #3244 for details.
@mxinden
Copy link
Member Author

mxinden commented Dec 17, 2022

#3246 is merged and libp2p-identify v0.41.1 is published and thus the libp2p v0.50 family is patched.

In case folks would like me to backport this patch to the libp2p v0.49 family, please comment here.

Also note that we face the same issue in at least one other protocol, namely libp2p-kad:

let mut addrs = Vec::with_capacity(peer.addrs.len());
for addr in peer.addrs.into_iter() {
let as_ma = Multiaddr::try_from(addr).map_err(invalid_data)?;
addrs.push(as_ma);
}

@mxinden
Copy link
Member Author

mxinden commented Dec 19, 2022

Is there any workaround for this, like a possibility to retroactively attempt to upgrade the connection with these possibly unsupported protocols without causing the connection to be closed?

I don't understand your comment @altonen. Would you mind rephrasing it?

@altonen
Copy link

altonen commented Dec 19, 2022

Problem for us is that parachains can be several months behind the latest client release meaning if we'd want to roll out QUIC and WebRTC soon, large portion of the network wouldn't be able to identify these new nodes which would be a problem until the network has updated to the libp2p version that has this fix in. We're working on a workaround which would temporarily split the Identify into two phases and in which these two new protocols are advertised in a second Identify push, allowing all nodes to Identify each other while also allowing nodes supporting these new protocols to exchange these new listening endpoints.

@thomaseizinger
Copy link
Contributor

thomaseizinger commented Dec 20, 2022

We're working on a workaround which would temporarily split the Identify into two phases and in which these two new protocols are advertised in a second Identify push, allowing all nodes to Identify each other while also allowing nodes supporting these new protocols to exchange these new listening endpoints.

That sounds like a clever way of doing it. Unfortunately, we didn't spot this problem earlier so I can't think of a way of having older deployments not fail here. To support your workaround, we'd probably need a configuration flag on identity::Config to allow filtering of the sent listen addresses.

@mxinden
Copy link
Member Author

mxinden commented Dec 21, 2022

that parachains can be several months behind the latest client release

In case rolling out patch releases is an option, we can backport the patch to any previous libp2p release. The patch is very self-contained and the code hasn't changed since 2018. Thus this is not a large effort.

@altonen
Copy link

altonen commented Dec 21, 2022

@thomaseizinger @mxinden

Thanks for your replies. If you could backport it to some older version of libp2p that would be great, then we wouldn't have to modify Identify temporarily for our needs. The plan was to keep the workaround inside Substrate anyway but having the fix in an older version of libp2p would be perfect.

@thomaseizinger
Copy link
Contributor

Which version(s) of libp2p do you need the backport for?

@altonen
Copy link

altonen commented Dec 21, 2022

Could you port it to all the way to v0.43.0?

@thomaseizinger
Copy link
Contributor

Do you mean all versions until then or just 0.43.0?

@altonen
Copy link

altonen commented Dec 22, 2022

All versions, sorry for confusion

Edit: We don't need the backport all the way to 0.43.0, 0.45.1 works for us just fine.

@jxs
Copy link
Member

jxs commented Dec 22, 2022

#3246 is merged and libp2p-identify v0.41.1 is published and thus the libp2p v0.50 family is patched.

In case folks would like me to backport this patch to the libp2p v0.49 family, please comment here.

Also note that we face the same issue in at least one other protocol, namely libp2p-kad:

let mut addrs = Vec::with_capacity(peer.addrs.len());
for addr in peer.addrs.into_iter() {
let as_ma = Multiaddr::try_from(addr).map_err(invalid_data)?;
addrs.push(as_ma);
}

should we open an issue for this?

mxinden added a commit to mxinden/rust-libp2p that referenced this issue Dec 23, 2022
With this commit `libp2p-kad` no longer discards the whole peer payload in case an addr is invalid,
but instead logs the failure, skips the invalid multiaddr and parses the remaining payload.

See libp2p#3244 for details.
mergify bot pushed a commit that referenced this issue Dec 24, 2022
With this commit `libp2p-identify` no longer discards the whole identify payload in case a listen addr of the remote node is invalid, but instead logs the failure, skips the invalid multiaddr and parses the remaining identify payload.

This is especially relevant when rolling out a new protocol to a live network. Say that most nodes of a network run on an implementation version v1. Say that the `multiaddr` implementation is not aware of the `webrtc/` protocol. Say that a new version (v2) is rolled out to the network with support for the `webrtc/` protocol, listening via `webrtc/` by default. In such case all v1 nodes would discard all identify payloads of v2 nodes, given that the v2 identify payloads would contain the `webrtc/` protocol in their `listen_addr` addresses.

See #3244 for details.
mxinden added a commit that referenced this issue Dec 26, 2022
With this commit `libp2p-kad` no longer discards the whole peer payload in case an addr is invalid,
but instead logs the failure, skips the invalid multiaddr and parses the remaining payload.

See #3244 for details.

Co-authored-by: Thomas Eizinger <thomas@eizinger.io>
mxinden added a commit to mxinden/rust-libp2p that referenced this issue Dec 26, 2022
With this commit `libp2p-kad` no longer discards the whole peer payload in case an addr is invalid,
but instead logs the failure, skips the invalid multiaddr and parses the remaining payload.

See libp2p#3244 for details.

Co-authored-by: Thomas Eizinger <thomas@eizinger.io>
mergify bot pushed a commit that referenced this issue Dec 27, 2022
With this commit `libp2p-kad` no longer discards the whole peer payload in case an addr is invalid, but instead logs the failure, skips the invalid multiaddr and parses the remaining payload.

See #3244 for details.

Co-authored-by: Thomas Eizinger <thomas@eizinger.io>
mxinden added a commit to mxinden/rust-libp2p that referenced this issue Jan 2, 2023
With this commit `libp2p-dcutr` no longer discards the whole remote payload in case an addr is
unparsable, but instead logs the failure and skips the unparsable multiaddr.

See libp2p#3244 for details.
@mxinden
Copy link
Member Author

mxinden commented Jan 6, 2023

Edit: We don't need the backport all the way to 0.43.0, 0.45.1 works for us just fine.

Note that these are 4 crates (identify, kad, dcutr, autonat) across 6 versions (0.45 - 0.50), thus 24 releases in total. Do you really need all those backports? In other words, are these outdated parachains going to upgrade version by version? Note that in case they do upgrade version by version, and do so across all nodes in a parachain, only the version before rolling out a new transport needs this set of patches.

In case you still need all those backports, would you mind creating appropriate backport pull requests like #3246 targeting the release branches?

mxinden added a commit that referenced this issue Jan 11, 2023
With this commit `libp2p-dcutr` no longer discards the whole remote payload in case an addr is
unparsable, but instead logs the failure and skips the unparsable multiaddr.

See #3244 for details.
mxinden added a commit to mxinden/rust-libp2p that referenced this issue Jan 11, 2023
With this commit `libp2p-dcutr` no longer discards the whole remote payload in case an addr is
unparsable, but instead logs the failure and skips the unparsable multiaddr.

See libp2p#3244 for details.
mergify bot pushed a commit that referenced this issue Jan 12, 2023
With this commit `libp2p-dcutr` no longer discards the whole remote payload in case an addr is unparsable, but instead logs the failure and skips the unparsable multiaddr.

See #3244 for details.
mxinden added a commit to mxinden/rust-libp2p that referenced this issue Jan 19, 2023
With this commit `libp2p-autonat` no longer discards the whole remote payload in case an addr is
unparsable, but instead logs the failure and skips the unparsable multiaddr.

See libp2p#3244 for details.
mxinden added a commit to mxinden/rust-libp2p that referenced this issue Jan 19, 2023
With this commit `libp2p-autonat` no longer discards the whole remote payload in case an addr is
unparsable, but instead logs the failure and skips the unparsable multiaddr.

See libp2p#3244 for details.
mxinden added a commit to mxinden/rust-libp2p that referenced this issue Jan 19, 2023
With this commit `libp2p-autonat` no longer discards the whole remote payload in case an addr is
unparsable, but instead logs the failure and skips the unparsable multiaddr.

See libp2p#3244 for details.
mxinden added a commit that referenced this issue Jan 20, 2023
With this commit `libp2p-autonat` no longer discards the whole remote payload in case an addr is
unparsable, but instead logs the failure and skips the unparsable multiaddr.

See #3244 for details.
jxs pushed a commit to jxs/rust-libp2p that referenced this issue Jan 20, 2023
With this commit `libp2p-autonat` no longer discards the whole remote payload in case an addr is
unparsable, but instead logs the failure and skips the unparsable multiaddr.

See libp2p#3244 for details.
mergify bot pushed a commit that referenced this issue Jan 24, 2023
With this commit `libp2p-autonat` no longer discards the whole remote payload in case an addr is unparsable, but instead logs the failure and skips the unparsable multiaddr.

See #3244 for details.
@jxs
Copy link
Member

jxs commented Jan 30, 2023

this one is done for both identify and autonat right?

@altonen
Copy link

altonen commented Feb 9, 2023

Hi @mxinden sorry for late repy

We had a discussion internally and decided that it is easiest for us to update libp2p, release a node, wait for a few releases and only then release QUIC/WebRTC. In other words, we don't need backports, sorry for the trouble

@mxinden
Copy link
Member Author

mxinden commented Feb 9, 2023

this one is done for both identify and autonat right?

Right. This is now done for identify, autonat, kad and dcutr. Closing here. Thanks for the reminder @jxs.

@mxinden
Copy link
Member Author

mxinden commented Feb 9, 2023

wait for a few releases and only then release QUIC/WebRTC

Note @altonen that one release is enough, e.g. updating to v0.51.0 and then rolling out QUIC and WebRTC works.

@altonen
Copy link

altonen commented Feb 9, 2023

Yeah that is our current plan, updating to v0.51.0 and waiting for couple of releases

@mxinden mxinden closed this as completed Feb 19, 2023
umgefahren pushed a commit to umgefahren/rust-libp2p that referenced this issue Mar 8, 2024
With this commit `libp2p-autonat` no longer discards the whole remote payload in case an addr is unparsable, but instead logs the failure and skips the unparsable multiaddr.

See libp2p#3244 for details.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug difficulty:easy priority:important The changes needed are critical for libp2p, or are blocking another project
Projects
None yet
Development

No branches or pull requests

5 participants