[feature]: Availability of UpdateAddHTLC ExtraData Across Restarts #7297

carlaKC · 2023-01-06T20:16:11Z

Background

The UpdateAddHTLC message in LND is extended with ExtraData which allows spec upgrades to transmit extra fields via a TLV stream. LND currently stores the extra data associated with a HTLC in our LogUpdates (where we store the full wire message), but not in channeldb.HTLC in our state machine. Understandably so, because there are no TLVs in use that need these fields and storing the full blob in the state machine would be unnecessary bloat.

Use Cases

There are a few upcoming specification proposals that start to use these fields:

Route blinding: a blinding_point is transmitted to peers as an UpdateAddHTLC tlv.
Paid Jamming Countermeasures: any jamming mitigation that requires "proof of work" from a sender before they can add a HTLC (upfront fees / anonymous routing tokens / proof of burn etc) will transmit this value in an UpdateAddHTLC tlv.
Local Reputation: peers that wish to communicate reputation information about a forwarded HTLC to mitigate long-hold jamming would use an UpdateAddHTLC tlv.

Availability of Extra Data

While working on route blinding, we started to take a look at the availability of this data across restarts (since it's not stored in our state machine). Using the terminology outlined in the spec, this section outlines the availability of data at various stages of the HTLC lifecycle and how we can recover it after restart (it at all).

This example walks through the full HTLC lifecycle forwarded from A -> B -> C from the perspective of B.

Incoming HTLC (A -> B)

Pending on the receiver

We have received an UpdateAddHTLC from our remote peer.
On Restart: We will forget the HTLC, because we do not ACKed it, and expect our remote peer to replay it.

In the receiver commitment:

We have received a CommitmentSignature from the remote peer that includes the HTLC.
On Restart: We will forget the HTLC, because we have not ACKed it, and expect our remote peer to replay it.

Irrevocably committed by the receiver:

We have sent a RevokeAndAck to the remote peer that revokes the state without the HTLC.

Our LocalCommitment is persisted in the openChannelBucket with the incoming channeldb.HTLC
The full LogUpdate is stored in the unsignedAckedUpdates bucket.

On Restart:

We will restore the incoming HTLC in our remoteUpdateLog from our local commitment

⚠️ The restored HTLC does not have any extra data attached, because it is restored from a channeldb.HTLC which strips those fields.
🔧 Workaround: We can restore this data from our unsignedAckedUpdates (see route blinding example)

In sender commitment

We have sent the remote peer a CommitSignature that includes the incoming HTLC.

LocalCommitment and unsignedAckedUpdates are stored as above.
The CommitDiff for the signed remote commitment is persisted in the openChannelBucket under the commitDIffKey. This diff includes the incoming HTLC stored as a channeldb.HTLC

On Restart: As above, we restore the HTLC to our remoteUpdateLog from our localCommitment. We can also use the workaround described above to restore any additional data attached to the HTLC.

Irrevocably committed by sender

We have received a RevokeAndAck from the remote peer that revokes state without the HTLC.**

The RemoteCommitment is persisted in the openChannelBucket with the incoming channeldb.HTLC
Our commitDiff and unsignedAckedUpdates have been cleared in the openChannelBucket
A LogUpdate for the incoming HTLC has been stores in our forwarding package under fwdPackagesKey

On Restart:

We will restore the incoming HTLC in our remoteUpdateLog
The add for the downstream link will be replayed with any additional data provided, because the forwarding package stores the full LogUpdate

⚠️ The restored HTLC on the incoming channel does not have any additional data attached to it, and it can no longer be restored from unsignedAckedUpdates. This is not a problem for forwarding the HTLC, but will become an issue if we want to use the data on HTLC settle or fail.

Outgoing HTLC (B -> C)

Pending on the receiver

We have sent an UpdateAddHTLC to our remote peer.

The HTLC is not yet saved in the state machine for the outgoing channel.
The forwarded htlc's open circuit has been persisted in under the circuitAddKey when the switch forwarded the HTLC to the outgoing link

On Restart:

The incoming link will resolve forwarding packages, reprocessing all remote adds.
The outgoing HTLC will extra data attached because it is restored from a LogUpdate.

In the receiver's commitment

We have sent a CommitmentSignature to the remote peer that includes the HTLC.

The forward is part of the outgoing link's keystones when we update our commitment, so the full circuit is persisted in the keystoneBkt.
We have persisted the pending CommitDiff saved under the commitDiffKey, including a LogUpdate for the added HTLC

On Restart:

The HTLC is still not saved in the outgoing channel's commitment state.
It will not be reforwarded by the incoming link because it has a keystone, so is dropped on forwarding.
We retrieve our pendingRemoteCommitDiff from disk and restorePendingLocalUpdates which restores the HTLC from a LogUpdate to our localUpdateLog
On channel re-establish we will re-transmit the UpdateAddHTLC to the remote peer.

Irrevocably committed by the receiver

We have received a RevokeAndAck from the remote peer that revokes the state without the HTLC.

When we received the revocation, the previous CommitDiff is stored as our new remote commitment state, and the CommitDiff is cleared.
The outgoing HTLC is stored as a channeldb.HTLC on that commitment.
We do not save the HTLC's LogUpdate in our remoteUnsignedLocalUpdates because they are restored from the remote commitment.

On restart:

The outgoing HTLC will be restored to our localUpdateLog from the remote commitment, so no extra data will be present.

⚠️ The restored HTLC does not have any extra data attached, because it is restored from a channeldb.HTLC which strips those fields. This is not a problem for forwarding the HTLC, since we have already sent the peer an UpdateAddHTLC, but will be an issue if we want to interact with the data on settle/fail.

From this point onwards, we lose the extra data associated with the HTLC after restart. If you ~~like complex switches and you cannot lie~~ want to see the details of how I walked through this all, my full notes on the life and times of a HTLC in lnd are here!

Options to Explore

Migrate channeldb.HTLC to a TLV stream, then store specific fields as they come into use in the protocol.
Restore extra data from forwarding packages

The text was updated successfully, but these errors were encountered:

carlaKC · 2023-01-06T20:20:23Z

Mostly opening this to surface the information - feel free to close immediately!

tl;dr:

We can add forwarding for route blinding with some pretty minimal modifications,
If we need to implement something else that needs UpdateAddHTlC's ExtraData beyond just locking in the HTLC add we're going to need to either populate our channel state from forwarding packages or bite the bullet and do a migration.

(I think, could easily have missed something glaringly obvious)

Roasbeef · 2023-01-06T22:52:13Z

Rather than just store the extra data, would think that we'd actually just parse out what we need in a structured manner to store elsewhere. IMO, any items in the extra data (TLV) should be considered required given they're attached to messages that drive the main channel state machine.

Route blinding: a blinding_point is transmitted to peers as an UpdateAddHTLC tlv.

Would have thought this were just part of the onion, or the EC point at the start would have be overloaded for this purpose 🤔. Otherwise the HMAC (w/ the Associated Data) wouldn't cover this data, like it does the amt+cltv+payment hash, could potentially a new free replay attack vector.

carlaKC · 2023-01-06T23:49:47Z

would think that we'd actually just parse out what we need in a structured manner

Yeah def, as we start to need specific fields pull them out and save them rather than try to store all the tlvs in all the places.

to store elsewhere.

Meaning create a totally new bucket and just drop what we need in there?

Would have thought this were just part of the onion

The first blinding point is in the onion payload (for the "introductory" node), then relaying nodes receive it in update_add_htlc. Not sure how that could open up a replay attack since modifying subsequent blinding points just makes those nodes unable to decrypt the onion / relay?

yyforyongyu · 2023-01-12T18:42:49Z

Probably a dump question, but can we ask the peer to send again? I think we do similar things for commitments, that based on the indexes, we may resend the last commitment, might as well filter out a list of htlcs and resend again(or we already do this now? need double check).

If we want to survive a restart, I think we could just save the logs into remoteUnsignedLocalUpdate or unsignedAckedUpdates by extracting the logic used to create them, from RevokeCurrentCommitment and the second part of updating remoteCommitChain.

With the extra data, agree with laolu here that we need a new bucket for it. I think this would be an extra layer on top of lnwallet.

Migrate channeldb.HTLC to a TLV stream, then store specific fields as they come into use in the protocol.

I think it's generally a good idea to use tlv.

Restore extra data from forwarding packages

I think this would be nice to avoid duplicate storage of the same info, but aren't they removed once the forwarding is finished?

Nice docs btw(why I'm not surprised🤓). I also drafted several docs here a while back and plan to change the confusing variable names.

carlaKC · 2023-01-12T21:16:56Z

Probably a dump question, but can we ask the peer to send again?

This would need a protocol change for something that's working fine (we're just not storing it), which I don't think we want (+ would ne non-trivial).

If we want to survive a restart, I think we could just save the logs into remoteUnsignedLocalUpdate or unsignedAckedUpdates by extracting the logic used to create them, from RevokeCurrentCommitment and the second part of updating remoteCommitChain.

iirc, both of these buckets are cleared out once we've locked in the new commitments, so once we've locked in the incoming + outgoing commitment we will no longer have access to the extra data? For things like reputation/upfront fees, we'll definitley need that info after that point.

With the extra data, agree with laolu here that we need a new bucket for it. I think this would be an extra layer on top of lnwallet.

If this were considered an acceptable approach, I'd def say that this is the path of least resistance. It seems a little taped together to not store it with channeldb.HTLC (since logically that's where it belongs) but it would make life much easier!

I think this would be nice to avoid duplicate storage of the same info, but aren't they removed once the forwarding is finished?

Yeah, wouldn't be my recommended approach but it would work if desparate.

Nice docs btw(why I'm not surprised🤓). I also drafted several docs here a while back and plan to change the confusing variable names.

❤️ same to you! Looks like a study group in the making if ya'll are still into that!

Roasbeef · 2023-01-13T00:57:15Z

Meaning create a totally new bucket and just drop what we need in there?

I think either the new bucket, or just the migration itself (leaning towarrds the migration). The migration may also be useful in the future for things like PTLCs, that'll require us to store more than just the payment hash per HTLC. The migration here may also be a bit tricky: we use the struct in a few different locations, at times storing something else directly after it within a stream. Ideally we can do the migration in an isolated change, so we can minimize the review surface area to make sure we get things right the first time around.

carlaKC · 2023-01-13T14:51:22Z

just the migration itself (leaning towarrds the migration)

Same, feels like it's somewhat inevitable? Better bite the bullet early rather than have a bunch of ancillary buckets that would need to be migrated/accounted for as legacy.

Ideally we can do the migration in an isolated change, so we can minimize the review surface area

def, will probably be quite a heavy migration since we'll need to overhaul every commitment for every channel :(

yyforyongyu · 2023-01-13T15:04:35Z

If we decide to go with the migration path, I think we could generalize the methods created here, where an iterator is provided so the migration could survive from a halt.

FYI will take a deeper look into the blinded path to provide more meaningful feedback here.

carlaKC · 2023-04-19T17:01:54Z

Any further thoughts on this @yyforyongyu @Roasbeef?

#7376 can work without this (with some workarounds), but I think it's far cleaner to bite the bullet and do the migration. If there are no objections, I'll start working on some code for it?

yyforyongyu · 2023-04-20T15:12:24Z

@carlaKC sorry it's been a while, will get back to you next week!

halseth · 2023-05-05T07:38:10Z

How would the migration look in terms of code complexity?

That being said, my feeling as that a new bucket for the extra data is not bad at all. I think that has been done before, and we can just add a subbucket nested under the existing one that hosts the TLV data (and existing HTLCs will just have no data in this bucket).

carlaKC · 2023-05-05T14:13:15Z

How would the migration look in terms of code complexity?

Code complexity pretty straightforward IMO, just inline encoding -> TLVs. The concern is more about the volume of data to be migrated - every HTLC, in every commitment, in every channel 😅

That being said, my feeling as that a new bucket for the extra data is not bad at all.

My original feeling was that we might as well bite the bullet since we'll want more flexible structure for PTLCs (eventually). But if we're reticient to do a big migration and @Roasbeef is okay with the extra bucket then I'm happy with that way (not something we haven't done before).

carlaKC · 2023-05-19T17:51:28Z

Opened up #7710 to demonstrate a nifty workaround for stashing extra data inline with the HTLCs (h/t @Roasbeef for the suggestion).

@yyforyongyu WDTY? I think this hits the nice inbetween of not being too hacky and saving us the trauma of a big migration.

Also looked into storing HTLCs in an extra bucket, but we store a lot of data inline (and in a few places) so it would need to be quite far away from HTLC serialization, so I don't think that's a great direction.

yyforyongyu · 2023-09-01T09:25:41Z

This is now closed by #7710 ?

carlaKC · 2023-09-01T14:12:00Z

This is now closed by #7710 ?

Indeed!

carlaKC mentioned this issue Jan 6, 2023

[feature]: Blinded Forwarding in LND #7298

Closed

Roasbeef added brainstorming Long term ideas/discussion/requests for feedback HTLC spec labels Jan 6, 2023

carlaKC mentioned this issue Jan 18, 2023

POC: Upfront Fees to Mitigate Channel Jamming #7339

Closed

carlaKC mentioned this issue Feb 1, 2023

multi: Forward Blinded Payments #7376

Closed

carlaKC mentioned this issue Apr 28, 2023

[feature]: Add custom TLV fields to HTLC interceptor #7650

Open

carlaKC mentioned this issue May 19, 2023

Channeldb: Store HTLC Extra TLVs in Onion Blob Varbytes #7710

Merged

saubyk assigned carlaKC Jun 15, 2023

carlaKC closed this as completed Sep 1, 2023

carlaKC mentioned this issue May 20, 2024

Enhance update_add_htlc with remote peer's custom records #8660

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[feature]: Availability of UpdateAddHTLC ExtraData Across Restarts #7297

[feature]: Availability of UpdateAddHTLC ExtraData Across Restarts #7297

carlaKC commented Jan 6, 2023 •

edited

carlaKC commented Jan 6, 2023 •

edited

Roasbeef commented Jan 6, 2023 •

edited

carlaKC commented Jan 6, 2023 •

edited

yyforyongyu commented Jan 12, 2023

carlaKC commented Jan 12, 2023

Roasbeef commented Jan 13, 2023

carlaKC commented Jan 13, 2023

yyforyongyu commented Jan 13, 2023

carlaKC commented Apr 19, 2023

yyforyongyu commented Apr 20, 2023

halseth commented May 5, 2023

carlaKC commented May 5, 2023 •

edited

carlaKC commented May 19, 2023

yyforyongyu commented Sep 1, 2023

carlaKC commented Sep 1, 2023

[feature]: Availability of UpdateAddHTLC ExtraData Across Restarts #7297

[feature]: Availability of UpdateAddHTLC ExtraData Across Restarts #7297

Comments

carlaKC commented Jan 6, 2023 • edited

Background

Use Cases

Availability of Extra Data

Incoming HTLC (A -> B)

Pending on the receiver

In the receiver commitment:

Irrevocably committed by the receiver:

In sender commitment

Irrevocably committed by sender

Outgoing HTLC (B -> C)

Pending on the receiver

In the receiver's commitment

Irrevocably committed by the receiver

Options to Explore

carlaKC commented Jan 6, 2023 • edited

Roasbeef commented Jan 6, 2023 • edited

carlaKC commented Jan 6, 2023 • edited

yyforyongyu commented Jan 12, 2023

carlaKC commented Jan 12, 2023

Roasbeef commented Jan 13, 2023

carlaKC commented Jan 13, 2023

yyforyongyu commented Jan 13, 2023

carlaKC commented Apr 19, 2023

yyforyongyu commented Apr 20, 2023

halseth commented May 5, 2023

carlaKC commented May 5, 2023 • edited

carlaKC commented May 19, 2023

yyforyongyu commented Sep 1, 2023

carlaKC commented Sep 1, 2023

carlaKC commented Jan 6, 2023 •

edited

carlaKC commented Jan 6, 2023 •

edited

Roasbeef commented Jan 6, 2023 •

edited

carlaKC commented Jan 6, 2023 •

edited

carlaKC commented May 5, 2023 •

edited