docs: event hashing ADR 058 #5134

melekes · 2020-07-17T11:07:08Z

auto-comment · 2020-07-17T11:07:11Z

👋 Thanks for creating a PR!

Before we can merge this PR, please make sure that all the following items have been
checked off. If any of the checklist items are not applicable, please leave them but
write a little note why.

Wrote tests
Updated CHANGELOG_PENDING.md
Linked to Github issue with discussion and accepted design OR link to spec that describes this work.
Updated relevant documentation (docs/) and code comments
Re-reviewed Files changed in the Github PR explorer
Applied Appropriate Labels

Thank you for your contribution to Tendermint! 🚀

cwgoes

Looks fine, just a note that we should make sure it is easy in the SDK to specify per-event, e.g. using the ctx.EmitEvents() function, whether or not an event should be hashed into the header.

alexanderbez

Thanks for writing this formally @melekes. LGTM!

docs/architecture/adr-058-event-hashing.md

tac0turtle

LGTM

erikgrinaker · 2020-07-21T11:49:14Z

It might be nice to specify how they are to be hashed, e.g. Merkle tree vs. sequential, ordering, and hash algo.

melekes · 2020-07-22T06:15:32Z

cc @ebuchman

ebuchman · 2020-07-24T05:32:36Z

Thanks. I think this looks fine. We could add more clarity but it's spelled out in more detail in the corresponding spec changes.

Some loose thoughts: Having the event types in the tendermint consensus params does feel a bit weird though - is it really better than having an extra field in the event? We could probably do a better job of weighing the difference here. With the current proposal it means the ConsensusParams could get a lot bigger and there's more we need to track in the Tendermint State. With an extra field we'd have more flexibility, like an app could sometimes decide to include an event of some type and sometimes not, but maybe that's a bad thing, and too implicit. Though I'd expect apps to have some stateful representation of what events they are hashing.

Anyways, I'd also seek input from @ValarDragon @ethanfrey @liamsi @adlerjohn @mappum @zmanian if they have time to think about it... ideally in the future these kinds of changes will happen in more formal RFCs :)

melekes · 2020-07-24T06:03:59Z

Having the event types in the tendermint consensus params does feel a bit weird though

agree here

docs/architecture/adr-058-event-hashing.md

ValarDragon · 2020-07-24T19:32:39Z

Thanks for writing this up!

However I'm overall confused as to why adding / removing event names breaks "LastResultsHash" (and thereby, why we need this as a consensus parameter). Why do I need to know all the event types in order to verify or create proofs?

As I understand it, the procedure should be as follows:
Each Event has a type, and some other additional data. In each of these responses I am given a list of events that should be merkelized to get provability. To merkelize, I set leaf_i := serialize(events[i]). Then the verifier can still get lite client proofs against this as before, without needing to know the name of all events, they only must know the name of the event they care about.

Even if you want tproofs that you are given all events of a certain type or proofs of non-inclusion, you still don't need to know all the event names beforehand to verify or create such proofs. (This just requires the property that you can 'order' the event types, e.g. ordering them alphabetically, since you sort the event types before merkelizing)

In particular, I feel like the application should be able to define a new event type every block and use it immediately, without causing Tendermint issues.

ethanfrey · 2020-07-24T19:32:56Z

Complexity is always nice in the design phase (have your cake and eat it too), but leads to lots of tech debt, confusion, and often misleading docs down the line.

I see 3 simple options:

Keep the event hash out of the block header (same as 0.33)
Add all events hash to the block header (current behavior on master)
One global toggle to switch between (1) and (2) - a simpler version of this proposal

I think the whole event search/subscribe behavior needs to be rethought anyway with the Begin/EndBlockers.

My personal preference is: stick with (1) for 0.34. Go to (2) when sdk team and others in the ecosystem are ready.

If you are stuck in the position where some clients want (1) and some want (2) now, then my option (3) - global swtich can address that, and is much simpler than the full proposed solution

ebuchman · 2020-07-24T21:48:12Z

However I'm overall confused as to why adding / removing event names breaks "LastResultsHash" (and thereby, why we need this as a consensus parameter). Why do I need to know all the event types in order to verify or create proofs?

@ValarDragon you don't - the motivation here is that the event system as an API has a large surface area and is a significant integration point that may take a long time to stabilize. Hence you want to be able to make updates to the kinds of events that are fired and their fields/structure to enable newer/better integrations and pub-sub behaviour, but you don't want those kinds of changes to break the blockchain.

So there's this tension between using events as (1) a non-consensus critical pub-sub mechanism to integrate against and (2) a consensus critical "proof of action" system for light clients.

The proposal allows the application to control which subset of its event system is consensus critical (ie hashed into the header) and which are not. Those that are not can continue to evolve (add/remove fields, add new events, etc.) without breaking the chain.

My personal preference is: stick with (1) for 0.34. Go to (2) when sdk team and others in the ecosystem are ready.

@ethanfrey the challenge here is again around the idea that event systems will completely stabilize. In so far as events are used for pub-sub and as a major integration point, it seems likely that apps may always want to be able to emit new events that aren't consensus critical, even if some events are.

Another possibility here which is perhaps preferred for now until there's more stability/motivation/use-cases/demand here is to push this entirely application side and just have apps which want events to be provable to insert them into their application-side merkle trees. Of course this puts more pressure on their application state and makes event proving application specific, but it might help built up a better sense of use-cases and how this ought to ultimately be done by Tendermint.

docs/architecture/adr-058-event-hashing.md

ValarDragon · 2020-07-24T22:41:08Z

So there's this tension between using events as (1) a non-consensus critical pub-sub mechanism to integrate against and (2) a consensus critical "proof of action" system for light clients.

The proposal allows the application to control which subset of its event system is consensus critical (ie hashed into the header) and which are not. Those that are not can continue to evolve (add/remove fields, add new events, etc.) without breaking the chain.

Thanks for the expl, I think i get it now. Handling the filter seems like something the app should handle.

If these non-merkelized txs should be added to pre-existing pub-sub logic, perhaps it makes sense for there to be two lists of events in these responses? This way Tendermint doesn't need to filter where the events go, instead the SDK filters them before handing them off to ABCI.

ebuchman · 2020-07-25T02:21:56Z

This way Tendermint doesn't need to filter where the events go, instead the SDK filters them before handing them off to ABCI.

Right, the alternative proposal being considered was to add a hash bool field to the Event to indicate whether Tendermint should hash the event or not, and then it would be fully controlled by the app and Tendermint wouldn't know anything up front, but this has a downside of being somewhat less explicit about what events get hashed. With the list in the consensus params, it's pretty up front and clear, though it is a bit awkward ...

ValarDragon · 2020-07-25T03:22:05Z

I see, the downside being now the pub-sub module of Tendermint can't return nice lists for what events are provably queryable, and which aren't. Now I get the ADR, thanks for the expl! (Perhaps more context should be in the intro? Though I'm also the only one who was confused lol)

Instead of having a list of event strings that specify whether or not to merkelize, I'd prefer a KV-map if its easy w/ protobuf. The key being the string, and the value being a 'merkelization type' enum. Seems plausible to me that there me be multiple desired ways to merkelize/accumulate event attributes for different querying modes the app may want (in line with #1007 (comment))

ValarDragon · 2020-07-25T03:24:32Z

docs/architecture/adr-058-event-hashing.md

+`Index bool` EventAttribute's field. When `true`, Tendermint would hash it into
+the `LastResultsEvents`. The downside is that the logic is implicit and depends
+largely on the node's operator, who decides what application code to run. The
+above proposal makes it (the logic) explicit and easy to upgrade via


The easy to upgrade via governance bit doesn't seem true to me. Presumably the SDK would have a governance updatable filter.

Are u saying neither proposal makes it easy to upgrade on the SDK side? (cc @alexanderbez )

ValarDragon · 2020-07-25T03:27:04Z

docs/architecture/adr-058-event-hashing.md

+
+### How events are hashed
+
+Since we do not expect `BeginBlock` and `EndBlock` to contain many events, these


I'm a bit surprised by this. My impression of events is that the following 3 query cases would be most common:

Just track my tx's

Gather data for all tx's of a certain type

Gather some per-block stats about the txs

If (3) is indeed a notable use case for many lite clients, then perhaps these should be merkelized? (Or at least merkelized within end-block).

we can do that, sure

mappum · 2020-07-25T04:42:06Z

Another possibility here which is perhaps preferred for now until there's more stability/motivation/use-cases/demand here is to push this entirely application side and just have apps which want events to be provable to insert them into their application-side merkle trees. Of course this puts more pressure on their application state and makes event proving application specific, but it might help built up a better sense of use-cases and how this ought to ultimately be done by Tendermint.

Just want to give a +1 for this. I've always thought the beauty of Tendermint is the clean separation between app and consensus - in essence just a state hash and some transaction data. Anything else is just extra complexity, and I feel like things have slightly overfitted to the Cosmos SDK, leaving alternative stacks with more features to integrate or locking them into less flexible solutions.

In this case we are pretty agnostic to whatever the decision is, but if we were to use some sort of events we would probably keep them within our own merkle tree so we can do it in a way that fits with the rest of our stack.

tomtau · 2020-07-27T08:06:44Z

docs/architecture/adr-058-event-hashing.md

+
+## Appendix A. Alternative proposals
+
+The other proposal was to add `Hash bool` flag to the `Event`, similarly to


this alternative seems to be a simpler proposal IMHO?

melekes · 2020-07-27T09:31:57Z

Thanks all for your input! ❤️ It looks like leaving it to the application is a way to go for now. I'm going to revert some of the changes made in https://github.com/tendermint/tendermint/pull/4845/files, specifically adding BeginBlock#Events, EndBlock#Events and ResponseDeliverTx#Events. NOTE: GasWanted/GasUsed will not be reverted.

ebuchman · 2020-07-27T18:24:21Z

Do we have rationale for GasWanted / GasUsed or do we have a similar argument for leaving it to the application?

GasWanted would already be accessible and provable directly in the tx (at least eg. in an sdk tx and prob in others like an eth tx has a max gas.

GasUsed can currently be adjusted downwards without breaking the chain, and may in some cases be adjusted upwards. Not clear if that's a useful degree of freedom, but in any case we'd lose it by hashing.

I guess the more general question here is how coupled the ABCI interface ought to be with the light client provability ...

docs: event hashing ADR

619534c

Closes #5113

melekes self-assigned this Jul 17, 2020

melekes changed the title ~~docs: event hashing ADR~~ docs: event hashing ADR 058 Jul 17, 2020

melekes added 2 commits July 17, 2020 15:09

fix syntax

9edd549

remove GasWanted/Used

707df8e

tessr requested review from tac0turtle and alexanderbez July 17, 2020 12:24

cwgoes reviewed Jul 17, 2020

View reviewed changes

alexanderbez reviewed Jul 17, 2020

View reviewed changes

docs/architecture/adr-058-event-hashing.md Outdated Show resolved Hide resolved

docs/architecture/adr-058-event-hashing.md Outdated Show resolved Hide resolved

melekes added 3 commits July 20, 2020 09:58

fixes after Bez's review

9721bb9

add an example

2167555

add Alternative proposals section

ee2c5a9

melekes marked this pull request as ready for review July 20, 2020 06:48

melekes requested a review from tessr as a code owner July 20, 2020 06:48

add References

de689d5

tac0turtle approved these changes Jul 20, 2020

View reviewed changes

alexanderbez approved these changes Jul 20, 2020

View reviewed changes

Merge branch 'master' into anton/5113-events-hashing

590717b

erikgrinaker approved these changes Jul 21, 2020

View reviewed changes

note on events hashing and order

c2d3399

liamsi reviewed Jul 24, 2020

View reviewed changes

docs/architecture/adr-058-event-hashing.md Outdated Show resolved Hide resolved

liamsi reviewed Jul 24, 2020

View reviewed changes

docs/architecture/adr-058-event-hashing.md Outdated Show resolved Hide resolved

liamsi reviewed Jul 24, 2020

View reviewed changes

docs/architecture/adr-058-event-hashing.md Outdated Show resolved Hide resolved

liamsi reviewed Jul 24, 2020

View reviewed changes

docs/architecture/adr-058-event-hashing.md Show resolved Hide resolved

ebuchman reviewed Jul 24, 2020

View reviewed changes

docs/architecture/adr-058-event-hashing.md Outdated Show resolved Hide resolved

ValarDragon reviewed Jul 25, 2020

View reviewed changes

melekes added 3 commits July 27, 2020 11:26

fixes after Ismail and Ethan's comments

2f4c062

mark as declined

a3e331c

Merge branch 'master' into anton/5113-events-hashing

ad7f402

tomtau reviewed Jul 27, 2020

View reviewed changes

melekes merged commit fb4e00f into master Jul 27, 2020

melekes deleted the anton/5113-events-hashing branch July 27, 2020 08:40

tomtau mentioned this pull request Feb 16, 2022

Problem: don't support debug query result (backport: #328) crypto-org-chain/cronos#342

Merged

13 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: event hashing ADR 058 #5134

docs: event hashing ADR 058 #5134

melekes commented Jul 17, 2020 •

edited

Loading

auto-comment bot commented Jul 17, 2020

cwgoes left a comment

alexanderbez left a comment

tac0turtle left a comment

erikgrinaker commented Jul 21, 2020

melekes commented Jul 22, 2020

ebuchman commented Jul 24, 2020 •

edited

Loading

melekes commented Jul 24, 2020

ValarDragon commented Jul 24, 2020 •

edited

Loading

ethanfrey commented Jul 24, 2020 •

edited

Loading

ebuchman commented Jul 24, 2020

ValarDragon commented Jul 24, 2020 •

edited

Loading

ebuchman commented Jul 25, 2020

ValarDragon commented Jul 25, 2020 •

edited

Loading

ValarDragon Jul 25, 2020

melekes Jul 27, 2020

ValarDragon Jul 25, 2020

melekes Jul 27, 2020

mappum commented Jul 25, 2020

tomtau Jul 27, 2020

melekes Jul 27, 2020

melekes commented Jul 27, 2020

ebuchman commented Jul 27, 2020


		### How events are hashed

		Since we do not expect `BeginBlock` and `EndBlock` to contain many events, these


		## Appendix A. Alternative proposals

		The other proposal was to add `Hash bool` flag to the `Event`, similarly to

docs: event hashing ADR 058 #5134

docs: event hashing ADR 058 #5134

Conversation

melekes commented Jul 17, 2020 • edited Loading

auto-comment bot commented Jul 17, 2020

cwgoes left a comment

Choose a reason for hiding this comment

alexanderbez left a comment

Choose a reason for hiding this comment

tac0turtle left a comment

Choose a reason for hiding this comment

erikgrinaker commented Jul 21, 2020

melekes commented Jul 22, 2020

ebuchman commented Jul 24, 2020 • edited Loading

melekes commented Jul 24, 2020

ValarDragon commented Jul 24, 2020 • edited Loading

ethanfrey commented Jul 24, 2020 • edited Loading

ebuchman commented Jul 24, 2020

ValarDragon commented Jul 24, 2020 • edited Loading

ebuchman commented Jul 25, 2020

ValarDragon commented Jul 25, 2020 • edited Loading

ValarDragon Jul 25, 2020

Choose a reason for hiding this comment

melekes Jul 27, 2020

Choose a reason for hiding this comment

ValarDragon Jul 25, 2020

Choose a reason for hiding this comment

melekes Jul 27, 2020

Choose a reason for hiding this comment

mappum commented Jul 25, 2020

tomtau Jul 27, 2020

Choose a reason for hiding this comment

melekes Jul 27, 2020

Choose a reason for hiding this comment

melekes commented Jul 27, 2020

ebuchman commented Jul 27, 2020

melekes commented Jul 17, 2020 •

edited

Loading

ebuchman commented Jul 24, 2020 •

edited

Loading

ValarDragon commented Jul 24, 2020 •

edited

Loading

ethanfrey commented Jul 24, 2020 •

edited

Loading

ValarDragon commented Jul 24, 2020 •

edited

Loading

ValarDragon commented Jul 25, 2020 •

edited

Loading