evidence: ignore when a peer sends committed evidence #5574

cmwaters · 2020-10-26T18:50:17Z

Description

What I am observing seems to only be occurring with full nodes who are persistent peers with a validator. If the validator witnesses duplicate votes it adds it to the evidence pool and broadcasts it. When the evidence is committed the evidence pool deletes the evidence both in the store and in the clist, However a couple of heights later the validator broadcasts the evidence that should no longer exist to the full node that it is persistent peers with. This seems to only occur once.

I am not familiar enough with the concurrent lists to be able to decipher what exactly is going on (i.e. it might be that the element is being incorrectly detached) so I have stopped returning an error when a peer sends committed evidence. This seems sensible in any case because nodes notion of whether evidence is committed or not is updated at different times. This will also stop the node from disconnecting with the peer.

UPDATE: I believe I have identified the problem here

Closes: #5560

codecov · 2020-10-26T18:53:47Z

Codecov Report

Merging #5574 into master will decrease coverage by 0.07%.
The diff coverage is 58.82%.

@@            Coverage Diff             @@
##           master    #5574      +/-   ##
==========================================
- Coverage   61.08%   61.01%   -0.08%     
==========================================
  Files         263      263              
  Lines       23751    23754       +3     
==========================================
- Hits        14508    14493      -15     
- Misses       7761     7784      +23     
+ Partials     1482     1477       -5

Impacted Files	Coverage Δ
evidence/verify.go	`81.37% <ø> (+2.46%)`	⬆️
evidence/reactor.go	`59.59% <55.55%> (+4.59%)`	⬆️
evidence/pool.go	`69.75% <62.50%> (-0.44%)`	⬇️
consensus/reactor.go	`74.08% <0.00%> (-3.28%)`	⬇️
blockchain/v0/pool.go	`76.01% <0.00%> (-2.22%)`	⬇️
blockchain/v2/reactor.go	`33.58% <0.00%> (-1.50%)`	⬇️
statesync/syncer.go	`78.48% <0.00%> (-0.85%)`	⬇️
p2p/pex/pex_reactor.go	`78.27% <0.00%> (-0.60%)`	⬇️
consensus/state.go	`68.21% <0.00%> (-0.19%)`	⬇️
blockchain/v0/reactor.go	`62.56% <0.00%> (+0.98%)`	⬆️
... and 7 more

evidence/pool.go

evidence/reactor.go

erikgrinaker · 2020-10-26T19:27:14Z

Would be nice with unit tests for these cases as well.

cmwaters · 2020-10-27T12:52:31Z

So I believe I have managed to fix the problem. My hypothesis for how it originate stems from basically having all reactors start up at once. This means that a peer who is fast syncing is also running the evidence reactor.

In a situation where an attack occurred and evidence was formed, validators tried to send the evidence to all it's peers. However at the time, a full node was fast syncing and so was behind the height of the attack. Hence the validators were being put in a for loop where every 100 ms they would ping the behind node to see if it had caught up and could receive evidence. In this for loop they were examining whether the evidence had already expired but not if it had been committed .

Therefore by the time the full node was at the height of the attack and could receive the evidence the validators were five or six heights ahead and the evidence already committed hence all these validators were sending committed evidence to the full node which was then trying to gossip the evidence because it also hadn't noticed that it was committed hence a lot of peers stopped connections with one another.

evidence/reactor.go

erikgrinaker

Great reasoning! This makes sense to me, and I think the proposed fix looks fine.

erikgrinaker · 2020-10-27T12:58:47Z

evidence/pool.go

+			if evpool.isCommitted(ev) {
+				return &types.ErrInvalidEvidence{Evidence: ev, Reason: errors.New("evidence was already committed")}
+			}
+
 			evInfo, err := evpool.verify(ev)
 			if err != nil {
 				return &types.ErrInvalidEvidence{Evidence: ev, Reason: err}
 			}

 			if err := evpool.addPendingEvidence(evInfo); err != nil {


It's a bit strange to me that a "check" function would schedule the evidence as well. Shouldn't it just check it without any side-effects?

I know this is unrelated to your changes, just an observation.

Yes you are correct. It doesn't mutate the evidence it just saves it if it hasn't seen it before. This makes it quicker later on when we want to create abci evidence and send it to the application because the evidence pool has already gone and worked out who the validators involved were

…hat is already committed (#5574)

don't return an error when a peer sends committed evidence

22ee3e6

cmwaters added T:bug Type Bug (Confirmed) C:evidence Component: Evidence labels Oct 26, 2020

cmwaters requested review from ebuchman, erikgrinaker, melekes and tessr as code owners October 26, 2020 18:50

erikgrinaker reviewed Oct 26, 2020

View reviewed changes

evidence/pool.go Outdated Show resolved Hide resolved

evidence/pool.go Outdated Show resolved Hide resolved

evidence/reactor.go Outdated Show resolved Hide resolved

return an error on check evidence

8cc615f

tweak reactor and add tests

1a28678

cmwaters requested a review from erikgrinaker October 27, 2020 12:54

erikgrinaker reviewed Oct 27, 2020

View reviewed changes

evidence/reactor.go Outdated Show resolved Hide resolved

erikgrinaker approved these changes Oct 27, 2020

View reviewed changes

cmwaters added 3 commits October 27, 2020 16:42

add comment and adjust broadcast interval

f24b409

update changelog

8a6f4cb

Merge branch 'master' into callum/committed_evidence

5beb80c

cmwaters requested a review from tac0turtle as a code owner October 27, 2020 15:45

cmwaters merged commit 651d8f0 into master Oct 27, 2020

cmwaters deleted the callum/committed_evidence branch October 27, 2020 16:12

cmwaters added a commit that referenced this pull request Oct 27, 2020

evidence: don't send committed evidence and ignore inbound evidence t…

6bb2f99

…hat is already committed (#5574)

cmwaters mentioned this pull request Oct 27, 2020

evidence: backport evidence fixes regarding gossiping #5580

Merged

cmwaters added a commit that referenced this pull request Oct 28, 2020

evidence: don't send committed evidence and ignore inbound evidence t…

5cfe035

…hat is already committed (#5574)

cmwaters mentioned this pull request Feb 22, 2022

docs: remove spec section from v0.34 docs #7940

Merged

This was referenced Apr 22, 2022

feature/GRAPH 306 v0.34.13 dm graphprotocol/tendermint#15

Merged

feature/GRAPH 306 v0.34.14 dm graphprotocol/tendermint#16

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

evidence: ignore when a peer sends committed evidence #5574

evidence: ignore when a peer sends committed evidence #5574

cmwaters commented Oct 26, 2020 •

edited

Loading

codecov bot commented Oct 26, 2020 •

edited

Loading

erikgrinaker commented Oct 26, 2020

cmwaters commented Oct 27, 2020

erikgrinaker left a comment

erikgrinaker Oct 27, 2020

cmwaters Oct 27, 2020

evidence: ignore when a peer sends committed evidence #5574

evidence: ignore when a peer sends committed evidence #5574

Conversation

cmwaters commented Oct 26, 2020 • edited Loading

Description

codecov bot commented Oct 26, 2020 • edited Loading

Codecov Report

erikgrinaker commented Oct 26, 2020

cmwaters commented Oct 27, 2020

erikgrinaker left a comment

Choose a reason for hiding this comment

erikgrinaker Oct 27, 2020

Choose a reason for hiding this comment

cmwaters Oct 27, 2020

Choose a reason for hiding this comment

cmwaters commented Oct 26, 2020 •

edited

Loading

codecov bot commented Oct 26, 2020 •

edited

Loading