@AgeManning
Member

Issue Addressed

Some nodes not following head, high CPU usage and HTTP API delays

Proposed Changes

Patches gossipsub. Gossipsub was using an `lru_time_cache` to check for duplicates, which performed an `O(N)` lookup on every gossipsub message to update the time cache. This caused high CPU usage and blocked network threads.

This PR introduces a custom cache without O(N) inserts.
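
For illustration only, here is a minimal sketch of one way a duplicate cache can avoid per-message `O(N)` work: a hash set for membership plus a queue of expiry times in insertion order, so expired entries are only ever popped from the front. The `DuplicateCache` type and its method names are hypothetical and are not taken from this PR; the actual cache may be structured differently.

```rust
use std::collections::{HashSet, VecDeque};
use std::hash::Hash;
use std::time::{Duration, Instant};

/// Hypothetical duplicate-detection cache (illustrative, not the PR's type).
pub struct DuplicateCache<K> {
    /// Keys that have been seen and have not yet expired.
    seen: HashSet<K>,
    /// (expiry time, key) pairs in insertion order; the oldest is at the front.
    expirations: VecDeque<(Instant, K)>,
    /// How long a key is remembered after it is first inserted.
    ttl: Duration,
}

impl<K: Eq + Hash + Clone> DuplicateCache<K> {
    pub fn new(ttl: Duration) -> Self {
        Self {
            seen: HashSet::new(),
            expirations: VecDeque::new(),
            ttl,
        }
    }

    /// Pops expired entries from the front of the queue. Each entry is pushed
    /// once and popped once, so the cost is amortized O(1) per insert.
    fn remove_expired(&mut self, now: Instant) {
        while self
            .expirations
            .front()
            .map_or(false, |(expires, _)| *expires <= now)
        {
            if let Some((_, key)) = self.expirations.pop_front() {
                self.seen.remove(&key);
            }
        }
    }

    /// Returns `true` if `key` was not already present (a new message) and
    /// `false` if it is a duplicate. A duplicate does not refresh the expiry,
    /// so nothing in the queue ever needs to be searched or reordered.
    pub fn insert_at(&mut self, key: K, now: Instant) -> bool {
        self.remove_expired(now);
        if self.seen.contains(&key) {
            return false;
        }
        self.seen.insert(key.clone());
        self.expirations.push_back((now + self.ttl, key));
        true
    }

    /// Convenience wrapper that uses the current time.
    pub fn insert(&mut self, key: K) -> bool {
        self.insert_at(key, Instant::now())
    }
}
```

A caller would invoke `insert` with each message ID and drop the message when it returns `false`. The trade-off in this sketch is that a key's lifetime is fixed at first insertion rather than extended on every sighting.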

This also adds built-in safety mechanisms to prevent gossipsub from excessively retrying connections upon failure. A maximum limit is set on failed substream connections, after which we disconnect from the node.
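
The retry limit can be pictured as a per-peer failure counter. The sketch below is an assumption-laden illustration: `SubstreamFailureTracker`, `FailureAction`, and the value of `MAX_SUBSTREAM_FAILURES` are all hypothetical, since the PR description does not state the actual limit or how it is tracked.

```rust
use std::collections::HashMap;
use std::hash::Hash;

/// Hypothetical limit; the PR description does not give the real value.
pub const MAX_SUBSTREAM_FAILURES: u32 = 5;

/// What the caller should do after a failed substream attempt.
#[derive(Debug, PartialEq, Eq)]
pub enum FailureAction {
    /// Still under the limit: another attempt is allowed.
    Retry,
    /// Limit reached: stop retrying and disconnect from the peer.
    Disconnect,
}

/// Hypothetical per-peer counter of failed substream attempts.
pub struct SubstreamFailureTracker<P> {
    failures: HashMap<P, u32>,
}

impl<P: Eq + Hash> SubstreamFailureTracker<P> {
    pub fn new() -> Self {
        Self {
            failures: HashMap::new(),
        }
    }

    /// Records one failure for `peer` and reports whether the limit was hit.
    pub fn on_failure(&mut self, peer: P) -> FailureAction {
        let count = self.failures.entry(peer).or_insert(0);
        *count += 1;
        if *count >= MAX_SUBSTREAM_FAILURES {
            FailureAction::Disconnect
        } else {
            FailureAction::Retry
        }
    }

    /// Clears the counter, e.g. after a successful substream or a disconnect.
    pub fn reset(&mut self, peer: &P) {
        self.failures.remove(peer);
    }
}
```

In this sketch the caller is expected to call `reset` once a substream opens successfully or after acting on `Disconnect`, so that a later reconnection starts from a clean count.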

@AgeManning
Member Author

bors r+

bors bot pushed a commit that referenced this pull request Aug 8, 2020
@bors

bors bot commented Aug 8, 2020

@bors bors bot changed the title Patch gossipsub [Merged by Bors] - Patch gossipsub Aug 8, 2020
@bors bors bot closed this Aug 8, 2020
@AgeManning AgeManning deleted the network-fix branch August 18, 2020 12:20