
Optimization/defrag rbtree/v3 #3475

Closed
wants to merge 15 commits from the optimization/defrag-rbtree/v3 branch

Conversation

jasonish
Member

Previous PR:

Changes:

  • Fix performance issue with Linux profile.
  • Add missing locks around pool return.

PRscript:

victorjulien and others added 15 commits August 31, 2018 21:35
To improve worst-case performance, turn the segments list into an rbtree. This greatly improves inserts, lookups and removals when the number of segments gets very large.

The tree is sorted by the segment sequence number as its primary key. If two segments have the same seq, the payload_len (segment length) is used as a tie-breaker, so the larger segment is placed after the smaller one. Exact matches are not added to the tree.
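A minimal sketch of that ordering, using the BSD sys/tree.h macros these commits reference (the struct layout and the SEQ_GT/SEQ_LT wraparound-safe macros here are illustrative assumptions, not code from this PR):

```c
#include <stdint.h>
#include "tree.h"  /* BSD sys/tree.h red-black tree macros */

/* illustrative segment type; only what the tree needs */
typedef struct TcpSegment_ {
    uint32_t seq;              /* sequence number: primary key */
    uint16_t payload_len;      /* segment length: tie-breaker */
    RB_ENTRY(TcpSegment_) rb;  /* embedded rbtree linkage */
} TcpSegment;

/* wraparound-safe TCP sequence comparisons */
#define SEQ_GT(a, b) ((int32_t)((a) - (b)) > 0)
#define SEQ_LT(a, b) ((int32_t)((a) - (b)) < 0)

/* sort by seq; on equal seq sort by payload_len so the larger segment
 * is placed after the smaller one; 0 means exact match, not inserted */
static int TcpSegmentCompare(TcpSegment *a, TcpSegment *b)
{
    if (SEQ_GT(a->seq, b->seq))
        return 1;
    if (SEQ_LT(a->seq, b->seq))
        return -1;
    if (a->payload_len > b->payload_len)
        return 1;
    if (a->payload_len < b->payload_len)
        return -1;
    return 0;
}

RB_HEAD(TCPSEG, TcpSegment_);
RB_GENERATE(TCPSEG, TcpSegment_, rb, TcpSegmentCompare)
```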
Now that the rbtree gives us a properly sorted segment tree, where segments with the same SEQ are ordered by payload_len from smallest to largest, we can avoid walking backwards when checking for overlaps. The direct RB_PREV either overlaps or it doesn't, and that verdict is reliable for the rest of the tree.
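Expressed against the sketch above, the backwards check then shrinks to a single predecessor lookup (again illustrative; the overlap test shown is an assumption):

```c
/* Check only the direct predecessor for overlap. Because equal-SEQ
 * segments sort smallest to largest by payload_len, this single
 * RB_PREV() result is, per the commit's reasoning, a reliable verdict
 * for everything earlier in the tree. */
static int OverlapsBefore(struct TCPSEG *tree, TcpSegment *seg)
{
    TcpSegment *prev = RB_PREV(TCPSEG, tree, seg);
    if (prev == NULL)
        return 0;  /* nothing before us in the tree */
    /* does the predecessor's range reach into our segment? */
    return SEQ_GT(prev->seq + prev->payload_len, seg->seq);
}
```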
Don't try to do a 'fast path' by checking RB_MAX. RB_MAX walks the tree, which means it can be quite expensive, and this cost would be paid for virtually every data segment. The actual insert that follows would then walk the tree again.

Instead, simply insert the segment. There is a slight cost from the now-unconditional overlap check, but it is much smaller than a tree walk over a full tree.
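With BSD tree.h semantics, RB_INSERT() itself doubles as the duplicate check: it walks the tree once and returns the existing node on an exact match, or NULL on success. A sketch reusing the TCPSEG tree above:

```c
/* Insert directly: one tree walk instead of two (no RB_MAX pre-check).
 * RB_INSERT() returns NULL on success, or the already-present node when
 * the compare function reports an exact match. */
static int SegmentInsert(struct TCPSEG *tree, TcpSegment *seg)
{
    TcpSegment *exact = RB_INSERT(TCPSEG, tree, seg);
    if (exact != NULL) {
        /* exact duplicate (same seq and payload_len): not inserted,
         * caller should return 'seg' to its pool */
        return -1;
    }
    return 0;
}
```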
Use this in places where we need the outer right edge of our sequence space.

This avoids having to walk the tree to find it, which is a potentially expensive operation.
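One way to sketch this idea, with hypothetical field and function names: cache the right edge next to the tree and bump it on insert, so readers get it in O(1) instead of an RB_MAX() walk:

```c
/* hypothetical stream state: cached right edge kept beside the tree */
struct TcpStreamSketch {
    struct TCPSEG seg_tree;
    uint32_t segs_right_edge;  /* highest seq + len seen so far */
};

/* called on every segment insert; consumers just read the field */
static void UpdateRightEdge(struct TcpStreamSketch *stream, TcpSegment *seg)
{
    uint32_t edge = seg->seq + seg->payload_len;
    if (SEQ_GT(edge, stream->segs_right_edge))
        stream->segs_right_edge = edge;
}
```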
Convert to an rbtree from a linked list. These ranges, of which there can be multiple per packet, are fully controlled by an attacker. The attacker could craft a stream of packets in such a way that the list grows very large, making inserts and removals very expensive, as well as the list walks done for size calculation and pruning operations.

The rbtree makes inserts/removals much cheaper, at a slight overhead for 'normal' operations and slightly higher per-record memory use.
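Mechanically, the conversion looks roughly like this (the fragment fields and the IP_FRAGMENTS name are illustrative assumptions):

```c
/* defrag fragment: list pointers replaced by embedded rbtree linkage,
 * sorted by fragment offset */
typedef struct Frag_ {
    uint32_t offset;     /* fragment offset: sort key */
    uint32_t len;        /* fragment data length */
    RB_ENTRY(Frag_) rb;  /* replaces the old next/prev list pointers */
} Frag;

static int FragCompare(Frag *a, Frag *b)
{
    if (a->offset > b->offset)
        return 1;
    if (a->offset < b->offset)
        return -1;
    return 0;
}

RB_HEAD(IP_FRAGMENTS, Frag_);
RB_GENERATE(IP_FRAGMENTS, Frag_, rb, FragCompare)
```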
Optimize by keeping a count during insert/remove instead of walking the tree for each check.
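That bookkeeping can be as small as a counter adjusted at the two mutation points; a sketch using the defrag tree above (the tracker fields are assumptions):

```c
/* keep the count in lock-step with the tree so a size check is O(1)
 * instead of a full-tree walk per check */
struct FragTrackerSketch {
    struct IP_FRAGMENTS fragment_tree;
    uint32_t fragment_count;
};

static void FragInsert(struct FragTrackerSketch *t, Frag *frag)
{
    if (RB_INSERT(IP_FRAGMENTS, &t->fragment_tree, frag) == NULL)
        t->fragment_count++;  /* only count actual inserts */
}

static void FragRemove(struct FragTrackerSketch *t, Frag *frag)
{
    RB_REMOVE(IP_FRAGMENTS, &t->fragment_tree, frag);
    t->fragment_count--;
}
```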
Switch the StreamBufferBlocks implementation to use an rbtree instead of a list. This makes inserts/removals and lookups a lot cheaper when the number of data gaps is large.

Use separate compare functions for inserts and regular lookups. Inserts care about the offset, while lookups care about the block's right edge as well.
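A sketch of the two compare functions (the StreamBufferBlock layout shown is an assumption for illustration):

```c
#include <stdint.h>

/* illustrative block layout */
typedef struct StreamBufferBlock_ {
    uint64_t offset;
    uint32_t len;
} StreamBufferBlock;

/* insert compare: order purely by left edge (offset) */
static int SBBInsertCompare(StreamBufferBlock *a, StreamBufferBlock *b)
{
    if (a->offset > b->offset)
        return 1;
    if (a->offset < b->offset)
        return -1;
    return 0;
}

/* lookup compare: a search key landing anywhere inside a block, i.e.
 * before its right edge (offset + len), counts as a match */
static int SBBLookupCompare(StreamBufferBlock *key, StreamBufferBlock *blk)
{
    if (key->offset < blk->offset)
        return -1;
    if (key->offset >= blk->offset + blk->len)
        return 1;
    return 0;
}
```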
Reorder the fields to reduce the TcpSegment structure by 8 bytes.
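The mechanism is standard struct padding on LP64 platforms: pointers need 8-byte alignment, so interleaving narrow fields with pointers wastes pad bytes that reordering recovers. An illustrative example with hypothetical fields (not the actual TcpSegment layout) that saves the same 8 bytes:

```c
#include <stdint.h>

/* narrow field before the pointer forces padding: */
struct Before {              /* typically 24 bytes on LP64 */
    uint16_t payload_len;    /* 2 bytes + 6 padding */
    uint8_t *payload;        /* 8 bytes */
    uint32_t seq;            /* 4 bytes + 4 tail padding */
};

/* widest-first ordering removes it: */
struct After {               /* typically 16 bytes on LP64 */
    uint8_t *payload;        /* 8 bytes */
    uint32_t seq;            /* 4 bytes */
    uint16_t payload_len;    /* 2 bytes + 2 tail padding */
};
```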
Instead of just marking fragments that have been completely overlapped and won't be part of the assembled packet, remove them from the fragment tree as soon as the overlap is detected.
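In tree terms that is a detect-and-remove at overlap time rather than a deferred flag; a sketch built on the defrag sketch above, where FragFree() is a hypothetical helper that returns the fragment to its pool (which, per the change list above, needs to happen under the pool lock):

```c
void FragFree(Frag *frag);  /* hypothetical: pool return, takes pool lock */

/* if 'old' is completely covered by 'incoming', unlink and free it now
 * so it never participates in reassembly or further tree operations */
static void RemoveIfFullyOverlapped(struct FragTrackerSketch *t,
                                    Frag *old, const Frag *incoming)
{
    if (incoming->offset <= old->offset &&
        incoming->offset + incoming->len >= old->offset + old->len) {
        RB_REMOVE(IP_FRAGMENTS, &t->fragment_tree, old);
        t->fragment_count--;
        FragFree(old);
    }
}
```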
@jasonish jasonish requested a review from a team as a code owner September 14, 2018 14:36
@victorjulien victorjulien mentioned this pull request Sep 17, 2018
@victorjulien
Copy link
Member

Merged into #3479, thanks Jason!

@jasonish jasonish deleted the optimization/defrag-rbtree/v3 branch September 18, 2018 02:43
catenacyber added a commit to catenacyber/suricata that referenced this pull request Jan 15, 2021
victorjulien pushed a commit to victorjulien/suricata that referenced this pull request Feb 23, 2021
Add AndX support for SMB1. Finishes OISF#3475.

[Updated by Victor Julien to split functions]