Next 20160520 v6 #2091

Merged
merged 37 commits on May 20, 2016
Conversation

inliniac
Contributor

Contains:

Prscript:

victorjulien and others added 30 commits May 20, 2016 08:56
This is a preparation for flow locking updates.
Update Flow lookup functions to get a flow reference during lookup.

This reference is set under the FlowBucket lock.

This paves the way to not getting a flow lock during lookups.
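To illustrate the pattern, a minimal sketch with simplified stand-in structures; the real Flow and FlowBucket carry many more fields and use Suricata's own lock wrappers:

```c
#include <pthread.h>
#include <stddef.h>
#include <stdint.h>

/* Simplified stand-ins for the real Flow/FlowBucket structures. */
typedef struct Flow_ {
    uint32_t hash;
    int use_cnt;                /* reference count, guarded by the bucket lock here */
    struct Flow_ *next;
} Flow;

typedef struct FlowBucket_ {
    pthread_mutex_t m;
    Flow *head;
} FlowBucket;

/* Look up a flow by hash and return it with use_cnt already bumped.
 * The reference is taken while the bucket lock is still held, so the
 * flow cannot be timed out and freed between the lookup and the caller
 * actually using it. The caller releases it with a matching decref. */
static Flow *FlowLookupRef(FlowBucket *fb, uint32_t hash)
{
    pthread_mutex_lock(&fb->m);
    for (Flow *f = fb->head; f != NULL; f = f->next) {
        if (f->hash == hash) {
            f->use_cnt++;       /* reference set under the FlowBucket lock */
            pthread_mutex_unlock(&fb->m);
            return f;
        }
    }
    pthread_mutex_unlock(&fb->m);
    return NULL;
}
```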
Instead of handling the packet update during flow lookup, handle
it in the stream/detect threads. This lowers the load of the
capture thread(s) in autofp mode.

The decoders now set a flag in the packet if the packet needs a
flow lookup. Then the workers will take care of this. The decoders
also already calculate the raw flow hash value. This is so that
this value can be used in flow balancing in autofp.

Because the flow lookup/creation is now done in the worker threads,
the flow balancing can no longer use the flow. It's not yet
available. Autofp load balancing uses raw hash values instead.

In the same line, move UDP AppLayer out of the DecodeUDP module,
and also into the stream/detect threads.
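The hash-based balancing roughly boils down to the following sketch; the packet fields and the flag name are illustrative stand-ins, not the real API:

```c
#include <stdint.h>

/* Illustrative packet fields only; the real Packet and flag names differ. */
typedef struct Packet_ {
    uint32_t flow_hash;         /* raw hash computed by the decoder */
    uint8_t flags;
#define PKT_WANTS_FLOW 0x01     /* decoder asks a worker to do the flow lookup */
} Packet;

/* Pick an autofp output queue purely from the raw hash; no flow table
 * access is needed in the capture thread. */
static inline uint16_t AutofpPickQueue(const Packet *p, uint16_t nqueues)
{
    return (uint16_t)(p->flow_hash % nqueues);
}
```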

Handle TCP session reuse inside the flow engine itself. If a looked up
flow matches the packet, but the packet is a TCP stream starter, check
if the ssn needs to be reused. If that is the case, handle it within
the lookup function. This simplifies the locking and removes potential
race conditions.
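A reduced sketch of that decision, with stand-in structures: the real reuse check also inspects sequence numbers and timestamps, and the real code moves the old flow aside rather than resetting it in place, which this sketch does only to stay self-contained.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdlib.h>

/* Heavily reduced stand-ins: state 2 stands for a closed TCP session. */
typedef struct TcpSession_ { int state; } TcpSession;
typedef struct Packet_ { bool is_syn; } Packet;
typedef struct Flow_ { TcpSession *ssn; } Flow;

/* Does a SYN on an existing flow mean the old session is being reused?
 * The real check also looks at sequence numbers and timestamps. */
static bool TcpSessionReuse(const Flow *f, const Packet *p)
{
    return p->is_syn && f->ssn != NULL && f->ssn->state == 2;
}

/* Called from the lookup path with the bucket lock held: if the matched
 * flow carries a reused session, reset it here so the caller always
 * gets a flow that is valid for this packet. */
static Flow *FlowHandleTcpReuse(Flow *f, const Packet *p)
{
    if (TcpSessionReuse(f, p)) {
        free(f->ssn);           /* the real code hands the old ssn to cleanup */
        f->ssn = NULL;          /* flow now behaves like a brand new one */
    }
    return f;
}
```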
When we run on live traffic, time handling is simple. Packets have a
timestamp set by the capture method. Management threads can simply
use 'gettimeofday' to know the current time. There should never be
any serious gap between the two or major differences between the
threads.

In offline mode, things are dramatically different. Here we try to keep
the time from the pcap, which means that if the packets are recorded in
2011 the log output should also reflect this. Multiple issues:

 1. merged pcaps might have huge time jumps or time going backward
 2. slowly recorded pcaps may be processed much faster than their
    'realtime'
 3. management threads need a concept of what the 'current' time is for
    enforcing timeouts
 4. due to (1) individual threads may have very different views on what
    the current time is. E.g. T1 processes packet 1 with TS X, while T2
    at the very same time processes packet 2 with TS X+100000s.

The changes in flow handling make the problems worse. The capture thread
no longer handles the flow lookup, while it did set the global 'time'.
This meant that a thread may be working on Packet 1 with TS 1, while the
capture thread already saw packet 2 with TS 10000. Management threads
would take TS 10000 as the 'current time', considering a flow created by
the first thread as timed out immediately.

This was less of a problem before the flow changes as the capture thread
would also create a flow reference for a packet, meaning the flow
couldn't time out as easily. Packets in the queues between capture
thread and workers would all hold such references.

The patch updates the time handling to be as follows.

In offline mode we keep the timestamp per thread. If a management thread
needs current time, it will get the minimum of the threads' values. This
is to avoid the problem that T2's time value might already trigger a
flow timeout: if the flow's lastts lags 100000s behind it, the flow
would almost certainly be considered timed out.
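A minimal sketch of the "minimum of the threads' values" idea, assuming a fixed worker array and second-resolution timestamps; function names are illustrative and the real code uses Suricata's thread registry:

```c
#include <pthread.h>
#include <stdint.h>

#define MAX_WORKERS 16

static pthread_mutex_t time_lock = PTHREAD_MUTEX_INITIALIZER;
static uint64_t thread_ts[MAX_WORKERS];   /* last packet timestamp (seconds) per worker */
static int nworkers = MAX_WORKERS;

/* Each worker publishes the timestamp of the packet it just processed. */
void ThreadSetPacketTime(int tid, uint64_t ts_sec)
{
    pthread_mutex_lock(&time_lock);
    thread_ts[tid] = ts_sec;
    pthread_mutex_unlock(&time_lock);
}

/* A management thread asking for the "current" time gets the minimum
 * over all workers, so a thread that is far ahead in the pcap cannot
 * make flows handled by a slower thread look timed out. */
uint64_t OfflineGetTime(void)
{
    uint64_t min = UINT64_MAX;
    pthread_mutex_lock(&time_lock);
    for (int i = 0; i < nworkers; i++) {
        if (thread_ts[i] != 0 && thread_ts[i] < min)
            min = thread_ts[i];
    }
    pthread_mutex_unlock(&time_lock);
    return (min == UINT64_MAX) ? 0 : min;
}
```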
To simplify locking, move all locking out of the individual detect
code. Instead at the start of detection lock the flow, and at the
end of detection unlock it.

The lua code can still be called without a lock (from the output
code paths), so a lock hint is still passed around to take care of this.
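In outline, the pattern looks like this; FLOWLOCK_* are modeled on a plain mutex here and the inspection body is a placeholder:

```c
#include <pthread.h>

typedef struct Flow_ {
    pthread_mutex_t m;
    /* ... flow state ... */
} Flow;

/* Modeled on a plain mutex here; the real FLOWLOCK_* wrappers differ. */
#define FLOWLOCK_WRLOCK(f) pthread_mutex_lock(&(f)->m)
#define FLOWLOCK_UNLOCK(f) pthread_mutex_unlock(&(f)->m)

/* Placeholder for the keyword/engine inspection, which no longer locks. */
static void DetectRunInspect(Flow *f) { (void)f; }

/* One lock/unlock pair around the whole detection run. */
void DetectFlow(Flow *f)
{
    FLOWLOCK_WRLOCK(f);
    DetectRunInspect(f);    /* individual detect code can assume the lock is held */
    FLOWLOCK_UNLOCK(f);
}
```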
Initial version of the 'FlowWorker' thread module. This module
combines Flow handling, TCP handling, App layer handling and
Detection in a single module. It does all flow related processing
under a single flow lock.
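The rough shape of such a combined worker step, with placeholder stages standing in for the real flow, stream, app-layer and detect modules:

```c
#include <pthread.h>

typedef struct Packet_ { int placeholder; } Packet;
typedef struct Flow_ { pthread_mutex_t m; } Flow;

/* Placeholder stages; the real modules are the flow engine, the TCP
 * stream engine, the app-layer parsers and the detect engine. */
static Flow *FlowLookup(Packet *p) { static Flow f = { PTHREAD_MUTEX_INITIALIZER }; (void)p; return &f; }
static void StreamTcpUpdate(Flow *f, Packet *p) { (void)f; (void)p; }
static void AppLayerUpdate(Flow *f, Packet *p)  { (void)f; (void)p; }
static void DetectPacket(Flow *f, Packet *p)    { (void)f; (void)p; }

/* All flow related processing for a packet under a single flow lock. */
void FlowWorkerProcess(Packet *p)
{
    Flow *f = FlowLookup(p);            /* flow handling */
    pthread_mutex_lock(&f->m);
    StreamTcpUpdate(f, p);              /* TCP handling */
    AppLayerUpdate(f, p);               /* app layer handling */
    DetectPacket(f, p);                 /* detection */
    pthread_mutex_unlock(&f->m);
}
```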
Now that the flow lookup is done in the worker threads the flow
queue handlers running after the capture thread(s) no longer have
access to the flow. This limits the options of how flow balancing
can be done.

This patch removes all code that is now useless. The only 2 methods
that still make sense are 'hash' and 'ippair'.
Add a new API to store data from streaming sources, like HTTP body
processing or TCP data.

Currently most of the code uses a pattern of list of data chunks
(e.g. TcpSegment) that is reassembled into a large buffer on-demand.

The Streaming Buffer API changes the logic to store the data in
reassembled form from the start, with the segments/chunks pointing
to the reassembled data.

The main buffer storing the data slides forward, automatically or
manually. The *NoTrack calls allow for a segmentless mode of
operation. (A minimal sketch of this layout follows the list below.)

This approach has two main advantages:

1. accessing the reassembled data is virtually cost-free
2. reduction of allocations and memory management
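A minimal sketch of the layout, with illustrative names rather than the real API; buffer sliding and the NoTrack variant are not shown:

```c
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

typedef struct StreamingBuffer_ {
    uint8_t *buf;
    uint32_t size;            /* allocated size */
    uint32_t offset;          /* bytes currently stored */
    uint64_t stream_offset;   /* absolute offset of buf[0] in the stream; slides forward */
} StreamingBuffer;

typedef struct Segment_ {
    uint64_t stream_offset;   /* where this chunk starts in the stream */
    uint32_t len;
} Segment;

/* Append a chunk: the data lands directly in the shared buffer, and the
 * segment only records where it lives. */
static int SBAppend(StreamingBuffer *sb, Segment *seg, const uint8_t *data, uint32_t len)
{
    if (sb->offset + len > sb->size) {
        uint32_t newsize = sb->size ? sb->size * 2 : 4096;
        while (newsize < sb->offset + len)
            newsize *= 2;
        uint8_t *tmp = realloc(sb->buf, newsize);
        if (tmp == NULL)
            return -1;
        sb->buf = tmp;
        sb->size = newsize;
    }
    memcpy(sb->buf + sb->offset, data, len);
    seg->stream_offset = sb->stream_offset + sb->offset;
    seg->len = len;
    sb->offset += len;
    return 0;
}

/* Reading a segment back is a pointer computation, not a reassembly
 * pass (valid as long as the buffer has not slid past the segment). */
static const uint8_t *SBGetSegmentData(const StreamingBuffer *sb, const Segment *seg)
{
    return sb->buf + (seg->stream_offset - sb->stream_offset);
}
```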
Convert HTTP body handling to use the Streaming Buffer API. This means
the HtpBodyChunks no longer maintain their own data segments, but
instead add their data to the StreamingBuffer instance in the HtpBody
structure.

In case the HtpBodyChunk needs to access its data, it can still do so
through the Streaming Buffer API.

Updates & simplifies the various users of the reassembled bodies:
multipart parsing and the detection engine.
The HTPCfgDir structure is meant to contain config for per direction
body parsing parameters.

This patch stores the streaming API config.
Enforce inspect window also in IDS mode. Try always to get at least
'inspect win' worth of data. In case there is more new data, take
some of the old data as well to make sure there is always some overlap.

This unifies IDS and IPS modes, the only difference left is the start
of inspection. IDS waits until min_size is available, IPS starts right
away.
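A sketch of how such a window could be picked; the variable names and the overlap fraction are illustrative, not the actual values used:

```c
#include <stdint.h>

/* progress: absolute offset already inspected
 * avail:    absolute offset of the end of buffered data
 * win:      configured inspect window
 * Returns the absolute offset where inspection should start. */
static uint64_t InspectWindowStart(uint64_t progress, uint64_t avail, uint64_t win)
{
    uint64_t new_data = avail - progress;
    if (new_data >= win) {
        /* plenty of new data: still step back a bit so the window
         * overlaps the already-inspected region (win/3 is illustrative) */
        uint64_t overlap = win / 3;
        return (progress > overlap) ? progress - overlap : 0;
    }
    /* not enough new data: extend backwards into old data until the
     * window is `win` bytes long */
    uint64_t start = (avail > win) ? avail - win : 0;
    return (start < progress) ? start : progress;
}
```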
Make the file storage use the streaming buffer API.

As the individual file chunks were not needed by themselves, this
approach uses a chunkless implementation.
This will handle minimal DetectEngineCtx structures (used in delayed
detect mode) safely, since they don't get SPM global contexts allocated.

Also added BUG_ON checks for valid spm_table entries.
No need for cooked header in the case of mmap capture.
This patch adds a basic implementation of AF_PACKET tpacket v3. It
is basic in that it only works in 'workers' running mode. If not
in 'workers' mode there is a fallback to tpacket_v2. The feature
is activated via the tpacket-v3 option in the af-packet section of
Suricata YAML.
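For reference, a self-contained sketch of what a TPACKET_V3 ring setup looks like at the socket level; the block size and block timeout fields correspond to the af-packet options discussed in the commits below, while the frame size, the helper name and the reduced error handling are illustrative:

```c
#include <arpa/inet.h>
#include <linux/if_ether.h>
#include <linux/if_packet.h>
#include <string.h>
#include <sys/mman.h>
#include <sys/socket.h>
#include <unistd.h>

/* Open an AF_PACKET socket, switch it to TPACKET_V3 and map the RX ring.
 * Returns the socket fd, or -1 on error. */
int AFPSetupV3Ring(unsigned int block_size, unsigned int block_nr,
                   unsigned int block_timeout_ms, void **ring)
{
    int fd = socket(AF_PACKET, SOCK_RAW, htons(ETH_P_ALL));
    if (fd < 0)
        return -1;

    int ver = TPACKET_V3;
    if (setsockopt(fd, SOL_PACKET, PACKET_VERSION, &ver, sizeof(ver)) < 0)
        goto error;

    struct tpacket_req3 req;
    memset(&req, 0, sizeof(req));
    req.tp_block_size = block_size;           /* maps to the 'block-size' option */
    req.tp_block_nr = block_nr;
    req.tp_frame_size = 2048;                 /* illustrative frame size */
    req.tp_frame_nr = (block_size / req.tp_frame_size) * block_nr;
    req.tp_retire_blk_tov = block_timeout_ms; /* 'block-timeout': max fill time of a block */
    if (setsockopt(fd, SOL_PACKET, PACKET_RX_RING, &req, sizeof(req)) < 0)
        goto error;

    *ring = mmap(NULL, (size_t)block_size * block_nr,
                 PROT_READ | PROT_WRITE, MAP_SHARED, fd, 0);
    if (*ring == MAP_FAILED)
        goto error;
    return fd;

error:
    close(fd);
    return -1;
}
```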
Reorder fields in AFPThreadVars and remove some that were not
used outside of initialization.
Remove unused fields in AFPThreadVars. This patch also gets rid
of the bytes counter as it was only used to display a message at exit.
The information on the livedev and the packet counters is enough.
Error handling was missing. The implementation makes the choice
to destroy the socket in case of a parsing error. The same was
done for tpacket_v2.
It is used to set the block size in tpacket_v3. It allows the user
to tune the capture depending on their bandwidth.

The default block size has been increased to allow a more
efficient walk over the blocks.
Block timeout defines the maximum filling duration of a block.
If TPACKET_V3 is not defined then it is not available and we should
not build anything related to tpacket_v3. This will allow us to
activate it by default and fall back to v2 if not available.
Update the code to use mmap capture by default even if it is unset in
the configuration file. mmap capture can now only be turned off by
explicitly setting 'use-mmap: no' in the configuration.
regit and others added 7 commits May 20, 2016 12:32
We can lose packets during setup because we are reading nothing.
So it is logical to discard the counter at the start of capture to
start from a clean state. This means we don't need to account for
the drops at start. But the stats call that resets the drop counts
will also return and reset the packets count. So we need to know
how many packets we really have. This is in fact the number of
packets coming from the stats call minus the number of discarded
packets and the drop count. All the other packets will have to be
read.
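Restated as a tiny hypothetical helper, the arithmetic is simply:

```c
#include <stdint.h>

/* The kernel stats packet count includes both the packets deliberately
 * discarded during setup and the dropped ones, so the number of packets
 * that still have to be read from the ring is the stats count minus
 * both. Parameter names are illustrative. */
static inline uint64_t AFPPacketsToRead(uint64_t stats_packets,
                                        uint64_t discarded_at_start,
                                        uint64_t drops)
{
    return stats_packets - discarded_at_start - drops;
}
```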
Only parse them if mmap is activated.
As we only use the seconds we don't need GetTime(), which is slower
and gets us milliseconds.