File hashing 4279 v4 #5929
Conversation
Files are truncated when they exceed the stream reassembly depth. However, for some purposes a hash of the prefix of the file is still useful, as long as we know the size of this prefix. We therefore change the logging logic to output the hash in the case of file truncation.
Allow capping the number of bytes of a file used for the file hash. This allows for consistency and avoids relying on whatever the stream reassembly limit happens to be. If the value is 0 or not set, the original behaviour of hashing up to the stream reassembly limit remains. If it is greater than the stream reassembly limit, the stream reassembly limit takes precedence.
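A minimal sketch of how such a cap might interact with the stream reassembly limit. The names effective_hash_limit, hash_byte_limit and reassembly_depth are illustrative only and are not taken from the patch:

// Hypothetical helper: decide how many bytes of a file feed into the hash.
// A limit of 0 means "no explicit cap", so hashing is bounded only by the
// stream reassembly depth; otherwise the smaller of the two values wins.
fn effective_hash_limit(hash_byte_limit: u64, reassembly_depth: u64) -> u64 {
    if hash_byte_limit == 0 {
        reassembly_depth
    } else {
        std::cmp::min(hash_byte_limit, reassembly_depth)
    }
}

fn main() {
    // No explicit cap: fall back to the reassembly depth.
    assert_eq!(effective_hash_limit(0, 1_048_576), 1_048_576);
    // Cap below the reassembly depth: the cap applies.
    assert_eq!(effective_hash_limit(65_536, 1_048_576), 65_536);
    // Cap above the reassembly depth: the depth takes precedence.
    assert_eq!(effective_hash_limit(4_194_304, 1_048_576), 1_048_576);
}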
Force-pushed from 5ef4d2e to eba5b5c.
Codecov Report

@@            Coverage Diff             @@
##           master    #5929      +/-   ##
==========================================
- Coverage   76.69%   76.68%   -0.01%
==========================================
  Files         604      604
  Lines      187657   187837     +180
==========================================
+ Hits       143926   144046     +120
- Misses      43731    43791      +60

Flags with carried forward coverage won't be shown.
Force-pushed from eba5b5c to 5142b09.
if !bytes_hashed.is_null() {
    if g_hash_byte_limit > 0 {
        // Cap the total data fed into the hash at g_hash_byte_limit bytes.
        let bytes_remaining = match g_hash_byte_limit > *bytes_hashed {
            true => g_hash_byte_limit - *bytes_hashed,
            false => 0,
        };
        if len > bytes_remaining {
            len = bytes_remaining;
        }
    }
I don't like the length logic being put into a low-level hashing function that is not file-store specific. I think this accounting should be moved out of here.
In general I'm in favour of this approach; I just don't want specifics of the filestore to leak into the lower-level hashing API.
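One way the accounting could stay out of the low-level hashing API, sketched with hypothetical names (Hasher, update and hash_file_chunk are illustrative, not the actual Suricata functions): the file-store side trims each slice to the configured limit before it ever reaches the hasher.

// Hypothetical low-level hasher: knows nothing about file-store limits,
// it simply digests whatever slice it is given.
struct Hasher {
    bytes_hashed: u64,
}

impl Hasher {
    fn update(&mut self, data: &[u8]) {
        // ... feed `data` into the underlying digest ...
        self.bytes_hashed += data.len() as u64;
    }
}

// Hypothetical file-store caller: applies the byte cap itself, so the
// limit logic never leaks into the hashing API.
fn hash_file_chunk(hasher: &mut Hasher, data: &[u8], hash_byte_limit: u64) {
    let remaining = if hash_byte_limit > 0 {
        hash_byte_limit.saturating_sub(hasher.bytes_hashed)
    } else {
        data.len() as u64
    };
    let take = std::cmp::min(remaining, data.len() as u64) as usize;
    hasher.update(&data[..take]);
}

fn main() {
    let mut h = Hasher { bytes_hashed: 0 };
    // With a 64-byte cap, only the first 64 of these 100 bytes are hashed.
    hash_file_chunk(&mut h, &[0u8; 100], 64);
    assert_eq!(h.bytes_hashed, 64);
}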
Ping @rtkjweeks, any plans to continue on this?
If urilen induced depth was set, later DetectContentPropagateLimits() would apply a wrong depth setting, leading to a false negative in some cases. Bug: OISF#5929.
Make sure these boxes are signed before submitting your Pull Request -- thank you.
Link to redmine ticket:
https://redmine.openinfosecfoundation.org/issues/4279
Previous PR:
#5882
Describe changes:
suricata-verify-pr: 430
#suricata-verify-repo:
#suricata-verify-branch:
#suricata-update-pr:
#suricata-update-repo:
#suricata-update-branch:
#libhtp-pr:
#libhtp-repo:
#libhtp-branch: