Improve performance of exec-hash #3752

NDStrahilevitz · 2023-12-11T16:22:43Z

1. Explain what the PR does

c2ebbb2 feat(exechash): optimize hash by reusing buffer
9f9e5a4 feat(ebpf): expand exec-hash output option
eb17fc6 feat(ebpf): optimize computeFileHash
db51e0b fix(ebpf): fix getFileHash()
c469486 fix(bucketscache): GetBucket was not thread safe
041bcff chore(bucketscache): add benchmark

2. Explain how to test it

Added tests to confirm parity with previous hash functionality.
Can be manually tested by using:
tracee -o option:exec-hash={inode|dev-inode|digest-inode}
where you may choose which hash option you want, with varying degrees of accuracy and performance.

Close #3734
Close #3745

pkg/cmd/flags/output_test.go

docs/docs/flags/output.1.md

docs/man/output.1

pkg/cmd/flags/output.go

pkg/cmd/flags/output_test.go

pkg/cmd/flags/tracee_ebpf_output_test.go

pkg/exechash/cache.go

pkg/exechash/key.go

AlonZivony

Small things

docs/docs/flags/output.1.md

pkg/cmd/flags/output.go

pkg/filehash/key.go

rafaeldtinoco

LGTM. It is missing some public methods documentation lines, but other than that (and minors from Alon) I think we're good to merge. Also, nice job in using SIMD optimized SHA package, it has multiple x86 flavors (SSE, ...) and arm as well (I wonder if it supports SVE extensions in arm, would make it super fast).

It demonstrate that the current algorithm of addBucketItem with multiple lock/unlock) is optimal compared to a single lock/unlock. goos: linux goarch: amd64 pkg: github.com/aquasecurity/tracee/pkg/bucketscache cpu: 12th Gen Intel(R) Core(TM) i7-12700H BenchmarkAddBucketItemCurrent-20 464 2586038 ns/op 2118 B/op 5 allocs/op BenchmarkAddBucketItemWithOneLock-20 122 9461730 ns/op 424 B/op 1 allocs/op

GetBucket was not thread safe since it was returning a reference to a bucket instead of its copy.

First call to getFileHash() for a given filename was returning an empty string. Context: aquasecurity#3745

Improve the computeFileHash function, by using minio/sha256-simd. Benchmarking indicates the current implementation is nearly three times faster than the old version (21,311 ns/op vs. 61,857 ns/op) while maintaining similar memory efficiency (33,040 B/op vs. 33,056 B/op) and the same number of memory allocations per operation (5 allocs/op). goos: linux goarch: amd64 pkg: github.com/aquasecurity/tracee/pkg/ebpf cpu: 12th Gen Intel(R) Core(TM) i7-12700H BenchmarkComputeFileHashOld-20 19257 61857 ns/op 33056 B/op 5 allocs/op BenchmarkComputeFileHashCurrent-20 56769 21311 ns/op 33040 B/op 5 allocs/op

The user can now choose between the following options: - none (default) - inode - dev-inode - digest-inode 'inode' option recalculates the file hash if the inode's creation time (ctime) differs, which can occur in different namespaces even for identical pathnames. 'dev-inode' option generally offers better performance compared to the pathname option, as it bypasses the need for recalculation by associating the creation time (ctime) with the device (dev) and inode pair. 'digest-inode' option is the most efficient, as it keys the hash to a pair consisting of the container image digest and inode. This approach, however, necessitates container enrichment. Code related to executable hashing was opportunistically refactored and moved to its own package. Co-authored-by: Geyslan Gregório <geyslan@gmail.com>

Previous implementation of sha256 hash computation used io.Copy. This method would allocate small buffers as needed when copying the file content into the hash digest. The result would be more allocations and more GC calls. Mitigate this by using one large buffer and clearing it per call. Benchmark results: goos: linux goarch: amd64 pkg: github.com/aquasecurity/tracee/pkg/exechash cpu: AMD EPYC 7571 BenchmarkComputeFileHashOld-8 18355 65371 ns/op 33056 B/op 5 allocs/op BenchmarkComputeFileHashCurrent-8 30561 41714 ns/op 272 B/op 4 allocs/op

yanivagman

I didn't do a full review, but left a couple of comments about the doc file

yanivagman · 2023-12-19T17:33:09Z

docs/docs/flags/output.1.md

@@ -48,7 +48,10 @@ Other options:
  - **exec-env**: When tracing execve/execveat, show the environment variables that were used for execution.
  - **relative-time**: Use relative timestamp instead of wall timestamp for events.
  - **exec-hash**: When tracing some file related events, show the file hash (sha256).
-    - Affected events: sched_process_exec, shared_object_loaded
+    - Affected events: *sched_process_exec*, *shared_object_loaded*
+    - **inode** option recalculates the file hash if the inode's creation time (ctime) differs, which can occur in different namespaces even for identical inode. This option is performant, but not recommended and should only be used if container enrichment can't be enabled for digest-inode, and if performance is preffered over correctness.


Ctime is inode change time, not creation time

yanivagman · 2023-12-19T17:36:32Z

docs/docs/flags/output.1.md

-    - Affected events: sched_process_exec, shared_object_loaded
+    - Affected events: *sched_process_exec*, *shared_object_loaded*
+    - **inode** option recalculates the file hash if the inode's creation time (ctime) differs, which can occur in different namespaces even for identical inode. This option is performant, but not recommended and should only be used if container enrichment can't be enabled for digest-inode, and if performance is preffered over correctness.
+    - **dev-inode** option generally offers better performance compared to the **inode** option, as it bypasses the need for recalculation by associating the creation time (ctime) with the device (dev) and inode pair. It's recommended if correctness is preffered over performance without container enrichment.


You say this option offers better performance, while on the end of this sentence it says that this option is more for correctness, and not performance. This is confusing

NDStrahilevitz added kind/feature milestone/v0.20.0 labels Dec 11, 2023

NDStrahilevitz requested review from geyslan, rafaeldtinoco and AlonZivony December 11, 2023 16:22

github-actions bot assigned NDStrahilevitz Dec 11, 2023

github-actions bot added area/ebpf kind/documentation area/testing area/UX area/kubernetes area/build area/flags labels Dec 11, 2023

NDStrahilevitz mentioned this pull request Dec 11, 2023

Improve performance of exec-hash #3746

Closed

NDStrahilevitz commented Dec 11, 2023

View reviewed changes

pkg/cmd/flags/output_test.go Outdated Show resolved Hide resolved

NDStrahilevitz force-pushed the nadav_exec_hash_perf branch from 6dade93 to c2ebbb2 Compare December 12, 2023 12:20

AlonZivony requested changes Dec 14, 2023

View reviewed changes

NDStrahilevitz force-pushed the nadav_exec_hash_perf branch from c2ebbb2 to 3231fef Compare December 18, 2023 14:24

NDStrahilevitz requested a review from AlonZivony December 18, 2023 14:24

AlonZivony requested changes Dec 18, 2023

View reviewed changes

docs/docs/flags/output.1.md Outdated Show resolved Hide resolved

pkg/cmd/flags/output.go Show resolved Hide resolved

rafaeldtinoco reviewed Dec 19, 2023

View reviewed changes

pkg/filehash/key.go Show resolved Hide resolved

rafaeldtinoco reviewed Dec 19, 2023

View reviewed changes

pkg/filehash/key.go Show resolved Hide resolved

rafaeldtinoco reviewed Dec 19, 2023

View reviewed changes

pkg/filehash/key.go Show resolved Hide resolved

rafaeldtinoco self-requested a review December 19, 2023 04:48

rafaeldtinoco approved these changes Dec 19, 2023

View reviewed changes

geyslan and others added 5 commits December 19, 2023 11:24

fix(bucketscache): GetBucket was not thread safe

06ff40a

GetBucket was not thread safe since it was returning a reference to a bucket instead of its copy.

fix(ebpf): fix getFileHash()

5bb47c4

First call to getFileHash() for a given filename was returning an empty string. Context: aquasecurity#3745

NDStrahilevitz force-pushed the nadav_exec_hash_perf branch from 3231fef to 9b56ded Compare December 19, 2023 11:36

NDStrahilevitz requested a review from AlonZivony December 19, 2023 13:56

NDStrahilevitz merged commit db152b1 into aquasecurity:main Dec 19, 2023
31 checks passed

yanivagman reviewed Dec 19, 2023

View reviewed changes

AlonZivony mentioned this pull request Jan 22, 2024

No default exec-hash option #3817

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve performance of exec-hash #3752

Improve performance of exec-hash #3752

NDStrahilevitz commented Dec 11, 2023 •

edited

Loading

AlonZivony left a comment

rafaeldtinoco left a comment

yanivagman left a comment

yanivagman Dec 19, 2023

yanivagman Dec 19, 2023

Improve performance of exec-hash #3752

Improve performance of exec-hash #3752

Conversation

NDStrahilevitz commented Dec 11, 2023 • edited Loading

1. Explain what the PR does

2. Explain how to test it

AlonZivony left a comment

Choose a reason for hiding this comment

rafaeldtinoco left a comment

Choose a reason for hiding this comment

yanivagman left a comment

Choose a reason for hiding this comment

yanivagman Dec 19, 2023

Choose a reason for hiding this comment

yanivagman Dec 19, 2023

Choose a reason for hiding this comment

NDStrahilevitz commented Dec 11, 2023 •

edited

Loading