Scan speed improvement #369

Merged
pwgit-create merged 4 commits into develop from scan_faster on May 23, 2026

Contributor

@pwgit-create pwgit-create commented May 23, 2026

This pull request introduces parallel hash computation to improve performance when scanning large files in the File Integrity Scanner. The main changes include integrating a new ParallelFileHashHandler, updating the hash computation logic to use parallel processing for large files, and ensuring proper initialization and shutdown of parallel resources during scan operations.

Parallel hash computation integration:

  • Added ParallelFileHashHandler to FileHashComputer, with logic to use it for large files and on out-of-memory errors, replacing the previous single-threaded handling.
  • Implemented initialization (initializeParallelHashing) and shutdown (shutdownParallelHashProcessor) methods for parallel hash resources in FileHashComputer.
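To illustrate the general idea of parallel hashing for a large file, here is a minimal sketch. It is not the actual ParallelFileHashHandler, whose internals are not shown in this PR. It assumes one common approach: split the file into fixed-size chunks, hash each chunk on a thread pool, then hash the concatenated chunk digests. The class name `ChunkedHashSketch`, the chunk size parameter, and the choice of SHA-256 are all assumptions made for illustration. Note that a digest built this way differs from a plain SHA-256 over the whole file.

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.channels.FileChannel;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

/** Hypothetical sketch of chunked parallel hashing; not the library's real handler. */
final class ChunkedHashSketch implements AutoCloseable {
    private final ExecutorService pool;
    private final int chunkSize;

    ChunkedHashSketch(int threads, int chunkSize) {
        this.pool = Executors.newFixedThreadPool(threads); // "initialization" step
        this.chunkSize = chunkSize;
    }

    /** Hashes each chunk in parallel, then hashes the chunk digests in file order. */
    String hash(Path file) throws IOException, InterruptedException,
            ExecutionException, NoSuchAlgorithmException {
        try (FileChannel channel = FileChannel.open(file, StandardOpenOption.READ)) {
            long size = channel.size();
            List<Future<byte[]>> parts = new ArrayList<>();
            for (long offset = 0; offset < size; offset += chunkSize) {
                final long start = offset;
                final int length = (int) Math.min(chunkSize, size - start);
                parts.add(pool.submit(() -> hashChunk(channel, start, length)));
            }
            MessageDigest combined = MessageDigest.getInstance("SHA-256");
            for (Future<byte[]> part : parts) {
                combined.update(part.get()); // list order keeps the result deterministic
            }
            return toHex(combined.digest());
        }
    }

    private static byte[] hashChunk(FileChannel channel, long start, int length)
            throws IOException, NoSuchAlgorithmException {
        ByteBuffer buffer = ByteBuffer.allocate(length);
        // Positional reads do not move the channel's position, so threads can share it.
        while (buffer.hasRemaining()) {
            int read = channel.read(buffer, start + buffer.position());
            if (read < 0) {
                break; // file became shorter than expected
            }
        }
        buffer.flip();
        MessageDigest digest = MessageDigest.getInstance("SHA-256");
        digest.update(buffer);
        return digest.digest();
    }

    private static String toHex(byte[] bytes) {
        StringBuilder sb = new StringBuilder(bytes.length * 2);
        for (byte b : bytes) {
            sb.append(String.format("%02x", b));
        }
        return sb.toString();
    }

    @Override
    public void close() {
        pool.shutdown(); // "shutdown" step: release the worker threads
    }
}
```

Because the chunk digests are combined in file order, the result does not depend on how many threads are used, only on the chunk size.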

Scan workflow updates:

  • Updated ScanServiceImpl to initialize parallel hashing before scans and shut down parallel hash resources after scans complete.
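The initialize-before, shut-down-after pattern described above can be sketched as follows. This is an assumption about the shape of the workflow, not the code in ScanServiceImpl: the interface `ParallelHashResources` and the helper `scan` are hypothetical names standing in for the PR's initializeParallelHashing and shutdownParallelHashProcessor methods. The point of the sketch is that a `try`/`finally` releases the resources even when the scan fails.

```java
import java.util.function.Supplier;

/** Hypothetical sketch of the scan lifecycle; names are illustrative only. */
final class ScanLifecycleSketch {

    /** Stand-in for the parallel hash resources managed by FileHashComputer. */
    interface ParallelHashResources {
        void initialize();
        void shutdown();
    }

    /** Runs the scan body between initialization and a guaranteed shutdown. */
    static <T> T scan(ParallelHashResources resources, Supplier<T> scanBody) {
        resources.initialize();
        try {
            return scanBody.get();
        } finally {
            // Runs on both normal completion and exceptions, so threads are not leaked.
            resources.shutdown();
        }
    }

    private ScanLifecycleSketch() {
    }
}
```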

Dependency and configuration updates:

  • Bumped algorithm-hash-extraction dependency version to 1.2.9 and updated project version to 1.9 in pom.xml.
  • Added a logger configuration for lib.pwss.hash.file_hash_handler.parallel in logback.xml to control logging from the new parallel handler.

pwgit-create and others added 3 commits May 9, 2026 00:52
Patch update for dependency comment fix
…h computation

* Upgraded the dependency algorithm-hash-extraction from version 1.2.8 to 1.2.9
* Added ParallelFileHashHandler for parallel hash computation of large files
* Updated FileHashComputer to use parallel processing when computing hashes
* Modified ScanServiceImpl to initialize and shut down parallel hash processors
* Added debugging log level for lib.pwss.hash.file_hash_handler.parallel package

This change improves performance by utilizing parallel processing for file hash computations, especially for
larger files.
* Updated project version from 1.8.5 to 1.9 in pom.xml
* Changed log level for lib.pwss.hash.file_hash_handler.parallel package from DEBUG to ERROR in logback.xml

This change prepares the project for a new release with improved logging configuration.
@pwgit-create pwgit-create marked this pull request as ready for review May 23, 2026 11:05
@pwgit-create pwgit-create requested a review from lilstiffy May 23, 2026 11:05
* Removed unused `import lib.pwss.hash.ParallelFileHash;` in FileHashComputer.java

This cleanup removes an unnecessary import to keep the codebase tidy and improve maintainability.
Collaborator

@lilstiffy lilstiffy left a comment

No blocking issues found, looks really solid to my eyes 👀
The >2x performance improvement makes this a super solid update.

@pwgit-create pwgit-create merged commit 125f715 into develop May 23, 2026
2 checks passed
@pwgit-create pwgit-create deleted the scan_faster branch May 23, 2026 19:43