fix: file fingerprinting is now ~33% faster #37802
During recent investigations, package fingerprinting was found to do a lot of duplicate work, mostly `stat`ting the same files over and over again. Removing that duplication, together with a number of minor tweaks, brings the fingerprinting time of one specific directory down from ~19s to ~13s (roughly a 30% improvement) while maintaining byte-for-byte compatibility with the previous implementation. A future change will investigate optimizations that are allowed to change the hash, to improve performance further.
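To illustrate the kind of duplicate work described above, here is a minimal sketch of memoizing `lstat` results during a directory walk. It is not the code in this PR: `StatCache` and `fingerprint` are hypothetical names, and the hashing scheme (relative path plus file contents, in sorted order) is an assumption chosen only to keep the example short.

```typescript
import * as crypto from 'crypto';
import * as fs from 'fs';
import * as path from 'path';

// Hypothetical helper: ensures each path is lstat'ed at most once.
class StatCache {
  private readonly cache = new Map<string, fs.Stats>();
  // Number of real lstat calls made (cache misses), exposed for illustration.
  public misses = 0;

  public lstat(p: string): fs.Stats {
    let st = this.cache.get(p);
    if (st === undefined) {
      st = fs.lstatSync(p);
      this.cache.set(p, st);
      this.misses++;
    }
    return st;
  }
}

// Illustrative fingerprint (an assumption, not the real algorithm):
// hashes relative paths and contents of regular files in sorted order.
function fingerprint(root: string, stats: StatCache): string {
  const hash = crypto.createHash('sha256');
  const walk = (dir: string): void => {
    for (const name of fs.readdirSync(dir).sort()) {
      const full = path.join(dir, name);
      // The same path may be looked up twice below, but the cache
      // guarantees only one underlying lstat call per path.
      if (stats.lstat(full).isDirectory()) {
        walk(full);
      } else if (stats.lstat(full).isFile()) {
        hash.update(path.relative(root, full));
        hash.update(fs.readFileSync(full));
      }
    }
  };
  walk(root);
  return hash.digest('hex');
}
```

Because the cache only changes how often the file system is queried, and not what is fed into the hash, a change of this shape leaves the resulting fingerprint unchanged.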
62836ba to 783cb62
✅ Updated pull request passes all PRLinter validations. Dismissing previous PRLinter review.
Thank you for contributing! Your pull request will be updated from main and then merged automatically (do not update manually, and be sure to allow changes to be pushed to your fork).
Merge Queue Status
This pull request spent 56 minutes 7 seconds in the queue, including 30 minutes 31 seconds running CI.
Comments on closed issues and PRs are hard for our team to see.
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache-2.0 license.