[11.x] Replace MD5 with xxh128 in File::hasSameHash()#54690
Merged
Drop-in replacement. Much faster than MD5, and avoids its security issues.
Member
ping @GrahamCampbell
Contributor
Author
For the case at hand, I can't think of any regression. The hashes don't go outside the function.
Today I've learned this method actually exists. 👍
In `File::hasSameHash()`, replace MD5 with xxh128, which has been available since PHP 8.1.

xxh128 has the same 128-bit length, but is much faster and avoids the security issues associated with MD5. See also this article.
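To illustrate the idea, here is a minimal sketch of comparing two files by their xxh128 digests. It is not copied from the framework: `filesHaveSameHash()` is a hypothetical standalone helper, and it assumes PHP 8.1 or later so that `hash_file()` accepts the `xxh128` algorithm.

```php
<?php

// Hypothetical standalone helper for illustration; not the framework method itself.
function filesHaveSameHash(string $firstFile, string $secondFile): bool
{
    // hash_file() streams the file from disk and returns false if it cannot be read.
    $first = @hash_file('xxh128', $firstFile);

    if ($first === false) {
        return false;
    }

    $second = @hash_file('xxh128', $secondFile);

    // Both digests are 32 hex characters, which suits hash_equals().
    return $second !== false && hash_equals($first, $second);
}

// Usage with two temporary files.
$a = tempnam(sys_get_temp_dir(), 'hash');
$b = tempnam(sys_get_temp_dir(), 'hash');

file_put_contents($a, 'same contents');
file_put_contents($b, 'same contents');
var_dump(filesHaveSameHash($a, $b)); // bool(true)

file_put_contents($b, 'different contents');
var_dump(filesHaveSameHash($a, $b)); // bool(false)

unlink($a);
unlink($b);
```

Because the digests never leave the function, switching the algorithm changes nothing for callers, which is why the change works as a drop-in replacement.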
Quick benchmark for comparing 100 MB files:
Benchmark code
Results:
(The code using `file_get_contents()` is only for reference. Large files would exhaust memory, and `hash_equals()` (#49721) is designed for short, same-length parameters.)

Using xxh128 virtually removes the hashing overhead. 🚀
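For readers who want to try a comparison locally, the following is a rough timing sketch. It is not the benchmark used in this PR: `timeHashFile()` is a hypothetical helper, the 8 MB file size is an arbitrary assumption chosen to keep the run short, and timings will vary by machine. It assumes PHP 8.1 or later.

```php
<?php

// Hypothetical timing helper; returns the digest and the elapsed time in milliseconds.
function timeHashFile(string $algo, string $path): array
{
    $start = hrtime(true);               // monotonic clock, in nanoseconds
    $hash = hash_file($algo, $path);     // streams the file rather than loading it into memory
    $elapsedMs = (hrtime(true) - $start) / 1e6;

    return [$hash, $elapsedMs];
}

// Assumed test input: 8 MB of random bytes in a temporary file.
$path = tempnam(sys_get_temp_dir(), 'bench');
file_put_contents($path, random_bytes(8 * 1024 * 1024));

foreach (['md5', 'xxh128'] as $algo) {
    [$hash, $ms] = timeHashFile($algo, $path);
    printf("%-7s %s  %.2f ms\n", $algo, $hash, $ms);
}

unlink($path);
```

Both algorithms produce a 32-character hex digest, so the printed lines differ only in the digest value and the elapsed time.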
xxh128 is already used in some places in Laravel; for instance, see #45371 and discussion #46074.