8302783: Improve CRC32C intrinsic with crypto pmull on AArch64 #12624
👋 Welcome back yftsai! A progress list of the required criteria for merging this PR into |
Change looks good.
Thanks,
Volker
@yftsai This change now passes all automated pre-integration checks. ℹ️ This project also has non-automated pre-integration requirements. Please see the file CONTRIBUTING.md for details. After integration, the commit message for the final commit will be:
You can use pull request commands such as /summary, /contributor and /issue to adjust it as needed. At the time when this comment was updated there had been 181 new commits pushed to the
As there are no conflicts, your changes will automatically be rebased on top of these commits when integrating. If you prefer to avoid this automatic rebasing, please check the documentation for the /integrate command for further details. As you do not have Committer status in this project an existing Committer must agree to sponsor your change. Possible candidates are the reviewers of this PR (@simonis, @phohensee) but any other Committer may sponsor as well. ➡️ To flag this PR as ready for integration with the above commit message, type /integrate
The linux-x86 pre-submit test failure is caused by a test using -XX:+UseCompressedClassPointers, which is an invalid switch for 32-bit JVMs. The linux-cross-compile pre-submit test failure is a compile-time failure in src/hotspot/cpu/arm/interpreterRT_arm.cpp, a file that this patch does not touch.
Lgtm.
/sponsor |
Going to push as commit f3abc40.
Your commit was automatically rebased without conflicts. |
@phohensee @yftsai Pushed as commit f3abc40. 💡 You may see a message that your pull request was closed with unmerged commits. This can be safely ignored. |
This change adds a pmull-based CRC32C intrinsic that outperforms the existing crc32c-instruction-based intrinsic on Neoverse V1. The benchmark shows a 10 - 99% improvement. The improvement comes from the execution throughput of pmull/pmull2 increasing from 1 on Neoverse N1 to 4 on Neoverse V1 while the latency remains 2, whereas the throughput of the CRC32C instructions did not change.
The pmull-based CRC32C intrinsic is enabled by the existing option UseCryptoPmullForCRC32, which also enables the pmull-based CRC32 intrinsic. The option requires the crc32c instructions, eor3 from SHA3, and 64-bit pmull/pmull2 from the Cryptographic Extension.
With this change, there will be only two different CRC32C intrinsics, crc32c and pmull, while there are four CRC32 intrinsics.
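For readers who want to see the Java-level entry point that this intrinsic accelerates, here is a minimal sketch using the standard java.util.zip.CRC32C class. The class name Crc32cDemo and the helper checksum are hypothetical names chosen for illustration; they are not part of this patch. The 4096-byte buffer is an arbitrary size picked to sit above the 384-byte threshold mentioned in the benchmark notes.

```java
import java.nio.charset.StandardCharsets;
import java.util.zip.CRC32C;

// Hypothetical demo class, not part of the patch.
public class Crc32cDemo {

    // Computes the CRC32C of the whole array via the public JDK API.
    // On a JVM where the intrinsic applies, the JIT may replace the
    // update loop with the platform-specific stub.
    static long checksum(byte[] data) {
        CRC32C crc = new CRC32C();
        crc.update(data, 0, data.length);
        return crc.getValue();
    }

    public static void main(String[] args) {
        // Standard CRC-32C check input; the well-known check value is e3069283.
        byte[] check = "123456789".getBytes(StandardCharsets.US_ASCII);
        System.out.printf("%08x%n", checksum(check)); // prints e3069283

        // A larger buffer (size is an assumption for illustration).
        byte[] large = new byte[4096];
        for (int i = 0; i < large.length; i++) {
            large[i] = (byte) i;
        }

        // Feeding the data in two pieces must give the same result
        // as a single update over the whole array.
        CRC32C chunked = new CRC32C();
        chunked.update(large, 0, 1000);
        chunked.update(large, 1000, large.length - 1000);
        System.out.println(chunked.getValue() == checksum(large)); // prints true
    }
}
```

Assuming an AArch64 JDK build that contains this change and hardware with the required features, the pmull path could be requested with something like `java -XX:+UseCryptoPmullForCRC32 Crc32cDemo.java`. The checksum values are the same with or without the option; only the throughput is expected to differ.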
The following test has passed.
test/hotspot/jtreg/compiler/intrinsics/zip/TestCRC32C.java
The throughput reported by the micro benchmark is measured on an EC2 c7g instance. The optimization shows 10 - 99% improvement when the input is at least 384 bytes.
Baseline
Crypto pmull
Reviewing
Using git
Checkout this PR locally:
$ git fetch https://git.openjdk.org/jdk pull/12624/head:pull/12624
$ git checkout pull/12624
Update a local copy of the PR:
$ git checkout pull/12624
$ git pull https://git.openjdk.org/jdk pull/12624/head
Using Skara CLI tools
Checkout this PR locally:
$ git pr checkout 12624
View PR using the GUI difftool:
$ git pr show -t 12624
Using diff file
Download this PR as a diff file:
https://git.openjdk.org/jdk/pull/12624.diff