-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[aarch64] set the build flag to "-mcpu=generic" #1731
Conversation
The PR setting this flag originally is quite old now - the original intent was to make sure the build picked the right I suppose it make sense to be consistent with the other builds which (I believe) set a minimum (e.g. setting However, I'm not sure setting Looking at the PyTorch issue, is it necessary to patch the Cmake during the build? I thought passing Cmake
|
74f8878
to
220c3c7
Compare
Thanks everyone for the review. I have updated the PR to use "-mcpu=generic" and added a comment on how to override this. Please review and merge if it looks good. Thank you! |
Thanks for updating @snadampal. Nit-picking, but the comment does slightly muddle
|
This is equivalent to setting -mtune=generic -march=armv8-a. This ensures no implementation specific tuning, or architectural features beyond armv8-a are used,for portability across AArch64 systems.The DNNL_ARCH_OPT_FLAGS build option can be used to override these defaults to optimise for a specific cpu, or revision of the Armv8 architecture.
220c3c7
to
d129496
Compare
thanks @nSircombe , I have updated the comments. |
Looks good to me! |
Thank you for the contribution! |
Description
this change is required to support binary compatibility across all armv8 platforms.
Fixes # (github issue)
This fixes issue on PyTorch repo:
pytorch/pytorch#109312
Checklist
General
make test
andmake test_benchdnn_*
) pass locally for each commit?Performance improvements
New features
Bug fixes
RFC PR