Add CI/CD for Aarch64 pip wheels using GitHub Actions #56025

mseth10 · 2022-05-09T12:09:40Z

Added GitHub workflow to build, test and publish Aarch64 pip wheels for Python versions 3.7, 3.8, 3.9 and 3.10
Workflow jobs run on a self-hosted runner on an AWS Graviton instance and triggers for:

Pull requests against r2.9 branch
New commits to r2.9 branch
New tags starting with v2.

Following tests currently fail and are skipped:

tensorflow/python:nn_grad_test_cpu
tensorflow/python/eager:forwardprop_test_cpu
tensorflow/python/framework:node_file_writer_test_cpu
tensorflow/python/grappler:memory_optimizer_test
tensorflow/python/keras/engine:training_arrays_test
tensorflow/python/kernel_tests/linalg:linear_operator_householder_test_cpu
tensorflow/python/kernel_tests/linalg:linear_operator_inversion_test_cpu
tensorflow/python/kernel_tests/linalg:linear_operator_block_diag_test_cpu
tensorflow/python/kernel_tests/linalg:linear_operator_block_lower_triangular_test_cpu
tensorflow/python/kernel_tests/linalg:linear_operator_kronecker_test_cpu
tensorflow/python/kernel_tests/math_ops:batch_matmul_op_test_cpu
tensorflow/python/kernel_tests/nn_ops:conv_ops_test_cpu
tensorflow/python/kernel_tests/nn_ops:conv2d_backprop_filter_grad_test_cpu
tensorflow/python/kernel_tests/nn_ops:conv3d_backprop_filter_v2_grad_test_cpu
tensorflow/python/kernel_tests/nn_ops:atrous_conv2d_test_cpu
tensorflow/python/ops/parallel_for:math_test_cpu

google-cla · 2022-05-09T12:09:44Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

For more information, open the CLA check for this pull request.

mihaimaruseac

Thank you very much

mihaimaruseac · 2022-05-09T15:38:08Z

Can you sign CLA please?

mihaimaruseac · 2022-05-09T16:14:36Z

Also, I think this should be opened on master?

elfringham · 2022-05-09T16:14:59Z

I think the use of no_aarch64 tag to exclude tests that fail when TF_ENABLE_ONEDNN_OPT=1 is not the best thing to do. The flag is currently used to exclude tests that can never work on AARCH64. This would add a group of tests that we hope are only temporarily broken and so would have their no_aarch64 tag removed at some later point. It also stops those tests being run at all on AARCH64 even for the Eigen build, where they currently pass. I think some other mechanism would be better.

elfringham

This is not the right way to temporarily exclude these tests. It hides the exclusion away in multiple places and also prevents these tests from being run for the default Eigen build where they are passing.

mseth10 · 2022-05-11T01:15:15Z

This is not the right way to temporarily exclude these tests. It hides the exclusion away in multiple places and also prevents these tests from being run for the default Eigen build where they are passing.

Hi @elfringham , I agree with your concern. We can introduce a new tag like no_aarch64_onednn_acl and use it to exclude these tests. Do you think that's an acceptable solution?

nSircombe · 2022-05-11T05:53:53Z

Could they not simply be listed explicitly in TF_TEST_TARGETS with the other test exclusions?

elfringham · 2022-05-11T08:53:35Z

tensorflow/tools/ci_build/rel/ubuntu/cpu_arm64_pip.sh

+# Export optional variables for running pip.sh
+export TF_BUILD_FLAGS="--config=mkl_aarch64 ${extra_args}"
+export TF_TEST_FLAGS="--config=mkl_aarch64 ${extra_args} --test_env=TF_ENABLE_ONEDNN_OPTS=1 --test_env=TF2_BEHAVIOR=1 --define=no_tensorflow_py_deps=true --test_lang_filters=py --verbose_failures=true --test_keep_going"
+export TF_TEST_TARGETS="${DEFAULT_BAZEL_TARGETS} -//tensorflow/lite/... -//tensorflow/compiler/mlir/lite/tests:const-fold.mlir.test -//tensorflow/compiler/mlir/lite/tests:prepare-tf.mlir.test"


I agree with @nSircombe , this would be the place to put exclusions for tests that fail. Using tags is not the best solution as it hides that information in multiple places which makes it harder to check the status and make updates.

I agree with @nSircombe , this would be the place to put exclusions for tests that fail. Using tags is not the best solution as it hides that information in multiple places which makes it harder to check the status and make updates.

Thanks for the suggestion @nSircombe @elfringham . Addressed.

This reverts commit 1c333fa.

mseth10 · 2022-05-12T04:14:48Z

Hi @mihaimaruseac , I have signed the Google Individual CLA.

Also, I think this should be opened on master?

I have only tested this change on r2.9 branch as of now. I was thinking after the release, we can cherry-pick it into master.
Are you suggesting we do it the other way around - create this PR on master branch and when it gets merged, cherry-pick into r2.9 branch?

mihaimaruseac · 2022-05-12T23:52:14Z

Yes please. We never cherrypick from release branch back into master.

mseth10 · 2022-05-13T10:34:48Z

Yes please. We never cherrypick from release branch back into master.

@mihaimaruseac I have opened another PR on master branch #56097 . Will close this PR and will cherry-pick into r2.9 later.

mseth10 requested a review from rohan100jain as a code owner May 9, 2022 12:09

gbaned assigned mihaimaruseac May 9, 2022

gbaned added this to Assigned Reviewer in PR Queue via automation May 9, 2022

mihaimaruseac previously approved these changes May 9, 2022

View reviewed changes

PR Queue automation moved this from Assigned Reviewer to Approved by Reviewer May 9, 2022

elfringham suggested changes May 10, 2022

View reviewed changes

elfringham reviewed May 11, 2022

View reviewed changes

mseth10 added 6 commits May 11, 2022 20:59

add manylinux2014 aarch64 wheel scripts

d2cfc24

skip failing tests

6d8c523

fix with_the_same_user file for centos os

bdf6b4f

clean repo

d86b8af

Revert "skip failing tests"

99fa341

This reverts commit 1c333fa.

add excluded tests as env var

306974c

mseth10 dismissed mihaimaruseac’s stale review via 306974c May 12, 2022 03:51

mseth10 force-pushed the r2.9 branch from 9eff349 to 306974c Compare May 12, 2022 03:51

PR Queue automation moved this from Approved by Reviewer to Reviewer Requested Changes May 12, 2022

mseth10 added 2 commits May 12, 2022 07:33

sudo clean repo

8222cd5

get wheel from container to local

ae251f3

mseth10 closed this May 13, 2022

PR Queue automation moved this from Reviewer Requested Changes to Closed/Rejected May 13, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add CI/CD for Aarch64 pip wheels using GitHub Actions #56025

Add CI/CD for Aarch64 pip wheels using GitHub Actions #56025

mseth10 commented May 9, 2022 •

edited

google-cla bot commented May 9, 2022

mihaimaruseac left a comment

mihaimaruseac commented May 9, 2022

mihaimaruseac commented May 9, 2022

elfringham commented May 9, 2022

elfringham left a comment

mseth10 commented May 11, 2022

nSircombe commented May 11, 2022

elfringham May 11, 2022

mseth10 May 12, 2022

mseth10 commented May 12, 2022

mihaimaruseac commented May 12, 2022

mseth10 commented May 13, 2022

Add CI/CD for Aarch64 pip wheels using GitHub Actions #56025

Add CI/CD for Aarch64 pip wheels using GitHub Actions #56025

Conversation

mseth10 commented May 9, 2022 • edited

google-cla bot commented May 9, 2022

mihaimaruseac left a comment

Choose a reason for hiding this comment

mihaimaruseac commented May 9, 2022

mihaimaruseac commented May 9, 2022

elfringham commented May 9, 2022

elfringham left a comment

Choose a reason for hiding this comment

mseth10 commented May 11, 2022

nSircombe commented May 11, 2022

elfringham May 11, 2022

Choose a reason for hiding this comment

mseth10 May 12, 2022

Choose a reason for hiding this comment

mseth10 commented May 12, 2022

mihaimaruseac commented May 12, 2022

mseth10 commented May 13, 2022

mseth10 commented May 9, 2022 •

edited