Skip to content

Conversation

@coeuvre
Copy link
Contributor

@coeuvre coeuvre commented Jul 14, 2020

Bazel is migrating its RBE worker pool from n1 machines to e2. We created a new dedicated worker pool with n1 machines to support nested virtualization. This PR updates BRE config to target that dedicated worker pool.

@coeuvre coeuvre marked this pull request as draft July 14, 2020 02:28
@coeuvre coeuvre force-pushed the rbe-migration branch 3 times, most recently from e19a9c8 to 52f0697 Compare July 14, 2020 03:08
@coeuvre
Copy link
Contributor Author

coeuvre commented Jul 14, 2020

Test failed on this line: OSError: [Errno 2] No such file or directory.

Would you please help confirm whether this error is related with this PR?

@coeuvre coeuvre marked this pull request as ready for review July 14, 2020 03:34
@philwo
Copy link
Contributor

philwo commented Jul 16, 2020

My guess is that the error happens, because the command no longer runs inside the correct container, thus the tool it is trying to run is missing. But I'm not sure yet, why this is the case... looking into it.

@philwo
Copy link
Contributor

philwo commented Jul 16, 2020

I tried in a separate PR to just bump the bazel-toolchains version to 3.4.0 or 3.3.2 and not even that worked (although with a very different error):

They all fail with this error:

undeclared inclusion(s) in rule '@com_google_protobuf//:protobuf_lite':
--
  | this rule is missing dependency declarations for the following files included by 'external/com_google_protobuf/src/google/protobuf/extension_set.cc':
  | '/usr/local/lib/clang/10.0.0/include/stddef.h'
  | '/usr/local/lib/clang/10.0.0/include/stdarg.h'
    ...

This is super weird. :/

@coeuvre
Copy link
Contributor Author

coeuvre commented Aug 5, 2020

Anything I can help to move this PR forward?

@philwo
Copy link
Contributor

philwo commented Aug 6, 2020

@jin Could you help us here?

@nick51913
Copy link

nick51913 commented Aug 6, 2020 via email

@coeuvre
Copy link
Contributor Author

coeuvre commented Aug 24, 2020

@jin Friendly ping :)

@codingjeremy codingjeremy changed the base branch from master to main September 29, 2020 17:55
@philwo
Copy link
Contributor

philwo commented Feb 19, 2021

We’re going ahead with the infrastructure change next week. If something breaks for this project - this PR contains the fix and we’re also happy to help.

@jin
Copy link
Collaborator

jin commented Feb 26, 2021

Just reran the CI pipeline and it looks like all of the tests are failing:

Ubuntu, failing in test step of RBE:

(07:27:09) ERROR: /var/lib/buildkite-agent/.cache/bazel/_bazel_buildkite-agent/ec321eb2cc2d0f8f91b676b6d4c66c29/external/remote_java_tools_linux/java_tools/zlib/BUILD.bazel:3:11: Compiling java_tools/zlib/inffast.c [for host] failed: undeclared inclusion(s) in rule '@remote_java_tools_linux//java_tools/zlib:zlib':
--
  | this rule is missing dependency declarations for the following files included by 'java_tools/zlib/inffast.c':
  | '/usr/local/lib/clang/10.0.0/include/stddef.h'
  | '/usr/local/lib/clang/10.0.0/include/__stddef_max_align_t.h'
  | '/usr/local/lib/clang/10.0.0/include/limits.h'
  | '/usr/local/lib/clang/10.0.0/include/stdarg.h'

I remember seeing this problem when the local and remote clang versions don't match, which results in this problem. (Host uses clang 10.0.0, but not the docker image)

macOS and Windows:


(07:23:09) ERROR: While resolving toolchains for target @androidsdk//:apksigner: No matching toolchains found for types @bazel_tools//tools/cpp:toolchain_type. Maybe --incompatible_use_cc_configure_from_rules_cc has been flipped and there is no default C++ toolchain added in the WORKSPACE file? See https://github.com/bazelbuild/bazel/issues/10134 for details and migration instructions.
--
  | (07:23:09) ERROR: Analysis of target '//ui/espresso/CustomMatcherSample:CustomMatcherSample' failed; build aborted: No matching toolchains found for types @bazel_tools//tools/cpp:toolchain_type. Maybe --incompatible_use_cc_configure_from_rules_cc has been flipped and there is no default C++ toolchain added in the WORKSPACE file? See https://github.com/bazelbuild/bazel/issues/10134 for details and migration instructions.
  | (07:23:09) INFO: Elapsed time: 5.530s

@philwo
Copy link
Contributor

philwo commented Feb 26, 2021

Possibly due to the outdated baseline of this PR and Bazel on the CI having been upgraded to an incompatible newer version in the meantime? 🤔

@coeuvre Could you rebase this so that we can see the current behavior?

@meteorcloudy
Copy link

meteorcloudy commented Mar 3, 2021

Ping, #367 is still waiting for this fix ;)

@coeuvre coeuvre force-pushed the rbe-migration branch 2 times, most recently from 30c02be to 1c2e4d8 Compare March 8, 2021 08:03
@coeuvre
Copy link
Contributor Author

coeuvre commented Mar 8, 2021

Rebased and upgraded bazel toolchain to latest version but still see the same error.

Keep the toolchain untouched works. https://buildkite.com/bazel/android-testing/builds/1183

@jin
Copy link
Collaborator

jin commented Mar 8, 2021

Looks like the change is minimal, just requires adding the machine type?

@coeuvre
Copy link
Contributor Author

coeuvre commented Mar 8, 2021

Yes.

@jin
Copy link
Collaborator

jin commented Mar 8, 2021

Thank you for looking into this!

@jin jin merged commit 2268cec into android:main Mar 8, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants