Treat failure to resolve a toolchain as if it were an incompatible target for skipping in ... expansion. #12419

aiuto · 2020-11-04T19:38:30Z

Incompatible target skipping will skip targets that can not be built for the specified destination platform.
There is also a need to be able to skip targets which can not be built on the execution platform. For example, because a test might depend on an OS native tool which makes no sense on any other platform, or a compiler is only licensed to a few users in an organization.

It would be convenient if failure to resolve a toolchain (usually because one was not registered) as a skippable target.

TBD: Bring in more context from the mail thread which sprouted this idea.

cc: @gregestren @AustinSchuh @philsc @katre

philsc · 2020-11-12T03:26:49Z

I'm admittedly more of a toolchains newbie than I'd like to be. I'll do some reading to better understand what it means to have missing toolchain etc.

At a high level I can definitely see that being a desirable feature for the reasons you listed.

Quoting @aiuto from the e-mail chain:

Proposal:
For ... resolution, failure to find a toolchain would be skip just
like target_compatible_with. You would, however, get the warning
that the toolchain was not found. If you try to build/test a specific
target, however, it still fails with an error.

Pro:

easy to understand.

easier for the rule author to get the toolchain specs correct

no overloading target_compatible_with with failure to have the
tools at the executable side.

Con:

if you had tests which required their own unique toolchain you
may skip the tests on some platforms.

aiuto · 2020-11-12T03:49:19Z

The ' easier for the rule author to get the toolchain specs correct' requires some explanation.

In bazelbuild/rules_pkg#254, I am wrapping a linux only tool (rpmbuild) in a toolchain.
In order to get ... skipping to work on platforms where rpmbuild can not be found, I had to create a no-op toolchain which resolved, but contained an attribute saying it was invalid. I exposed the validity as a constraint which we could select() on in a target_compatible_with clause.

It is not a huge amount of work, but you have to see the "trick" to doing it that way, To me, that means it is too hard.

AustinSchuh · 2020-11-12T18:56:23Z

There's something to the idea to explore for sure. Here is another use cases that exists today:

Bazel can't guarentee that the android SDK/NDK is installed when it is being built. It has workspace entries which the user un-comments to enable the toolchains. If the user doesn't un-comment those toolchains, there is a build failure.
-> This proposal would skip the android targets if there is no compiler available for android.

Honestly, that sounds pretty helpful. Likely that should have the same set of constraints that Phil added to incompatible target skipping. If the user requests it through //..., no error and print skipped. If the user explicitly requests an android target in that case, error out and complain.

aiuto · 2020-11-12T21:00:25Z

That's a perfect example. We see this all the time with mobile apps and a shared code base.

the iOS developers don't have any android SDKs installed
the Android developers don't have xcode installed
they both want to be able to say bazel test ... and see the tests for their platform.

philsc · 2020-11-23T19:53:22Z

While not helpful for resolving this issue, I did mention this issue in the Filtering incompatible targets in Starlark proposal. My naive thought is that skipping targets with a missing toolchain can also be expressed via a Starlark-accessible provider. That becomes relevant in that proposal.

gregestren · 2020-12-28T22:38:26Z

Would this work as a current workaround?

Define a default "empty" toolchain that's compatible with everything.
For rule implementations using that toolchain:

if toolchain="empty toolchain":
   return [IncompatiblePlatformProvider()]

philsc · 2021-01-05T00:35:24Z

@gregestren , the IncompatiblePlatformProvider class is currently not exposed to Starlark so that approach unfortunately doesn't work at the moment. However, I think that's a really neat, simple idea!

github-actions · 2023-04-29T01:32:16Z

Thank you for contributing to the Bazel repository! This issue has been marked as stale since it has not had any activity in the last 2+ years. It will be closed in the next 14 days unless any other activity occurs or one of the following labels is added: "not stale", "awaiting-bazeler". Please reach out to the triage team (@bazelbuild/triage) if you think this issue is still relevant or you are interested in getting the issue resolved.

aiuto · 2023-04-29T01:48:52Z

Not really stale. We should probably get this effect somehow.

philsc · 2023-05-03T05:48:20Z

@gregestren , do you have any cool ideas on a good way to accomplish this? I assumed it couldn't be too bad. I think I almost got there with this branch: master...philsc:bazel:unreviewed/philsc/fix-12419

But the test encounters this error:

$ bazel test -c opt //src/test/shell/integration:target_compatible_with_test --test_output=streamed --test_filter=test_incompatible_with_missing_toolchain
...
FATAL: bazel crashed due to an internal error. Printing stack trace:
java.lang.IllegalStateException: Unexpected exception: dep Dependency{label=//target_skipping:compiler_flag, configuration=1882c65bed561e30d863f6ecf00a6c3357d15ac54eca64e00aa2a71ff790ba77, aspects=AspectCollection{[]}, transitionKeys=[], executionPlatformLabel=null} had null value, even though there were no values missing in the initial fetch. That means it had an unexpected exception type (not ConfiguredValueCreationException)
        at com.google.devtools.build.lib.bugreport.BugReport.sendBugReport(BugReport.java:183)
        at com.google.devtools.build.lib.bugreport.BugReport.logUnexpected(BugReport.java:154)
        at com.google.devtools.build.lib.skyframe.PrerequisiteProducer.resolveConfiguredTargetDependencies(PrerequisiteProducer.java:949)
        at com.google.devtools.build.lib.skyframe.PrerequisiteProducer.computeDependencies(PrerequisiteProducer.java:740)
        at com.google.devtools.build.lib.skyframe.PrerequisiteProducer.evaluate(PrerequisiteProducer.java:349)
        at com.google.devtools.build.lib.skyframe.ConfiguredTargetFunction.compute(ConfiguredTargetFunction.java:203)
        at com.google.devtools.build.skyframe.ParallelEvaluator.bubbleErrorUp(ParallelEvaluator.java:422)
        at com.google.devtools.build.skyframe.ParallelEvaluator.waitForCompletionAndConstructResult(ParallelEvaluator.java:211)
        at com.google.devtools.build.skyframe.ParallelEvaluator.doMutatingEvaluation(ParallelEvaluator.java:177)
        at com.google.devtools.build.skyframe.ParallelEvaluator.eval(ParallelEvaluator.java:672)
        at com.google.devtools.build.skyframe.InMemoryMemoizingEvaluator.evaluate(InMemoryMemoizingEvaluator.java:177)
        at com.google.devtools.build.lib.skyframe.SkyframeExecutor.configureTargets(SkyframeExecutor.java:2276)
        at com.google.devtools.build.lib.skyframe.SkyframeBuildView.configureTargets(SkyframeBuildView.java:343)
        at com.google.devtools.build.lib.analysis.BuildView.update(BuildView.java:440)
        at com.google.devtools.build.lib.buildtool.AnalysisPhaseRunner.runAnalysisPhase(AnalysisPhaseRunner.java:242)
        at com.google.devtools.build.lib.buildtool.AnalysisPhaseRunner.execute(AnalysisPhaseRunner.java:140)
        at com.google.devtools.build.lib.buildtool.BuildTool.buildTargets(BuildTool.java:182)
        at com.google.devtools.build.lib.buildtool.BuildTool.processRequest(BuildTool.java:529)
        at com.google.devtools.build.lib.buildtool.BuildTool.processRequest(BuildTool.java:497)
        at com.google.devtools.build.lib.runtime.commands.BuildCommand.exec(BuildCommand.java:103)
        at com.google.devtools.build.lib.runtime.BlazeCommandDispatcher.execExclusively(BlazeCommandDispatcher.java:625)
        at com.google.devtools.build.lib.runtime.BlazeCommandDispatcher.exec(BlazeCommandDispatcher.java:240)
        at com.google.devtools.build.lib.server.GrpcServerImpl.executeCommand(GrpcServerImpl.java:550)
        at com.google.devtools.build.lib.server.GrpcServerImpl.lambda$run$1(GrpcServerImpl.java:614)
        at io.grpc.Context$1.run(Context.java:566)
        at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
        at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
        at java.base/java.lang.Thread.run(Unknown Source)
------------------------------------------------------------------------
test_incompatible_with_missing_toolchain FAILED: Bazel build failed unexpectedly..

Something is unhappy about me trying to propagate the ToolchainException up higher. It's looking for a ConfiguredValueCreationException instead.

Anyway, if you have any ideas, please let me know :)

gregestren · 2023-05-03T23:04:28Z

I'll need to refresh my memory on this. I'll schedule time for me and @katre and @aiuto to review the issue again.

katre · 2023-05-05T15:47:44Z

We chatted about this: the underlying idea that targets which fail toolchain resolution should be marked incompatible instead of failing the entire build makes sense. It's unclear to me now if that should always be true (I'd really prefer to not add Yet Another Flag), but it makes sense as a baseline.

@philsc Your change looks reasonable at a glance. You need to either wrap the ToolchainException in a ConfiguredValueCreationException (see an example) or handle it directly and not re-throw anything (possibly by returning the incompatible configured target).

I haven't done a deep review of the code, let me know when you have a PR ready.

philsc · 2023-05-16T17:35:40Z

Sounds good. Thanks @katre . I'm going to try a few things.

gregestren · 2023-06-22T19:46:30Z

Debugging more, I'm still unsure what's triggering that error. Two followup thoughts:

Note the expressed failing target is //target_skipping:compiler_flag. What happens if you modify the test to build that directly?
Does the error still occur if you catch the ToolchainException as your code already is and comment out the code that returns the incompatible provider? i.e. catch the exception and re-throw it just as the pre-existing code does, would that still trigger this error?

fmeum · 2023-06-27T10:53:39Z

Since the planned behavior is another situation in which an error (toolchain resolution failed) is effectively silenced (target is skipped), I would prefer to have this configurable.

@katre I formulated a concept for a similar setting that doesn't involve yet another flag here: #18707 (comment). I would be interested in hearing your thoughts on this.

philsc · 2023-07-03T03:30:06Z

I believe that whatever solution manifests for #18707 should solve the same concern for this, correct @fmeum ?

fmeum · 2023-07-03T05:46:51Z

@philsc Yes, I think so. The would ideally also be configurable on a per-repo level.

aiuto added P2 We'll consider working on this in future. (Assignee optional) team-Configurability Issues for Configurability team labels Nov 4, 2020

philwo added the type: feature request label Nov 10, 2020

philsc mentioned this issue Mar 7, 2021

Optional toolchains #3601

Closed

github-actions bot added the stale Issues or PRs that are stale (no activity for 30 days) label Apr 29, 2023

sgowroji removed the stale Issues or PRs that are stale (no activity for 30 days) label Apr 29, 2023

aiuto added P3 We're not considering working on this, but happy to review a PR. (No assignee) and removed P2 We'll consider working on this in future. (Assignee optional) labels May 5, 2023

fmeum mentioned this issue Jun 27, 2023

Indirect incompatible target skipping can have highly non-local silent effects #18707

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Treat failure to resolve a toolchain as if it were an incompatible target for skipping in ... expansion. #12419

Treat failure to resolve a toolchain as if it were an incompatible target for skipping in ... expansion. #12419

aiuto commented Nov 4, 2020

philsc commented Nov 12, 2020

aiuto commented Nov 12, 2020

AustinSchuh commented Nov 12, 2020

aiuto commented Nov 12, 2020

philsc commented Nov 23, 2020

gregestren commented Dec 28, 2020

philsc commented Jan 5, 2021

github-actions bot commented Apr 29, 2023

aiuto commented Apr 29, 2023

philsc commented May 3, 2023

gregestren commented May 3, 2023

katre commented May 5, 2023

philsc commented May 16, 2023

gregestren commented Jun 22, 2023

fmeum commented Jun 27, 2023

philsc commented Jul 3, 2023

fmeum commented Jul 3, 2023

Treat failure to resolve a toolchain as if it were an incompatible target for skipping in ... expansion. #12419

Treat failure to resolve a toolchain as if it were an incompatible target for skipping in ... expansion. #12419

Comments

aiuto commented Nov 4, 2020

philsc commented Nov 12, 2020

aiuto commented Nov 12, 2020

AustinSchuh commented Nov 12, 2020

aiuto commented Nov 12, 2020

philsc commented Nov 23, 2020

gregestren commented Dec 28, 2020

philsc commented Jan 5, 2021

github-actions bot commented Apr 29, 2023

aiuto commented Apr 29, 2023

philsc commented May 3, 2023

gregestren commented May 3, 2023

katre commented May 5, 2023

philsc commented May 16, 2023

gregestren commented Jun 22, 2023

fmeum commented Jun 27, 2023

philsc commented Jul 3, 2023

fmeum commented Jul 3, 2023