Toolset update: VS 2022 17.10 Preview 4, non-spot VMs, Azure Pipelines overhaul #4594

StephanTLavavej · 2024-04-16T00:13:55Z

📜 Changelog

Infrastructure improvements:
- Overhauled our Azure Pipelines machinery, introducing "Early Build" stages to quickly find compiler errors when building the STL.
- Updated dependencies.
  - Updated build compiler to VS 2022 17.10 Preview 4.

📖 Non-Changelog Summary

This is a response to the increasing eviction rate of spot VMs that we've recently been suffering. It changes our pool to regular (non-spot) VMs, which will result in far more reliable PR/CI builds. Regular VMs are ~2.5x more expensive for us than spot VMs, although this will be inherently mitigated by fewer reruns being necessary.

To further mitigate this cost increase, I've dramatically overhauled our Azure Pipelines machinery. Now, in parallel with the Code Format stage, we'll run 4 Early Build stages, verifying that the STL itself builds for each architecture. This also extracts building the STL with /analyze (which is significantly slower than an ordinary build), and building the benchmarks. If any of these builds fail, that indicates a severe problem that needs to be fixed before we run any of the tests. If the Code Format and Early Build stages pass, then we "fan in" and run the x64 build/tests, followed by "fan out" to non-x64 build/tests as usual. The full build/test stages (across 32 shards) are somewhat faster due to the /analyze and benchmark builds being lifted out.

The /analyze build imposes approximately 1 minute of additional cost, so overall we're saving ~32 minutes of compute across the full build/test stages. The Early Builds take about 3 minutes per architecture, so we're paying ~12 minutes there, so there should be a ~20 minute net savings for a fully successful run. In the event that an Early Build finds a problem, we save tons of compute (either 8 full shards for an x64 failure, or the whole 32 shards for a non-x64 failure). Note that failed Code Format validation now spends more compute, but I felt that this was an acceptable tradeoff. The overall critical path is mostly unchanged (we save ~2 minutes by avoiding two /analyze costs, but the ~3 minutes for an Early Build takes longer than the ~1.5 minutes for Code Format).

I had to massively overhaul the machinery to make this restructuring feasible. This overhaul became a top-level improvement by itself (at least for maintainers) - by extracting repetition and eliminating unnecessary logic, I saved ~40 net lines even while adding all of the Early Build logic. The result should be far more maintainable and extensible.

⚙️ Azure Pipelines Commits

Behavioral simplification: Drop testParallelism; lit uses all CPUs by default.
- See: https://llvm.org/docs/CommandGuide/lit.html#cmdoption-lit-j
Change amd64 to x64; VsDevCmd.bat handles them as synonyms.
In cross-build.yml, rename vsDevCmdArch to targetArch.
- This matches how it's being passed to cmake-configure-build.yml and run-tests.yml.
In native-build-test.yml, split vsDevCmdArch into hostArch and targetArch.
- This matches what's being passed to cmake-configure-build.yml and run-tests.yml.
Explicitly pass hostArch: x64 to cross-build.yml.
Unify away targetPlatform; it was identical to targetArch.
Consistently order hostArch before targetArch.
Cosmetic change: Unify 'Build Tests' and 'Run Tests' into 'Build and Run Tests'.
- Spending logic to vary the displayName is unnecessary.
Use long-form ctest options for clarity.
- See: https://cmake.org/cmake/help/latest/manual/ctest.1.html
Rename testSelection to ctestOptions.
- This clarifies who the ultimate consumer is.
Drop unnecessary comments.
In cross-build.yml, change hardcoded ctestOptions to a parameter with a default argument.
- This is more complicated, but it's a step towards unification.
In asan-pipeline.yml, don't bother centralizing '--tests-regex stlasan' into a variable.
- IMO this extra step was making things harder to follow.
In cross-build.yml, take asanBuild as a parameter with a default argument.
- Again, this is a step towards unification. This makes a clarity improvement possible - now we can remove the asanBuild default argument from cmake-configure-build.yml. IMO having defaults at different "levels" was very confusing.
Change how cmakeAdditionalFlags is defaulted.
- Clarity improvement: Don't default it at the cmake-configure-build.yml level.
- Both native-build-test.yml and cross-build.yml now take it as a parameter, defaulting to empty, and pass it down.
- Finally, azure-pipelines.yml passes cmakeAdditionalFlags: '-DTESTS_BUILD_ONLY=ON' when performing cross builds.
- This is the final step towards unification.
Unify (identical) native-build-test.yml and cross-build.yml into build-and-test.yml.
- Saying "and" in the name clarifies that we're doing two separate things.
Consistently sort parameters after hostArch, targetArch.
Change the buildBenchmarks default from true to false.
Consistently capitalize the PowerShell@2 task.
Drop unnecessary pwsh: false.
- This is the default on Windows. See: https://learn.microsoft.com/en-us/azure/devops/pipelines/tasks/reference/powershell-v2?view=azure-pipelines
Simplify task: CmdLine@2 to the script: shortcut.
- We use this everywhere else. Docs:
- https://learn.microsoft.com/en-us/azure/devops/pipelines/tasks/reference/cmd-line-v2?view=azure-pipelines#remarks
- https://learn.microsoft.com/en-us/azure/devops/pipelines/yaml-schema/steps-script?view=azure-pipelines
Add parameter analyzeBuild to control STL_USE_ANALYZE.
- Originally hardcoded to ON, now defaulted to true. (These are synonyms to CMake.)
Change the analyzeBuild default from true to false.
Behavior change: Don't enable analyzeBuild in asan-pipeline.yml.
Simplify checkout-sources.yml by dropping unnecessary parameters.
- These parameters (llvmSHAVar etc.) were storing the names of Azure Pipelines variables to create (llvmSHA), confusingly sharing the same names as PowerShell variables ($llvmSHA etc.). This layer of indirection was unnecessary. We can take the names llvmSHAVar etc. and directly use them as the names of Azure Pipelines variables.
In checkout-sources.yml, drop unnecessary semicolons when setting Azure Pipelines variables.
Extract git submodule status into a loop.
Perform regex replacement on a separate line.
- This simplifies the syntax and avoids packing too much logic into a single line. It also allows us to put the regex directly next to its replacement, clarifying how the capture group is connected.
Scripts don't need to cd $(Build.SourcesDirectory) as that's their default workingDirectory.
- Several of our scripts were already assuming this; let's be consistent. See docs:
- https://learn.microsoft.com/en-us/azure/devops/pipelines/tasks/reference/cmd-line-v2?view=azure-pipelines#inputs
- https://learn.microsoft.com/en-us/azure/devops/pipelines/tasks/reference/powershell-v2?view=azure-pipelines#inputs
Add doTesting to control whether we checkout LLVM and build/run tests.
- (I changed my mind and altered this to skipTesting below, but rewriting history was too much effort. Sorry for the extra complexity.)
Major behavioral change: Add 'Early Build' stages.
- Alongside 'Code Format', we begin by fanning out an 'Early Build' for each architecture. This verifies that the STL builds, and also covers /analyze and the benchmarks (the latter is still skipped for plain ARM). It skips all testing, and saves more time by skipping the LLVM checkout. Then we fan in for the normal x64 build and test. This results in several improvements:
  - Any build breaks (whether ordinary, /analyze, or benchmark) are caught early and cheaply. Previously, we'd spend at least 8 shards, or the full 32 for a non-x64 break.
  - This reduces the length of the critical path (or so I thought, but not as much as I'd hoped). We previously paid two /analyze and benchmark builds on the critical path (x64, then non-x64). Now we perform one /analyze and benchmark build simultaneously with 'Code Format' which takes almost as long, so it's nearly free.
  - By removing work from the 32 full test shards (/analyze was expensive although the benchmarks are currently cheap), we're reducing the surface area for eviction, and reducing the amount of work that needs to be rerun after evictions.
- If 'Code Format' fails, this performs a bit more work than before, but less than 1 full test shard.
Cosmetic change: Display 'Build and Test' for ARM/ARM64.
- We're still compiling the tests, just not running them.
Behavior change: Make buildBenchmarks control whether we checkout google-benchmark.
git sparse-checkout now defaults to --cone.
- See: https://github.blog/2022-06-27-highlights-from-git-2-37/
Make 'Setup TMP Directory' more consistent.
- Move one occurrence from build-and-test.yml into checkout-sources.yml, so it consistently appears before checkout: self.
- Change the other occurrence in format-validation.yml to also use a single-line if exist command.
Behavior change: Set fetchDepth: 1 and fetchTags: false.
- These settings observably improve our checkout behavior, especially now that I added tags for every release.
- See: https://learn.microsoft.com/en-us/azure/devops/pipelines/yaml-schema/steps-checkout?view=azure-pipelines
- Note: Despite the comments in the docs, both of our pipelines are exhibiting the "old" behavior by default. Being explicit about the "new" behavior is harmless if the default ever actually changes.
Replace general cmakeAdditionalFlags with specific testsBuildOnly.
- And we don't need to use this when configuring the benchmarks.
Changed my mind, use skipTesting with a default of false.
Split checkout-sources.yml into checkout-self.yml and checkout-submodules.yml.
- Now format-validation.yml can use checkout-self.yml, avoiding duplication.
Behavioral simplification: Avoid 3x duplication with checkout-submodule.yml.
- This extracts 3 copy-pasted cmd scripts into 1 PowerShell script.
- We no longer need to use Azure Pipelines variables to communicate the SHA.
- Each remote is now named submodule-upstream for uniformity, as the name doesn't matter.
- I've performed a few additional simplifications that I believe are proper, but we'll need to watch out for problems when agents reuse repos:
  - After top-level self-checkout, each submodule directory should exist, so we shouldn't have to force-create it.
  - If the .git directory doesn't exist in the submodule, we shouldn't need to obliterate all other files there. We're going to perform a checkout and clean that should restore us to a known good state.
  - If the .git directory already exists, running git init again is harmless by design.
  - Instead of the "run git remote get-url and look for failure" technique, we can check the output of git remote to make git remote add idempotent.

🧰 Toolset Update Commits

Use PowerShell 7.4.2.
Spot D32ds_v5 => Regular D32ads_v5
New-AzVMConfig defaults to -MaxPrice -1.
- See: https://learn.microsoft.com/en-us/powershell/module/az.compute/new-azvmconfig?view=azps-11.5.0#-maxprice
- Let's drop this since we won't need it if we ever go back to Spot VMs, and I'm worried it might conflict with Regular VMs.
New pool.
VS 2022 17.10 Preview 4.

…by default.

This matches how it's being passed to cmake-configure-build.yml and run-tests.yml.

…argetArch`. This matches what's being passed to cmake-configure-build.yml and run-tests.yml.

…Run Tests'. Spending logic to vary the `displayName` is unnecessary, especially because we already distinguish 'Build and Test x64' vs. 'Build ARM64' at the top level.

This clarifies who the ultimate consumer is.

…th a default argument. This is more complicated, but it's a step towards unification.

…an'` into a variable. IMO this extra step was making things harder to follow.

…gument. Again, this is a step towards unification. This makes a clarity improvement possible - now we can remove the `asanBuild` default argument from cmake-configure-build.yml. IMO having defaults at different "levels" was very confusing.

Clarity improvement: Don't default it at the cmake-configure-build.yml level. Both native-build-test.yml and cross-build.yml now take it as a parameter, defaulting to empty, and pass it down. Finally, azure-pipelines.yml passes `cmakeAdditionalFlags: '-DTESTS_BUILD_ONLY=ON'` when performing cross builds. This is the final step towards unification.

…d-and-test.yml. Saying "and" in the name clarifies that we're doing two separate things.

This is the default on Windows. See: https://learn.microsoft.com/en-us/azure/devops/pipelines/tasks/reference/powershell-v2?view=azure-pipelines

We use this everywhere else. Docs: https://learn.microsoft.com/en-us/azure/devops/pipelines/tasks/reference/cmd-line-v2?view=azure-pipelines#remarks https://learn.microsoft.com/en-us/azure/devops/pipelines/yaml-schema/steps-script?view=azure-pipelines

Originally hardcoded to `ON`, now defaulted to `true`. (These are synonyms to CMake.)

These parameters (`llvmSHAVar` etc.) were storing the names of Azure Pipelines variables to create (`llvmSHA`), confusingly sharing the same names as PowerShell variables (`$llvmSHA` etc.). This layer of indirection was unnecessary. We can take the names `llvmSHAVar` etc. and directly use them as the names of Azure Pipelines variables.

…re Pipelines variables.

This is phrased positively to avoid negation. I'm intentionally not providing a default. Defaulting to `false` would be confusing, because the name build-and-test.yml suggests that testing will be performed. Defaulting to `true` would be inconsistent with the other booleans all defaulting to `false`.

Alongside 'Code Format', we begin by fanning out an 'Early Build' for each architecture. This verifies that the STL builds, and also covers `/analyze` and the benchmarks (the latter is still skipped for plain ARM). It skips all testing, and saves more time by skipping the LLVM checkout. Then we fan in for the normal x64 build and test. This results in several improvements: * Any build breaks (whether ordinary, `/analyze`, or benchmark) are caught early and cheaply. Previously, we'd spend at least 8 shards, or the full 32 for a non-x64 break. * This reduces the length of the critical path. We previously paid two `/analyze` and benchmark builds on the critical path (x64, then non-x64). Now we perform one `/analyze` and benchmark build simultaneously with 'Code Format' which takes almost as long, so it's nearly free. * By removing work from the 32 full test shards (`/analyze` was expensive although the benchmarks are currently cheap), we're reducing the surface area for eviction, and reducing the amount of work that needs to be rerun after evictions. If 'Code Format' fails, this performs a bit more work than before, but less than 1 full test shard.

We're still compiling the tests, just not running them.

…oogle-benchmark.

See: https://github.blog/2022-06-27-highlights-from-git-2-37/

Move one occurrence from build-and-test.yml into checkout-sources.yml, so it consistently appears before `checkout: self`. Change the other occurrence in format-validation.yml to also use a single-line `if exist` command.

These settings observably improve our checkout behavior.

And we don't need to use this when configuring the benchmarks.

…ules.yml. Now format-validation.yml can use checkout-self.yml, avoiding duplication.

…le.yml. This extracts 3 copy-pasted cmd scripts into 1 PowerShell script. We no longer need to use Azure Pipelines variables to communicate the SHA. Each remote is now named submodule-upstream for uniformity, as the name doesn't matter. I've performed a few additional simplifications that I believe are proper, but we'll need to watch out for problems when agents reuse repos: * After top-level self-checkout, each submodule directory should exist, so we shouldn't have to force-create it. * If the .git directory doesn't exist in the submodule, we shouldn't need to obliterate all other files there. We're going to perform a checkout and clean that should restore us to a known good state. * If the .git directory already exists, running `git init` again is harmless by design. * Instead of the "run `git remote get-url` and look for failure" technique, we can check the output of `git remote` to make `git remote add` idempotent.

See: https://learn.microsoft.com/en-us/powershell/module/az.compute/new-azvmconfig?view=azps-11.5.0#-maxprice Let's drop this since we won't need it if we ever go back to Spot VMs, and I'm worried it might conflict with Regular VMs.

StephanTLavavej · 2024-04-17T22:56:26Z

/azp run STL-ASan-CI

CaseyCarter

This will make it much less painful to work on the pipelines in the future. Thanks!

CaseyCarter · 2024-04-18T03:46:55Z

/azp run STL-ASan-CI

STL-ASan-CI passed.

StephanTLavavej · 2024-04-18T18:24:27Z

I'm mirroring this to the MSVC-internal repo - please notify me if any further changes are pushed.

StephanTLavavej added infrastructure Related to repository automation uncharted Excluded from the Status Chart labels Apr 16, 2024

StephanTLavavej requested a review from a team as a code owner April 16, 2024 00:13

StephanTLavavej added 18 commits April 15, 2024 22:02

Behavioral simplification: Drop testParallelism; lit uses all CPUs …

acaca2d

…by default.

Change amd64 to x64; VsDevCmd.bat handles them as synonyms.

6d44ede

In cross-build.yml, rename vsDevCmdArch to targetArch.

17a8870

This matches how it's being passed to cmake-configure-build.yml and run-tests.yml.

In native-build-test.yml, split vsDevCmdArch into hostArch and `t…

a49383e

…argetArch`. This matches what's being passed to cmake-configure-build.yml and run-tests.yml.

Explicitly pass hostArch: x64 to cross-build.yml.

20f8645

Unify away targetPlatform; it was identical to targetArch.

6d0860a

Consistently order hostArch before targetArch.

6da00b1

Cosmetic change: Unify 'Build Tests' and 'Run Tests' into 'Build and …

46ce7ad

…Run Tests'. Spending logic to vary the `displayName` is unnecessary, especially because we already distinguish 'Build and Test x64' vs. 'Build ARM64' at the top level.

Use long-form ctest options for clarity.

4881ae7

Rename testSelection to ctestOptions.

0952e22

This clarifies who the ultimate consumer is.

Drop unnecessary comments.

bf6ebf2

In cross-build.yml, change hardcoded ctestOptions to a parameter wi…

7fe629d

…th a default argument. This is more complicated, but it's a step towards unification.

In asan-pipeline.yml, don't bother centralizing `'--tests-regex stlas…

693e9eb

…an'` into a variable. IMO this extra step was making things harder to follow.

Unify (identical) native-build-test.yml and cross-build.yml into buil…

5b92074

…d-and-test.yml. Saying "and" in the name clarifies that we're doing two separate things.

Consistently sort parameters after hostArch, targetArch.

e0af61c

Change the buildBenchmarks default from true to false.

e49a3e2

StephanTLavavej force-pushed the azure branch from cedd615 to e49a3e2 Compare April 16, 2024 05:03

StephanTLavavej added 8 commits April 15, 2024 23:45

Consistently capitalize the PowerShell@2 task.

5606102

Drop unnecessary pwsh: false.

5c5ce7a

This is the default on Windows. See: https://learn.microsoft.com/en-us/azure/devops/pipelines/tasks/reference/powershell-v2?view=azure-pipelines

Add parameter analyzeBuild to control STL_USE_ANALYZE.

e41ddca

Originally hardcoded to `ON`, now defaulted to `true`. (These are synonyms to CMake.)

Change the analyzeBuild default from true to false.

92bcfb3

Behavior change: Don't enable analyzeBuild in asan-pipeline.yml.

d94d760

In checkout-sources.yml, drop unnecessary semicolons when setting Azu…

f490278

…re Pipelines variables.

StephanTLavavej added 9 commits April 16, 2024 15:36

Cosmetic change: Display 'Build and Test' for ARM/ARM64.

8f93132

We're still compiling the tests, just not running them.

Behavior change: Make buildBenchmarks control whether we checkout g…

3ed351e

…oogle-benchmark.

git sparse-checkout now defaults to --cone.

11952e7

See: https://github.blog/2022-06-27-highlights-from-git-2-37/

Make 'Setup TMP Directory' more consistent.

3cd74c2

Move one occurrence from build-and-test.yml into checkout-sources.yml, so it consistently appears before `checkout: self`. Change the other occurrence in format-validation.yml to also use a single-line `if exist` command.

Behavior change: Set fetchDepth: 1 and fetchTags: false.

78162cb

These settings observably improve our checkout behavior.

Replace general cmakeAdditionalFlags with specific testsBuildOnly.

7bb30a4

And we don't need to use this when configuring the benchmarks.

Changed my mind, use skipTesting with a default of false.

af29fdf

StephanTLavavej force-pushed the azure branch from 5496ca0 to af29fdf Compare April 17, 2024 04:56

StephanTLavavej added 2 commits April 16, 2024 22:51

Split checkout-sources.yml into checkout-self.yml and checkout-submod…

6439b1a

…ules.yml. Now format-validation.yml can use checkout-self.yml, avoiding duplication.

StephanTLavavej force-pushed the azure branch from a677d4a to a9370ed Compare April 17, 2024 08:56

StephanTLavavej added 5 commits April 17, 2024 12:55

[toolset update] Use PowerShell 7.4.2.

ac058aa

[toolset update] Spot D32ds_v5 => Regular D32ads_v5

607d712

[toolset update] New pool.

dec7828

[toolset update] VS 2022 17.10 Preview 4.

c59fe88

This comment was marked as resolved.

Sign in to view

StephanTLavavej changed the title ~~Azure Pipelines refactoring WIP~~ Toolset update: VS 2022 17.10 Preview 4, non-spot VMs, Azure Pipelines overhaul Apr 17, 2024

StephanTLavavej removed the uncharted Excluded from the Status Chart label Apr 17, 2024

StephanTLavavej assigned CaseyCarter Apr 18, 2024

CaseyCarter approved these changes Apr 18, 2024

View reviewed changes

CaseyCarter removed their assignment Apr 18, 2024

StephanTLavavej self-assigned this Apr 18, 2024

StephanTLavavej merged commit a35fb2a into microsoft:main Apr 19, 2024
39 checks passed

StephanTLavavej deleted the azure branch April 19, 2024 00:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Toolset update: VS 2022 17.10 Preview 4, non-spot VMs, Azure Pipelines overhaul #4594

Toolset update: VS 2022 17.10 Preview 4, non-spot VMs, Azure Pipelines overhaul #4594

StephanTLavavej commented Apr 16, 2024 •

edited

StephanTLavavej commented Apr 17, 2024

This comment was marked as resolved.

CaseyCarter left a comment

CaseyCarter commented Apr 18, 2024

StephanTLavavej commented Apr 18, 2024

Toolset update: VS 2022 17.10 Preview 4, non-spot VMs, Azure Pipelines overhaul #4594

Toolset update: VS 2022 17.10 Preview 4, non-spot VMs, Azure Pipelines overhaul #4594

Conversation

StephanTLavavej commented Apr 16, 2024 • edited

📜 Changelog

📖 Non-Changelog Summary

⚙️ Azure Pipelines Commits

🧰 Toolset Update Commits

StephanTLavavej commented Apr 17, 2024

This comment was marked as resolved.

CaseyCarter left a comment

Choose a reason for hiding this comment

CaseyCarter commented Apr 18, 2024

StephanTLavavej commented Apr 18, 2024

StephanTLavavej commented Apr 16, 2024 •

edited