Enable read-only configuration cache #29467

mikejuyoon · 2024-06-07T17:35:17Z

Expected Behavior

There should be a setting to enable loading configuration cache but skip the store step at the end of the build, which adds potentially unnecessary time to the build.

Current Behavior (optional)

When configuration cache is enabled, both load and store steps are enabled with no way to opt out of either step

Context

We want to enable configuration-cache in CI, where majority of the builds will be consuming configuration cache that was produced from another CI job. Majority of builds won't need to store the cache at the end of the build as it won't be reused. However the time to store this cache can make the performance gain from cc negligible, or even worse, especially when theres a cache miss.

The text was updated successfully, but these errors were encountered:

mlopatkin · 2024-06-11T12:36:03Z

Can you tell us more about what you perceive as a source of the overhead? We know about sequential dependency resolution, and will address this soon. Is the I/O in general a problem? Do you expect the machine to have enough memory to temporarily hold the cached state if needed?

The store operation is an essential step to prepare for the execution phase, to allow parallel task execution, for example. It doesn't really happen at the end of the build. Our end goal is to get rid of non-configuration-cached execution, so falling back to it won't be a long-term solution.

An alternative solution could be to fail the build if the cached state cannot be reused to indicate that cache-priming build has to be re-run. Does this behavior fit your use case?

joshfriend · 2024-06-13T01:25:19Z

Can you tell us more about what you perceive as a source of the overhead?

At one point, we had enabled configuration cache in our CI as a way to validate that our build was compatible with CC when updating gradle, but we found that this came at a ~10% performance penalty. For larger builds we would sometimes observe the storing of configuration cache to take >1m.

We are rolling out configuration caching to CI builds where we produce the cache in the main branch, and PR builds will restore the cache from the nearest commit ancestor on main that has cache available. In some cases, a developer has made a CC invalidating change and the cache we restore is not reusable. In these cases we would like to basically continue as if CC were disabled and not incur the cost of storing the new configuration to the cache.

Do you expect the machine to have enough memory to temporarily hold the cached state if needed?

Generally yes, we had one or two CI jobs in the build where configuration cache had to remain disabled because it caused OOMs, but we have been able to hold the state in memory for everything else.

The store operation is an essential step to prepare for the execution phase, to allow parallel task execution, for example

We are able to run task execution in parallel with CC turned off, I don't understand why this is a requirement when CC is enabled. I think I am missing some bit of knowledge here that would help this requirement make sense.

An alternative solution could be to fail the build if the cached state cannot be reused to indicate that cache-priming build has to be re-run. Does this behavior fit your use case?

Potentially,. We would have to check if the time taken to run the initialization twice with different settings would be faster than writing the configuration cache and discarding it. That doesn't seem great in general from a usability standpoint though.

bamboo · 2024-06-17T20:02:09Z

Thanks, @joshfriend, we'll get back to the this issue soon, I just wanted to clarify one point:

We are able to run task execution in parallel with CC turned off, I don't understand why this is a requirement when CC is enabled.

By isolating tasks, CC can enable intra-project parallelism while --parallel only enables inter-project parallelism.

mikejuyoon added a:feature A new functionality to-triage labels Jun 7, 2024

mikejuyoon changed the title ~~Add setting to allow read-only configuration cache~~ Enable read-only configuration cache Jun 7, 2024

bamboo added the in:configuration-cache Configuration Caching label Jun 7, 2024

mlopatkin removed the to-triage label Jun 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable read-only configuration cache #29467

Enable read-only configuration cache #29467

mikejuyoon commented Jun 7, 2024

mlopatkin commented Jun 11, 2024

joshfriend commented Jun 13, 2024

bamboo commented Jun 17, 2024

Enable read-only configuration cache #29467

Enable read-only configuration cache #29467

Comments

mikejuyoon commented Jun 7, 2024

Expected Behavior

Current Behavior (optional)

Context

mlopatkin commented Jun 11, 2024

joshfriend commented Jun 13, 2024

bamboo commented Jun 17, 2024