
Never re-use check containers #3079

Open
vito opened this issue Jan 17, 2019 · 11 comments
Labels
domain/core Includes scope pertaining to core Concourse functionality (i.e. pipelines, workflows). domain/runtime Includes scope pertaining to how Concourse actually "runs" things (i.e. containers, scheduling). epic
Comments

@vito
Member

vito commented Jan 17, 2019

What challenge are you facing?

Check containers are re-used for up to an hour (by default) so that resources can re-use cached state that they fetched (i.e. so a fresh git clone doesn't have to run all the time).

We limit their re-use to an hour so that they get rebalanced across workers periodically. However, that means every hour forces a fresh git clone, which makes people want to extend the lifetime even further (#2988).

This also means that with a bunch of resources there will always be a bunch of containers across all the workers, leading to desperate measures like #3054 and, in general, a ton of questions, because people are almost always surprised to see so many check containers.

We're also super careful to lock and make sure only one ATC is using a container at a time.

And we're careful about how these containers are re-used (e.g. across teams).

What would make this better?

Let's not do that!

Can we instead create a one-time-use container every time we need to run a check, and use a volume to explicitly track the cached state between runs?

This volume could work like task caches - there would be one persisted on each worker that the check runs on. A copy of it would be mounted to the container, and that copy would become the new cache for subsequent runs.
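
To make that concrete, here's a minimal Go-flavoured sketch of what a one-shot check against a worker-local cache volume could look like. All of the interfaces and names below are hypothetical illustrations, not existing Concourse APIs:

```go
package check

// Hypothetical sketch only; none of these interfaces are existing Concourse APIs.

// Volume is a worker-local cache volume, analogous to a task cache.
type Volume interface {
	CreateCOWCopy() (Volume, error)  // copy-on-write child of this volume
	PromoteToCache(key string) error // make this copy the new cache for key
}

// Container is a throwaway container created just for this one check.
type Container interface {
	RunCheck(source map[string]interface{}, from map[string]string) ([]map[string]string, error)
	Destroy() error
}

// Worker can create one-off containers and find cache volumes on itself.
type Worker interface {
	FindOrCreateCheckCache(key string) (Volume, error)
	CreateOneOffContainer(rootfs string, cacheMount Volume) (Container, error)
}

// RunOneOffCheck creates a fresh container for every check, mounts a
// copy-on-write copy of the worker-local cache volume into it, and then
// promotes that copy to be the cache used by the next check on this worker.
func RunOneOffCheck(w Worker, cacheKey, rootfs string, source map[string]interface{}, from map[string]string) ([]map[string]string, error) {
	cache, err := w.FindOrCreateCheckCache(cacheKey)
	if err != nil {
		return nil, err
	}

	cow, err := cache.CreateCOWCopy()
	if err != nil {
		return nil, err
	}

	container, err := w.CreateOneOffContainer(rootfs, cow)
	if err != nil {
		return nil, err
	}
	defer container.Destroy() // happy path; GC has to cover the unhappy path

	versions, err := container.RunCheck(source, from)
	if err != nil {
		return nil, err
	}

	if err := cow.PromoteToCache(cacheKey); err != nil {
		return nil, err
	}

	return versions, nil
}
```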

Open questions:

  • How quickly can containers be created/deleted? There should be some way to make this fast, as the Diego and Garden teams benchmarked the hell out of it back in the day. But we should check and make sure, because this will result in a lot more creations and deletions.
  • How do we identify the GC/ownership of these containers now that they're one-off? We don't want them deleted while they're being used, but we do want them GC'd if there's some way to tell they're no longer in use.
  • How do we handle the lifecycle of the cache volumes?

Are you interested in implementing this yourself?

Yeah, actually, it sounds fun.

@vito vito added this to Icebox in Runtime via automation Jan 17, 2019
@jwntrs
Contributor

jwntrs commented Jan 17, 2019

I really like this idea. However, I think this might be a good opportunity to rethink the way we model volume uses, since this change will likely involve yet another foreign key on the volumes table.

At the moment, a volume (sometimes) belongs to a team, and defines its use through a foreign key reference (i.e. resource cache, task cache, etc.). I see volumes as a lower-level concept that shouldn't contain all this information. I think we should invert this relationship, and have the higher-level concepts track both the team (where applicable) and volume foreign keys. This would make adding new volume uses much lighter-weight, and let us query for the higher-level objects and join on the lower-level objects.
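
To make the inversion concrete, here's a rough Go-style sketch of the two shapes; the field names are illustrative only, not the actual db package types:

```go
package db

// Illustrative only; not the real Concourse db types.

// Today (roughly): the volume row knows what it's used for, so every new
// kind of use means yet another nullable foreign key column here.
type Volume struct {
	ID              int
	TeamID          *int // sometimes set
	ResourceCacheID *int
	TaskCacheID     *int
	// ...a check cache would add yet another column
}

// Proposed: volumes stay a dumb, low-level concept...
type LeanVolume struct {
	ID       int
	WorkerID int
}

// ...and the higher-level concepts own the team/volume references instead,
// so new uses only touch their own tables.
type TaskCache struct {
	ID       int
	TeamID   int
	VolumeID int
}

type CheckCache struct {
	ID       int
	VolumeID int
}
```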

@vito thoughts?

@vito
Member Author

vito commented Jan 18, 2019

@pivotal-jwinters I wonder if the idea I threw around in #3025 (comment) meshes well with that.

At least for now I would rather do this within the bounds of the current architecture, just because there seems to be a clear enough path. But it helps to discuss this stuff early on for potential post-5.x/6.x architecture changes!

@topherbullock
Member

Spiking on some wild wild thoughts around this:

  • Adding a field to the containers table to denote containers as "ignored" by the Container Collector, so that we can defer their lifecycle to the runtime ( via setting Garden container GraceTime )
  • Could we have a resource checking 'agent' container per worker and create "peas" ( container processes supplying their own rootfs (but sharing namespaces/cgroups with a sandbox container) ) for each check?
    • Does using 'peas' in Garden warm up any part of the creation of a container?
    • Are there other gains we could realize by sharing the container namespaces but running separate processes for checks??
    • Do peas support different bind-mounts?

@vito
Member Author

vito commented Jan 22, 2019

Adding this to the Core project as well, just because a lot of the concerns/gotchas regarding global resources would be greatly mitigated if we were to do this. Implementation-wise, there are probably more runtimey concerns though (especially regarding the cache volume lifecycle and container scheduling).

Maybe it can be divided into separate steps that can be done by each team?

Step 1 would be to eliminate check sessions entirely and just create a new check container every time. This would obviously be a huge regression for big git repos and we shouldn't ship with only this much implemented, but maybe we can start to see how this simplifies things. We would still need to figure out a new lifecycle for these containers but it might be pretty simple.

Step 2 would be to measure container creation time to see how this change may impact worker load, and then see what we can do to mitigate any additional slowness. This testing should be done with cheap resources like time so that we're not measuring the known regression that Step 1 introduces.

Step 3 would then be to figure out the volume caching lifecycle, and how to schedule containers to best leverage it. I think this probably takes more thinking.

Step 1 could probably be easily done by Core, and steps 2 and 3 are probably better left to Runtime.

@topherbullock wdyt?

@topherbullock
Member

topherbullock commented Jan 22, 2019 via email

@vito
Member Author

vito commented Feb 7, 2019

Did a bit more thinking on this with @clarafu and @pivotal-jwinters - we like the idea of creating a one-off container and not bothering with a complicated lifecycle, and allowing Garden's GraceTime to clean it up if we fail to .Destroy it in the happy path.
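
For reference, roughly what that could look like against the Garden client API; the address, handle, and grace period below are made up for illustration:

```go
package main

import (
	"log"
	"time"

	"code.cloudfoundry.org/garden"
	"code.cloudfoundry.org/garden/client"
	"code.cloudfoundry.org/garden/client/connection"
)

func main() {
	// Example address only; Concourse workers normally expose Garden on 7777.
	gdn := client.New(connection.New("tcp", "127.0.0.1:7777"))

	// Create the one-off check container with a grace time, so that Garden
	// itself reaps it if we never get around to calling Destroy.
	container, err := gdn.Create(garden.ContainerSpec{
		Handle:    "one-off-check", // hypothetical handle
		GraceTime: 5 * time.Minute, // arbitrary value for illustration
	})
	if err != nil {
		log.Fatal(err)
	}

	// ... run the check process inside the container ...

	// Happy path: destroy it ourselves; otherwise GraceTime kicks in.
	if err := gdn.Destroy(container.Handle()); err != nil {
		log.Printf("destroy failed (Garden will GC it after the grace time): %v", err)
	}
}
```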

One gotcha, however, is that the container also has an associated volume for its copy-on-write rootfs. BaggageClaim supports TTLs, but it won't keep the volume alive for you, as it has no idea that it's "in use". So we tried thinking of ways to avoid having to define the lifecycle for this volume, since it'd be just about as complicated as it is for containers.

One idea we had was to write a BaggageClaim image plugin for Guardian. Image plugins are executables that are configured in Guardian like so:

https://github.com/cloudfoundry/guardian/blob/6571dc2d8af81f200eef97bf7ea107437dbb9576/guardiancmd/command.go#L228-L236

We could ship Concourse with a BaggageClaim image plugin and auto-configure it when starting gdn via the binary. This way we can rely on Garden to destroy the image when the container goes away. I think this would actually be pretty easy, especially now that we have more control over how Guardian is configured and can auto-configure those flags. We could use this for all containers, and have one less thing to manage with GC.
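
As a sketch of that auto-configuration, the worker could pass the plugin to gdn via the image plugin flags from the linked command.go; the plugin path and extra arg below are hypothetical:

```go
package main

import (
	"log"
	"os"
	"os/exec"
)

func main() {
	// Hypothetical path to a BaggageClaim-backed image plugin shipped with Concourse.
	plugin := "/usr/local/concourse/bin/baggageclaim-image-plugin"

	// Guardian exposes image plugins as flags, so the worker can wire them
	// in when it spawns gdn.
	cmd := exec.Command("gdn", "server",
		"--image-plugin", plugin,
		"--image-plugin-extra-arg", "--volumes-dir=/worker-state/volumes", // illustrative extra arg
		"--privileged-image-plugin", plugin,
	)
	cmd.Stdout = os.Stdout
	cmd.Stderr = os.Stderr

	if err := cmd.Run(); err != nil {
		log.Fatal(err)
	}
}
```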

For phase 1 (Core spike), I think we can just leave the volumes around forever for now, or give them some arbitrary TTL.

@clarafu
Contributor

clarafu commented Feb 21, 2019

For the core spike, TODO list:

  • Write CreateCheckContainer, which will create a one-off container without saving it to the db and configure Garden's GraceTime (rough interface sketch after this list)
    • Pick worker with CheckContainerPlacementStrategy
    • Create the container's image through GetImage
    • Use image to create the container
    • Attach volumes to the container
  • Write Destroy check container
  • Add Create/Destroy check container and remove RCCS in Radar
    • Remove resource_config_check_session_id in containers table
    • Remove resource_config_check_sessions table
    • Remove RCCS code in ContainerOwner
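
A rough interface-level sketch of the CreateCheckContainer item above (hypothetical, not existing Concourse code):

```go
package worker

import (
	"context"
	"time"
)

// Hypothetical sketch of the TODO above, not existing Concourse code.

// ImageSpec describes the rootfs for the check container (resolved via GetImage).
type ImageSpec struct {
	ResourceType string
	Privileged   bool
}

// CheckContainer is a one-off Garden container that is never saved to the db.
type CheckContainer interface {
	Run(ctx context.Context, source map[string]interface{}, from map[string]string) ([]map[string]string, error)
	Destroy() error
}

// Pool picks a worker via the check container placement strategy and creates
// a throwaway container with Garden's GraceTime configured, so Garden can
// reap it even if Destroy is never called.
type Pool interface {
	CreateCheckContainer(
		ctx context.Context,
		image ImageSpec,
		graceTime time.Duration,
	) (CheckContainer, error)
}
```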

@clarafu
Contributor

clarafu commented Feb 27, 2019

The current approach to this issue:
Step 1. Runtime: write a new ephemeral check container creation and destroy method.

Step 2. Core: remove resource config check sessions and all of their associated records within the database and code base.

Step 3. Runtime: measure container creation time with the new changes of ephemeral check containers to see how this change may impact worker load in a real deployment, and then see what we can do to mitigate any additional slowness. This testing should be done with cheap resources like time so that we're not measuring the known regression that Step 1 introduces.

Step 4. Runtime: figure out the volume caching lifecycle, and how to schedule containers to best leverage it. This probably takes more thinking.

@vito
Member Author

vito commented Mar 4, 2019

A goal of this is to remove the lock around checking, though currently this also locks around saving the versions. It looks like that code is idempotent, but we should probably verify it by hand to make sure no funny business happens, since it's never been exercised otherwise.

@cirocosta
Member

Maybe related:

I was looking at the "locks held" metric when a particular pipeline (strabo) kicked off, and it's quite interesting what that looks like:

[screenshot, 2019-03-03: the "locks held" metric while the strabo pipeline ran]

@ddadlani
Contributor

Work on this has resumed in #3424

@ddadlani ddadlani removed this from Backlog in Runtime Jul 30, 2019
@vito vito added this to To do in Epic #3079 via automation Feb 18, 2020
@vito vito moved this from To do to End goals in Epic #3079 Feb 18, 2020
@vito vito added domain/core Includes scope pertaining to core Concourse functionality (i.e. pipelines, workflows). domain/runtime Includes scope pertaining to how Concourse actually "runs" things (i.e. containers, scheduling). epic and removed efficiency enhancement labels Feb 18, 2020
@vito vito moved this from End goals to Current iteration in Epic #3079 Feb 21, 2020
@vito vito added this to To do in Roadmap via automation May 8, 2020
@vito vito moved this from To do to Icebox in Roadmap May 8, 2020
@vito vito removed this from Current iteration in Epic #3079 May 8, 2020