add workflow to clean caches after PR gets closed #2450
Conversation
Do we need to manage this ourselves? Won't the cache manager do this itself as the old ones are no longer used?
Those links should give a good explanation: Personally I would even disable the caching of the Huggingface models and just redownload them every time, since the cache is meant more to avoid spending compute power on building artifacts over and over. But since it was asked more than once to cache those models, I thought it would be nice of us to delete unneeded caches early and thereby save GH some storage 😅 But after having tested this cache cleanup action successfully now, I would leave it in place regardless of whether we disable caching of the HF models or not (since I also cache the pip packages now, which are indeed not needed anymore after a PR gets merged) 🤓
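(The workflow file itself isn't quoted in this thread. A minimal sketch of such a cleanup job, following the pattern GitHub documents using the `actions/gh-actions-cache` CLI extension, could look like the following; the workflow name and step details are assumptions, not necessarily what this PR adds.)

```yaml
# Sketch only -- the real workflow in this PR may differ.
name: Cleanup caches of closed PRs

on:
  pull_request:
    types: [closed]

jobs:
  cleanup:
    runs-on: ubuntu-latest
    steps:
      - name: Delete caches scoped to the closed PR
        env:
          GH_TOKEN: ${{ secrets.GITHUB_TOKEN }}
          REPO: ${{ github.repository }}
          BRANCH: refs/pull/${{ github.event.pull_request.number }}/merge
        run: |
          gh extension install actions/gh-actions-cache

          # List all cache keys created for this PR's merge ref
          cacheKeys=$(gh actions-cache list -R "$REPO" -B "$BRANCH" | cut -f 1)

          # Don't fail the job if a key was already evicted
          set +e
          for key in $cacheKeys; do
            gh actions-cache delete "$key" -R "$REPO" -B "$BRANCH" --confirm
          done
```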
@lstein @ebr @keturn @psychedelicious What are your thoughts about caching the HF-Models? Leave enabled or remove them and save a lot of storage space?
My understanding of the GitHub Actions caching:
For our purposes, caching speeds up our tests only by avoiding a large download during test setup. So if I understand correctly, I'd say we create our caches on a base branch (main) so PRs can use those caches, then let GH manage clearing caches as needed. A sketch of that approach is below.
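(As a sketch of that approach; the paths and key names here are assumptions for illustration, not the repo's actual configuration. A cache step like this, run on main, produces an entry that PR branches can restore, since caches created on the base branch are visible to PRs, and `restore-keys` allows a partial match when no exact key exists.)

```yaml
# Assumed paths/keys for illustration; not the repo's actual workflow.
- name: Cache pip downloads
  uses: actions/cache@v3
  with:
    path: ~/.cache/pip
    key: pip-${{ runner.os }}-${{ hashFiles('**/requirements*.txt') }}
    # Fall back to the newest matching cache from main when no exact match exists
    restore-keys: |
      pip-${{ runner.os }}-
```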
🤷 While working on the diffusers branch, I had them enabled for a while, because I thought having cache locality for the runners would be the polite thing to do for stuff that would otherwise incur tens of gigabytes of upstream traffic. Then the runners would literally break at the end of the run while trying to put the cache together, because they didn't have enough disk space to hold all of that and build the archive zip of it at the end (effectively requiring the runner to have twice as much available filesystem space), so we disabled them. Then huggingface.co had a bad day, so someone said we needed to cache these things to provide a buffer against upstream server outages, so I think we added the caching back in again? Honestly, I've lost track at this point. Our integration tests seem too demanding for this low-tier GitHub infrastructure, and we'd probably be better served by focusing on containerization and obtaining access to higher-spec hosts.
I would say yes
It can also be the opposite: when the cache somehow has a hiccup, restoring can suddenly take up to 60 minutes (at which point the default timeout triggers).
Unfortunately it is not easy to make sure that the cache on main is always there, and as soon as it isn't, the runners for PRs start to create this 7GB cache over and over again. But as @keturn mentioned, it also happened that huggingface was down and no downloads were working at all; then a cache was pretty handy. So I guess there is no single answer to this, which is why I asked for some opinions 🙈 But well, then I will leave them as they are for now, but then this PR makes a lot of sense, since you know for sure that you won't need a cache for a closed PR anymore 😅
edit: Since it actually just happened again, here is an example of a cache which loads forever
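(One possible mitigation for the hanging-restore case, offered as a sketch: a step-level `timeout-minutes` on the cache step makes a stuck restore fail fast instead of running into the 60-minute default. The step name, cached path, and timeout value here are assumptions.)

```yaml
# Sketch: fail a stuck cache restore quickly instead of waiting ~60 minutes.
- name: Cache HF models
  uses: actions/cache@v3
  timeout-minutes: 10   # step-level timeout; assumed value
  with:
    path: ~/.cache/huggingface   # assumed cache location
    key: hf-models-${{ runner.os }}-${{ hashFiles('configs/models.yaml') }}
```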
Force-pushed from 0694758 to 229514a
Let's give this a try and see whether it helps or hurts in the long run.
Force-pushed from 6ce75bd to 8dfe8fd
Force-pushed from 8dfe8fd to 99cd598
This helps at least a bit to get rid of all those huge caches.