Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GCP Housekeeping jobs? #617

Open
MattWellie opened this issue Feb 19, 2024 · 2 comments
Open

GCP Housekeeping jobs? #617

MattWellie opened this issue Feb 19, 2024 · 2 comments
Labels
core Changes to the cpg-workflows api

Comments

@MattWellie
Copy link
Contributor

MattWellie commented Feb 19, 2024

We have a number of jobs which nominate a temporary data directory, and write potentially huge amounts of data to them (e.g. AnnotateCohort which generates multiple checkpoints, or GATK-SV in general which is a storage-hungry beast).

I would like to consider follow-on jobs which would run if the main job completes successfully, clearing the temporary storage directory which was used.

We'd need to poke the numbers, and see if this is worth the effort.

@vivbak
Copy link
Contributor

vivbak commented Apr 5, 2024

We'd need to poke the numbers, and see if this is worth the effort.

I agree with this. I'm scared of deleting things in the pipeline :)

Tag this under: Investigate?

@vivbak
Copy link
Contributor

vivbak commented Apr 5, 2024

Also tag this under cost optimisation

@vivbak vivbak added the core Changes to the cpg-workflows api label Apr 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
core Changes to the cpg-workflows api
Projects
None yet
Development

No branches or pull requests

2 participants