Auto push experiments at end of each stage #8843

Open
dberenbaum opened this issue Jan 19, 2023 · 3 comments
Labels
A: experiments (Related to dvc exp)
A: pipelines (Related to the pipelines feature)
feature request (Requesting a new feature)
p2-medium (Medium priority, should be done, but less important)

Comments

@dberenbaum
Contributor

Auto-pushing checkpoints was introduced to make it easier to recover long-running model training jobs in CI. For long-running processing jobs over multiple pipeline stages, the same behavior should be available at the end of each stage in the pipeline.

@dberenbaum added the labels feature request, p2-medium, A: experiments, and A: pipelines on Jan 19, 2023
@sukhovvl

We are very interested in this feature. We run long on-commit DVC pipelines in CI by means of dvc repro, and when they fail we currently have to rerun everything from scratch. It would be great if intermediate results could be downloaded from the remote DVC cache.
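Until something like this exists, the recovery scenario above can be approximated by wrapping dvc repro. The following is only a sketch under stated assumptions, not DVC's own behaviour: the stage names are hypothetical, the list is assumed to be given in dependency order, and `repro_with_push` and `run_checked` are illustrative helpers that are not part of DVC. It relies on `dvc repro --single-item` and on `dvc push` accepting a stage name as a target.

```python
import subprocess
from typing import Callable, List, Sequence

Runner = Callable[[Sequence[str]], None]


def run_checked(cmd: Sequence[str]) -> None:
    """Run a command and raise CalledProcessError if it exits non-zero."""
    subprocess.run(list(cmd), check=True)


def repro_with_push(stages: Sequence[str], run: Runner = run_checked) -> List[str]:
    """Reproduce stages one at a time and push after each success.

    Assumption: `stages` is already in dependency order; this sketch
    does not read dvc.yaml to work the order out.
    """
    pushed: List[str] = []
    for stage in stages:
        # --single-item reproduces only this stage, not its upstream stages.
        run(["dvc", "repro", "--single-item", stage])
        # Push this stage's outputs before moving on, so a later failure
        # does not lose them.
        run(["dvc", "push", stage])
        pushed.append(stage)
    return pushed


if __name__ == "__main__":
    # Hypothetical stage names, for illustration only.
    repro_with_push(["prepare", "featurize", "train"])
```

If a stage fails, the exception stops the loop after the earlier stages have already been pushed. A retry job would still need the updated dvc.lock in order to pull those outputs, which this sketch does not handle.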

@sukhovvl commented Jan 26, 2023

Furthermore, we have experimented a bit with cloud parallelisation of pipeline stages: a stage that looks like a normal stage to DVC but actually starts several cloud jobs. It would be great if those jobs could call dvc pull and get the intermediate results of the previous stages. Leaving aside for a moment the question of how to transfer the dvc.lock file to the remote workers and how to funnel the stage results back, it feels like intermediate pushes would open up many workarounds for these cases. This may seem like a far-fetched scenario, but it may be another point in favour of this feature.
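On the worker side, the pull described above might look roughly like the sketch below. It assumes the worker already has a checkout containing the current dvc.lock (the open question set aside above) and that the earlier stages' outputs were pushed. `build_pull_command` and `pull_stage_outputs` are hypothetical helper names, as are the stage and remote names; only `dvc pull`, its `--remote` option, and stage names as targets are taken from DVC itself.

```python
import subprocess
from typing import List, Optional, Sequence


def build_pull_command(targets: Sequence[str], remote: Optional[str] = None) -> List[str]:
    """Build a `dvc pull` command limited to the given stages or paths."""
    cmd = ["dvc", "pull"]
    if remote is not None:
        # Pull from a specific remote instead of the default one.
        cmd += ["--remote", remote]
    cmd += list(targets)
    return cmd


def pull_stage_outputs(targets: Sequence[str], remote: Optional[str] = None) -> None:
    """Run the pull in the current directory, raising if dvc exits non-zero."""
    subprocess.run(build_pull_command(targets, remote), check=True)


if __name__ == "__main__":
    # Hypothetical: a cloud job for "train" fetches what "featurize" produced.
    pull_stage_outputs(["featurize"], remote="storage")
```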

@cateseale

+1 for this feature
