exp run: fix issue where duplicate workspace runs would incorrectly conflict #5611

pmrowla · 2021-03-12T06:46:15Z

❗ I have followed the Contributing to DVC checklist.
📖 If this PR requires documentation updates, I have created a separate PR (or issue, at least) in dvc.org and linked it here.

Thank you for the contribution - we'll try to review it as soon as possible. 🙏

Will fix #5567

pmrowla · 2021-03-12T06:49:14Z

dvc/repo/experiments/executor/base.py

+                if checkpoint:
+                    raise CheckpointExistsError(ref_info.name)
+                raise ExperimentExistsError(ref_info.name)
+            new_rev = orig_rev


If the new run generates a commit identical to an existing one, we should reuse the existing commit. This already happened for tempdir runs during git fetch, but not for workspace runs, where we commit directly into the main git/dvc workspace.

@pmrowla, is it possible to find out, before even running this, that the state is the same as before? (That would probably be problematic with non-deterministic stages, though.)

I think it's difficult because experiments also include git repo/workspace modifications, not just DVC dependencies. Two pipeline runs might be identical as far as DVC is concerned (they hash into matching stages with matching DVC-tracked deps/outs), yet there could be other changes in the repo that show up in git but not in DVC. So we also have to generate the final git commit and then check whether it actually conflicts/diffs with the previous run.

If there are git differences, we can't really tell which experiment should be preferred, so we error out and require running with -f/--force to overwrite the existing one.

And yeah, as you noted, I'm not sure we can rely on checkpoint stages being deterministic: they are persist outputs, and it's not guaranteed that the user's code will always generate an identical sequence of checkpoints.
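For readers following the thread, the decision described above can be sketched roughly as follows. This is a simplified illustration, not the code from this PR: `RefInfo`, `resolve_exp_ref`, the `refs` mapping, and the `trees` mapping (standing in for a real git diff between two commits) are all hypothetical, and the two exception classes are local stand-ins that only borrow the names shown in the diff.

```python
from dataclasses import dataclass
from typing import Dict, Optional


class ExperimentExistsError(Exception):
    """Local stand-in; only the name is taken from the diff above."""


class CheckpointExistsError(Exception):
    """Local stand-in; only the name is taken from the diff above."""


@dataclass(frozen=True)
class RefInfo:
    # Hypothetical, reduced to the one attribute the diff uses.
    name: str


def resolve_exp_ref(
    refs: Dict[str, str],
    trees: Dict[str, Dict[str, str]],
    ref_info: RefInfo,
    new_rev: str,
    *,
    force: bool = False,
    checkpoint: bool = False,
) -> str:
    """Return the revision the experiment ref should end up pointing to.

    `refs` maps experiment ref names to revisions. `trees` maps a revision
    to its file contents and is an assumption used here in place of
    diffing two git commits.
    """
    orig_rev: Optional[str] = refs.get(ref_info.name)
    if orig_rev is not None and not force:
        # A ref with this name already exists: only a real content
        # difference counts as a conflict.
        if trees[orig_rev] != trees[new_rev]:
            if checkpoint:
                raise CheckpointExistsError(ref_info.name)
            raise ExperimentExistsError(ref_info.name)
        # Identical content: reuse the existing commit.
        new_rev = orig_rev
    refs[ref_info.name] = new_rev
    return new_rev
```

In this sketch a duplicate run resolves to the existing revision, a run with different content raises unless `force=True`, and a forced run overwrites the ref with the new revision.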

skshetry

Looks good to me. I have made some minor comments inline.

pmrowla added 2 commits March 12, 2021 15:32

exp run: gracefully handle duplicate workspace runs

3306883

exp run: fix issue where errors could be logged twice

ec3e348

pmrowla added the bugfix (fixes bug) label Mar 12, 2021

pmrowla self-assigned this Mar 12, 2021

pmrowla requested a review from skshetry March 12, 2021 06:46

pmrowla commented Mar 12, 2021

skshetry reviewed Mar 12, 2021

dvc/repo/experiments/executor/base.py (outdated)

skshetry reviewed Mar 12, 2021

dvc/repo/experiments/executor/base.py (outdated)

skshetry approved these changes Mar 12, 2021

move ref conflict check into its own function

b55a2ab

pmrowla merged commit d365e6d into treeverse:master Mar 12, 2021

pmrowla deleted the 5567-workspace-duplicate branch March 12, 2021 11:13
