Make syncer not recreate datasets versions every time it's restarted #6

Closed
lukemarsden opened this issue Jun 17, 2021 · 0 comments · Fixed by #48
Labels
S O(days)

Comments

@lukemarsden
Contributor

lukemarsden commented Jun 17, 2021

Right now, every time you restart the syncer, it creates a new dataset version for every commit in every Pachyderm repo, even if one has already been created. We need to list the existing dataset versions at startup and avoid recreating the ones that already exist.

That said, the current behavior is actually kinda useful for debugging right now.
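
A rough sketch of the startup check described above, in Python for illustration only. Everything here is hypothetical: `Commit`, `FakeDatasetStore`, `list_versions`, `create_version`, and `sync` are made-up names, not the syncer's actual code or any real Pachyderm API. The store is an in-memory stand-in so the idea can be run on its own.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Commit:
    """Hypothetical commit reference: a repo name plus a commit ID."""
    repo: str
    commit_id: str


@dataclass
class FakeDatasetStore:
    """Hypothetical in-memory stand-in for wherever dataset versions live."""
    versions: set = field(default_factory=set)
    create_calls: int = 0

    def list_versions(self):
        # Return a copy so callers cannot mutate the store by accident.
        return set(self.versions)

    def create_version(self, commit):
        self.create_calls += 1
        self.versions.add((commit.repo, commit.commit_id))


def sync(commits, store):
    """Create a dataset version only for commits that do not have one yet."""
    existing = store.list_versions()  # listed once, at startup
    created = []
    for commit in commits:
        key = (commit.repo, commit.commit_id)
        if key in existing:
            continue  # already created on a previous run, skip it
        store.create_version(commit)
        existing.add(key)
        created.append(key)
    return created
```

Calling `sync` twice with the same commits and the same store simulates a restart: the first call creates one version per commit, and the second creates none. Keying on the (repo, commit ID) pair is an assumption; the real syncer may need a different identity for a dataset version.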

@lukemarsden lukemarsden added the S O(days) label Jun 17, 2021
@albscui albscui changed the title from "Make syncer not recreate datasets every time it's restarted" to "Make syncer not recreate datasets versions every time it's restarted" Jul 15, 2021