Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature request: aim-mlflow option to skip artifact downloads #5

Open
josiahbjorgaard opened this issue Jan 19, 2023 · 4 comments
Open

Comments

@josiahbjorgaard
Copy link

Could an option be added to the aim-mlflow sync operation to skip downloading artifacts?

@gorarakelyan
Copy link
Contributor

@josiahbjorgaard thanks for the suggestion! Could you please describe the motivation?

@josiahbjorgaard
Copy link
Author

For example, we have multiple GBs of artifacts attached to our MLFlow logs (model weights, png plots, data), mostly stored on S3 (not colocated with the MLFlow GUI). Automatically syncing it all causes me to run out of disk space. Many of the artifacts are intended for later usage, like for restarting training or pushing model weights to production, rather than for analysis.

@gorarakelyan
Copy link
Contributor

@josiahbjorgaard this makes sense. I think it would be even better to provide a more flexible interface to enable filtering artifacts rather than completely disabling syncing. Something like:

  • aimlflow sync --excluded-artifacts="*.png" (exclude artifacts based on name)
  • aimlflow sync --included-artifacts="*.png" (sync only matching artifacts)
    Thoughts?

@pezosanta
Copy link

+1 for this feature request

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants