Skip to content

get/import: look for output in all remotes (not just default remote, which may not exist)? #3371

@jorgeorpinel

Description

@jorgeorpinel

Extracted from #3369 (comment)

This would be useful for dataset registries where each dataset is a directory that uses a different DVC remote. No default remote is set in this kind of project to help prevent people from accidentally pushing a dataset to the wrong remote (e.g. different S3 bucket keys). In a project like that, get and import won't work because they expect a default remote.

Possible implementation solutions

  • Add --remote option to get/import: not great because requires manually inspecting the source project config file before being able to get/import.
  • get/import can try every remote in the source project config file sequentially and use the first one that contains the target data to download
  • New behavior in dvc remote default to be able to set default remotes for specific paths in a project (i.e. this would be done in the source data reg, not when getting/importing)

Thoughts?

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions