-
Notifications
You must be signed in to change notification settings - Fork 18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
index: introduce fetch #341
Comments
Note that deduping by fs, effectively means creating an index for a storage with files we are interested in and then downloading (aka caching) it locally in some location. For Ideally would also avoid having to modify existing indexes that are passed to us, and maybe we could do that by introducing an ability for each |
@efiop Could you give a high-level example? Is it mostly about imports and other external data? |
@dberenbaum This is about all data management that we have in dvc. So that we can get rid of get_used_objs stuff and so that all manipulations (like filtering by size, etc) are in one place. |
We currently have a junky version of
fetch
based onodb
that is not used anywhere. It was part of early experiments (not dvc exp) and is no longer needed.In
dvc fetch
we currently do two things:we need to take 2), make it dedup based on source fs/path and download stuff into a temporary location (note that we are not talking about reproducing the structure of indexes there, but purely stashing data somewhere). This will allow us to download stuff optimally across different indexes (e.g. across different git revisions), which also means that
fetch
should probably accept multiple indexes and not just 1. And probably it should updatestorage_info.data
as a result.Related https://github.com/iterative/studio/issues/4782
The text was updated successfully, but these errors were encountered: