Bug Report
Description
In our mono-repo we have some finished projects that used DVC 2 (and possibly even DVC 1). More recent projects use DVC 3 (dvc init --subdir). These newer projects may still require some outputs of the older ones, in particular trained model weights that were tracked with an older DVC version. However, the old dvc.yaml pipeline files use features that have been dropped in DVC 3 (stage-level vars).
When trying to pull any file in the old DVC project folder, dvc pull some_file.dvc also tries to validate all other DVC pipeline files and fails with an error. This prevents us from checking out the data without making changes to historic DVC pipelines.
We have the same issue with dvc.lock files created before schema: '2.0' was introduced. These no longer seem to be supported by DVC 3.
required key not provided, in schema
'dvc.lock' validation failed: 13 errors.
extra keys not allowed, in create_index_table, line 2, column 3
1 create_index_table:
2 cmd: python -m scripts.create_index_table
[...]
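To illustrate the layout difference the validation error points at, here is a minimal sketch in Python. It assumes that a pre-2.0 dvc.lock lists stages directly at the top level (as in the excerpt above), while the 2.0 layout nests them under a stages key next to schema: '2.0'. The helper names is_legacy_lockfile and wrap_legacy_lockfile are hypothetical and not part of DVC. Whether DVC 3 accepts the wrapped result has not been verified here, and the per-stage entries may need further changes.

```python
from typing import Any, Dict


def is_legacy_lockfile(data: Dict[str, Any]) -> bool:
    """Return True if a parsed dvc.lock mapping has no top-level 'schema' key.

    Assumption: pre-2.0 lock files list stages directly at the top level.
    A legacy stage that happens to be named 'schema' would be misdetected.
    """
    return "schema" not in data


def wrap_legacy_lockfile(data: Dict[str, Any]) -> Dict[str, Any]:
    """Nest top-level stage entries under 'stages' and add schema '2.0'.

    Hypothetical helper for illustration only; it does not touch the
    contents of the individual stage entries.
    """
    if not is_legacy_lockfile(data):
        return data
    return {"schema": "2.0", "stages": dict(data)}


# Parsed form of the legacy excerpt shown in the error output above.
legacy = {
    "create_index_table": {"cmd": "python -m scripts.create_index_table"},
}
wrapped = wrap_legacy_lockfile(legacy)
```

The functions operate on an already parsed mapping, so any YAML loader can be used to read and write the file. Running this on a copy of the lock file would avoid modifying the historic project in place.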
Reproduce
Expected
As a data versioning tool and a tool to enable reproducibility, DVC should have strong backward compatibility for any generated data (and ideally also for pipelines, though one could use an older DVC version to reproduce them). At the very least, it should still be possible to check out data produced with older DVC versions using the latest DVC version.
Environment information
Output of dvc doctor:
$ dvc doctor
DVC version: 3.16.0 (conda)
---------------------------
Platform: Python 3.10.6 on Linux-3.10.0-1127.8.2.el7.x86_64-x86_64-with-glibc2.17
Subprojects:
dvc_data = 2.15.4
dvc_objects = 1.0.1
dvc_render = 0.5.3
dvc_task = 0.3.0
scmrepo = 1.2.1
Supports:
http (aiohttp = 3.8.5, aiohttp-retry = 2.8.3),
https (aiohttp = 3.8.5, aiohttp-retry = 2.8.3),
s3 (s3fs = 2023.6.0, boto3 = 1.26.76)
Config:
Global: /home/aschuh/.config/dvc
System: /etc/xdg/dvc
Cache types: <https://error.dvc.org/no-dvc-cache>
Caches: local
Remotes: s3, s3
Workspace directory: xfs on /dev/sda1
Repo: dvc (subdir), git
Repo.site_cache_dir: /var/tmp/dvc/repo/f00b3983400f9f38ae255d78e2110269
Additional Information (if any):