Skip to content

dvc import: misleading warnings and outputs #2839

@dmpetrov

Description

@dmpetrov
$ dvc -V
0.70.0+3de46d

One data file was updated in the imported repo. However, it was not a file that I'm trying to update.

$ dvc update data/data.xml.dvc
WARNING: DVC-file 'data/data.xml.dvc' changed.
WARNING: Stage 'data/data.xml.dvc' changed.
Importing 'data.xml (https://github.com/dmpetrov/dataset)' -> 'data/data.xml'
Output 'data/data.xml' didn't change. Skipping saving.
  • UI issues 1: It should not be imported since there are no changes. Output: Importing 'data.xml (https://github.com/dmpetrov/dataset)' -> 'data/data.xml'
  • UI issues 2: Warning duplication DVC-file 'data/data.xml.dvc' changed. and Stage 'data/data.xml.dvc' changed.

PS: Updated file was dir1/file2 which is corresponding to stage file dir1.dvc

EDITED:

  • Issue 3 (not UI): It turned out that data/data.xml.dvc was actually updated after the update command - the repo checksum was changed (while checksum was not updated in the changed file dir1.dvc) with dvc-file checksum. Is it expected?

  • Issue 4 (not UI): When I update a stage that was actually changed I still see this output ... didn't change message while the outputs were changed and even cache was updated. This happens even when I import a single artifact from a repo (I've check in another project). Code:

$ dvc update dir1.dvc
WARNING: DVC-file 'dir1.dvc' changed.
WARNING: Stage 'dir1.dvc' changed.
Importing 'dir1 (https://github.com/dmpetrov/dataset)' -> 'dir1'
Output 'dir1' didn't change. Skipping saving.

Metadata

Metadata

Assignees

No one assigned

    Labels

    p2-mediumMedium priority, should be done, but less importantresearchuiuser interface / interaction

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions