Skip to content

1.0 (not only) testing - comments, finding, issues, etc #3777

@shcheklein

Description

@shcheklein

Some notes on the playground, how to reproduce, and what I'm testing for.

Repo to reproduce: here.

  • I'm running testing on a new version of the example-get-started that I built with this updated gen script.

  • The repo itself (that script from above generates) can be downloaded here. Should be enough to reproduce issues below described below. And can be useful for everyone as an easy entry point to try DVC 1.0 on a bit more sophisticated project.

  • This list below might include things not related 100% to the latest changes we introduced in 1.0, but all of them are potentially relevant. E.g., we change pipelines interface and should try scenarios around.

dvc repro behavior and UI/UX

  • I think we should assume dvc.yaml prefix by default. Instead of running a stage like this dvc repro dvc.yaml:prepare, I should be able to run it like this (if dvc.yaml is found in the current dir? or if it is a single one in the project?):
$ dvc repro prepare

#3842

$ dvc repro model.pkl
  • Repro excessive warning on the default target file name. Either make it info (in general I would say WARNING is a rare thing that we need to show), or don't show again. Instead make a message about the stage that we execute more meaningful (see below). default target: reduce loglevel to debug #3822
$ dvc repro
WARNING: assuming default target 'dvc.yaml'.
$ dvc repro
WARNING: stage: 'data/data.xml.dvc' is locked. Its dependencies are not going to be reproduced.

dvc repro options

$ vim scr/train.py (some dummy change)
$ dvc repro --dry`
  • -p,. --pipeline - deprecate? It does not make sense moving forward to accept .dvc files.

    -p still makes sense for running a pipeline of a given stage, even in 1.0.
    Though, repro -p will not work as either dvc file or stage name needs to be specified.

  • Related to the previous one - we still accept .dvc files into dvc repro - we should not be doing this anymore - semantics is not clear anymore. Deprecate or remove.

    dvc repro is still backward compatible. And, we cannot detect if the repo is 1.0-one,
    as the repo can have a mixture of old-style dvcfiles and new pipeline files.

  • --help review and fix it. e.g. we don't expect Dvcfile by default anymore. repro: fix help message for default target #3831

$ dvc repro --help
DVC-file to reproduce. 'Dvcfile' by default.

Other commands

  • dvc pipeline show - expects Dvcfile, should now do the same as dvc repro + fix help.

  • dvc pipeline show output is broken from a lot of different perspectives - see this comment for suggestions and details.

  • We broke the progress bar for the get/import. It is now Downloading 0/1 files vs Downloading 0/36.2M before. Mean that we effectively don't show a progress bar for the case like:

dvc get https://github.com/iterative/dataset-registry get-started/data.xml -o data/data.xml

This is not fixed by reverting b312895?w=1

Extracted to #3874

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugDid we break something?testingRelated to the tests and the testing infrastructure

    Type

    No type

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions