-
Notifications
You must be signed in to change notification settings - Fork 1.3k
Description
Here's the Weekly Digest for iterative/dvc:
ISSUES
Last week 39 issues were created.
Of these, 23 issues have been closed and 16 issues are still open.
OPEN ISSUES
π #3722 gdrive: progress for downloads, by casperdcl
π #3720 status: add --recursive flag, by nik123
π #3719 Duplicated remote repository name in different config levels., by karajan1001
π #3718 ERROR: unexpected error - are any errors expected by the user?, by skshetry
π #3717 DVC add fails when there are broken symlinks in the dataset, by greaber
π #3716 [WIP] repo: use unified RepoTree for erepos, by pmrowla
π #3715 Dvc remote default in list validation , by karajan1001
π #3714 Get Remote Storage URL for files/directories added directly from S3, by AratiNagmal
π #3706 run: ui issue for run, by skshetry
π #3703 remote: should DVC prevent external cache overlap default remote?, by jorgeorpinel
π #3700 pipeline file: characters to allow in stage name, by skshetry
π #3698 invalid start byte, by Sunny-Day200
π #3697 Linking type should not be committed to git, by drorata
π #3693 cleanups: todo after implementation of pipeline file, by skshetry
π #3690 --show-md output for metrics, params and dvc diff, by dmpetrov
π #3685 publish conda package for python 3.8, by antonkulaga
CLOSED ISSUES
β€οΈ #3721 progress: add postfix info to avoid overwriting desc, by casperdcl
β€οΈ #3713 stage: cache: use lockfiles, by efiop
β€οΈ #3712 Restyle dump: lockfile dump deterministically, by restyled-io[bot]
β€οΈ #3711 dump: deterministic lockfile dump, by skshetry
β€οΈ #3710 dvc: rename pipelines.yaml -> dvc.yaml, by efiop
β€οΈ #3709 run: try to save deps before running the command, by efiop
β€οΈ #3708 serialize: use checksums that are already saved, by efiop
β€οΈ #3707 run: params are not checked before running the command, by skshetry
β€οΈ #3705 remote: adjust traverse threshold multiplier, by pmrowla
β€οΈ #3704 remote: reduce traverse weight multiplier, by pmrowla
β€οΈ #3702 setup: relax python-dateutil pip version constraint to include v2.8.2. #3701, by dchichkov
β€οΈ #3701 Relax pip dependency versions constraints for python-dateutil, by dchichkov
β€οΈ #3699 tag: getting rid of it, by skshetry
β€οΈ #3696 lockfile: order of content changes on a repro, by skshetry
β€οΈ #3695 refactor: dvc/output class names unification, by nik123
β€οΈ #3694 dvc: implement params support for pipeline file, by skshetry
β€οΈ #3692 Restyle [WIP] dvc: introduce local stage cache, by restyled-io[bot]
β€οΈ #3691 Restyle dvc: implement multistage dvcfile, by restyled-io[bot]
β€οΈ #3689 repo: use reverse post-order DFS in repro --downstream, by pmrowla
β€οΈ #3688 refactor: dvc/dependency class names unification, by nik123
β€οΈ #3687 install: rename Windows package installation name, by fabiosantoscode
β€οΈ #3686 gdrive: fix multi-remote workflow, cont. cleanup, by shcheklein
β€οΈ #3684 refactor: dvc/remotes class names unification, by nik123
LIKED ISSUE
π #3684 refactor: dvc/remotes class names unification, by nik123
It received π x2, π x0, π x2 and β€οΈ x2.
NOISY ISSUE
π #3687 install: rename Windows package installation name, by fabiosantoscode
It received 6 comments.
PULL REQUESTS
Last week, 24 pull requests were created, updated or merged.
UPDATED PULL REQUEST
Last week, 4 pull requests were updated.
π #3722 gdrive: progress for downloads, by casperdcl
π #3716 [WIP] repo: use unified RepoTree for erepos, by pmrowla
π #3715 Dvc remote default in list validation , by karajan1001
π #3647 remote: add support for WebDAV, by shizacat
MERGED PULL REQUEST
Last week, 20 pull requests were merged.
π #3721 progress: add postfix info to avoid overwriting desc, by casperdcl
π #3713 stage: cache: use lockfiles, by efiop
π #3711 dump: deterministic lockfile dump, by skshetry
π #3710 dvc: rename pipelines.yaml -> dvc.yaml, by efiop
π #3709 run: try to save deps before running the command, by efiop
π #3708 serialize: use checksums that are already saved, by efiop
π #3705 remote: adjust traverse threshold multiplier, by pmrowla
π #3702 setup: relax python-dateutil pip version constraint to include v2.8.2. #3701, by dchichkov
π #3699 tag: getting rid of it, by skshetry
π #3695 refactor: dvc/output class names unification, by nik123
π #3694 dvc: implement params support for pipeline file, by skshetry
π #3689 repo: use reverse post-order DFS in repro --downstream, by pmrowla
π #3688 refactor: dvc/dependency class names unification, by nik123
π #3686 gdrive: fix multi-remote workflow, cont. cleanup, by shcheklein
π #3684 refactor: dvc/remotes class names unification, by nik123
π #3676 dvc: implement multistage dvcfile, by skshetry
π #3675 remote.ssh: suppress paramiko logging, by pmrowla
π #3672 remote: use string paths over PathInfo for performance reasons, by pmrowla
π #3603 dvc: introduce local stage cache, by efiop
π #3577 Metrics - plotting for multiple revisions initial, by pared
COMMITS
Last week there were 22 commits.
π οΈ [progress: add postfix info to avoid overwriting desc (#3721) * progress: persist primary description
Move subsequent updates to a postfix.
Clear postfix on exit.
TODO: align nicely.
Fixes #3681.
- progress: move to posfix[info] for full control
It was bound to happen.
-
progress: git: persist description](c873787) by casperdcl
π οΈ [Metrics - plotting for multiple revisions initial (Metrics - plotting for multiple revisions initialΒ #3577) * init -
rename to plot data insertion basig on dicts update
-
revision support
-
roll back revision
-
plot makedirs for backward compatibility
-
log path
-
pretty plot link to visualization page
-
make target default title
-
efiop review
-
efiop review
-
plot multiple initial
-
add some missing metric file tests
-
proper id generation
-
proper id generation
-
add confusion matrix template
-
refactor tests
-
plot from dvct file
-
plot from dvct
-
brush up commands
-
fix confusion matrix multiple plot
-
plot: change confusion matrix data schema
-
should be working as intended
-
support for src file in dvct files
-
minor fixes
-
plot: support json templates
-
plot: rename confusion template
-
plot: polish command behaviour
-
fix test for json
-
plot: test command
-
some minor fixes for tests
-
plot: unit test loading
-
plot: unit test loading
-
plot: handle TODOS
-
cleanup
-
use mocker
-
plot: support tsv
-
plot: command refactoring
-
plot: fix windows issues with tests
-
plot: test: some more windows fixes
-
plot: _load_from_revisions complexity fix
-
plot: reduce complexity
-
plot: complexity reduction
-
plot: deepsource suggestions
-
plot: move template path evaluation
-
fixup
-
fixup
-
exception on no datafile and no template
-
json metric load with OrderedDict
-
plot: improve handling non-existing files on revisions
-
plot: improve handling non-existing files on revisions
-
change default plot path
-
some exceptions and fixes
-
add yaml metrics support
-
fixup
-
some more suggestions
-
default filename fix
-
efiop review requests
-
log exception on failur
-
move revisions deduction to commands
-
json templates
-
extract template filling to separate method
-
some parsing improvements
-
add columns functionality
-
extract default data transformation to separate method
-
plot: initial support for jsonpath
-
plot: rename columns to filters, tests are dict based
-
plot: fixups
-
plot: refactoring
-
repo: plot: convert to package
-
plot: data loading refactor, support searching for data
-
plot: raise if wrong fields provided
-
plot: command description
-
plot: default: pass y axis info for default plot
-
plot: get rid of fieldnames, expect ordered data
-
plot: handle default plot in separate method
-
plot: fix default
-
plot: command option names fixes
-
refactoring
-
fixes
-
plot: provide option for stdout redirection
-
plot: rename show-json to no-html
-
plot: add no-csv-header option
-
plot: improve error message for wrongly structured metric
-
plot: match template name exactly, whit suffix appended only
-
plot: dmpetrov and ivan review
-
plot: refactor --stdout help message
-
plot: move template to repo/plot
-
plot: add -x and -y options
-
plot: add -x and -y options
-
plot: command: order change
-
plot: scatter
-
plot: rename confusion matrix template, new name generation format
-
plot: add title anchor
-
plot: review from jorgeorpinel
-
plot: rename filter and result options to select and file
-
plot: add --title, --x-title, --y-title
-
plot: xlab ylab
-
Update dvc/repo/plot/template.py
Co-authored-by: Ruslan Kuprieiev kupruser@gmail.com
- Update dvc/repo/plot/template.py
Co-authored-by: Ruslan Kuprieiev kupruser@gmail.com
-
efiop review
-
plot: bash completion
-
plot: static code analysis fixes
Co-authored-by: Ruslan Kuprieiev kupruser@gmail.com](e553511) by pared
π οΈ [stage: cache: use lockfiles (#3713) * stage: cache: use lockfiles
-
reorganize
-
Reorganize, use fill_from_lock to load stage cache
-
load params from build cache
-
adjust tests
Co-authored-by: Saugat Pachhai suagatchhetri@outlook.com](827c994) by efiop
π οΈ [dump: deterministic lockfile dump (#3711) * dump: lockfile is dumped deterministically
The dump is no longer deterministic/dependent on the pipeline file,
but is sorted based on file names in outs, deps or params.
Also, the params inside each files are also sorted based on name.
However, the objects inside params are not sorted deterministically
as I think it's too much to sort that, and is not easy (considering
the types of objects it might hold, eg: lists, objects, etc).
This will also provide ordered dumps for Python3.5
- fix windows 3.8 test
Co-authored-by: Ruslan Kuprieiev ruslan@iterative.ai](22c60dd) by skshetry
π οΈ dvc: rename pipelines.yaml -> dvc.yaml (#3710) by efiop
π οΈ [run: try to save deps before running the command (#3709) Unlike old _check_missing_deps, this also verifies that we are able to
save more complex dependencies such as parameters, where we not only
care about the config file, but also about the parameters in it.
Fixes #3707](71d156a) by efiop
π οΈ serialize: use checksums that are already saved (#3708) get_checksum() recomputes the checksum which might not match the
pre-recorded one. checksum is the one that was save()ed during run
and it is the one that should be used in the lockfile. by efiop
π οΈ remote: adjust traverse threshold multiplier (#3705) * Fixes #3704 by pmrowla
π οΈ [dvc: introduce local build cache (#3603) This patch introduces .dvc/cache/stages that is used to store previous
runs and their results, which could then be reused later when we stumble
upon the same command with the same deps and outs.
Format of build cache entries is single-line json, which is readable by
humans and might also be used for lock files discussed in #1871.
Related to #1871
Local part of #1234](18e8f07) by efiop
π οΈ [gdrive: fix multi-remote workflow, cont. cleanup (#3686) * remote, minor: fix parameter method name for consistency
-
gdrive: cleanup, fix workflow with multiple gdrive remotes
-
config: resolve gdrive cred file parth, typo fix
-
grdive: address deepsource warning
-
gdrive: fix tests after simplifying auth flow
-
gdrive: address PR review, use backticks where appropriate
Co-Authored-By: Jorge Orpinel jorgeorpinel@users.noreply.github.com
- gdrive: exception text improvements
Co-Authored-By: Jorge Orpinel jorgeorpinel@users.noreply.github.com
- gdrive: fix exception message
Co-Authored-By: Jorge Orpinel jorgeorpinel@users.noreply.github.com
- gdrive: fix root not found exception message
Co-Authored-By: Jorge Orpinel jorgeorpinel@users.noreply.github.com
-
gdrive: minor warnings/exceptions text improvement
-
gdrive: add tests for the gdrive_user_credentials_file relative path
-
gdrive: address review, slightly change text
-
gdrive: comments -> docstrings, addressing PR review
Co-authored-by: Jorge Orpinel jorgeorpinel@users.noreply.github.com](8aefbac) by shcheklein
π οΈ [setup: relax python-dateutil pip version constraint to include v2.8.2. #3701 (#3702) * Relax python-dateutil pip version constraint to include v2.8.2. #3701
[WIP]. Attempt to relax upper constraints on the version of python-dateutil, to include 2.8.1 and 2.8.2.
Original constraint was:
"python-dateutil<2.8.1,>=2.1", # Consolidates azure-blob-storage and boto3
- Update setup.py
Co-authored-by: Ruslan Kuprieiev kupruser@gmail.com](40c6b56) by dchichkov
π οΈ [tag: getting rid of it (#3699) * tags: get rid of it
-
tests: adjust for tags and allow @ in stage name
-
disallow @ on stage name
-
fix ds issue, leftover tag in Stage](907853b) by skshetry
π οΈ [remote.ssh: suppress paramiko logging (remote.ssh: suppress paramiko loggingΒ #3675) * remote.ssh: suppress paramiko logging -
use SSHClient.set_log_channel](31fe778) by pmrowla
π οΈ [dvc: implement params support for pipeline file (dvc: implement params support for pipeline fileΒ #3694) * dvc: implement params support for pipeline file -
reorganize dump/load of params
-
add tests for params
-
fix order of dictionary in Python 3.5](7edee84) by skshetry
π οΈ refactor: dvc/output class names unification (#3695) by nik123
π οΈ [repo: use reverse post-order DFS in repro --downstream (repo: use reverse post-order DFS in repro --downstreamΒ #3689) * tests: ensure repro --downstream preserves dependency order -
repo: use reverse post-order DFS in repro --downstream
-
make test_repro and test_repro_multistage consistent
-
rebase master, update multistage test](4fdf7ca) by pmrowla
π οΈ innosetup: rename exe app name Fixes https://github.com/iterative/dvc/issues/3687 by efiop
π οΈ [remote: use string paths over PathInfo for performance reasons (remote: use string paths over PathInfo for performance reasonsΒ #3672) * remote: use checksums instead of paths when filling dir statuses -
remote: prefer using str paths over PathInfo for performance reasons
-
only use string paths for RemoteLOCAL
-
only optimize calls made from RemoteLOCAL.cache_exists
-
fix ds warning
-
use os.path in local
-
remote: re-add checksum_to_path() to return string paths when applicable
- cloud remotes still default to using PathInfo's
-
cache fspath string
-
use abspath in checksum_to_path
- if path is not relpath from cwd or abspath, posix lstat() syscall
runtime doubles (from calculating relpath from cwd)](6d8499e) by pmrowla
π οΈ stage: exceptions: update formatting by efiop
π οΈ [dvc: implement multistage dvcfile (dvc: implement multistage dvcfileΒ #3676) * dvc: use lockfile as yaml instead of json
-
refactor Dvcfile into Pipeline file and Single stage file
-
fix tests
-
dvc: fix outputs
-
add more tests for collection of outputs
-
add tests for data cloud/get/import/ls
-
tests: test for checkouts
-
Allow other checksums other than md5
-
tests: use iterdir instead of os.listdir
-
cleanup errors reported by cc and ds
-
utils: throw DvcException instead of plain Exception
-
tests: use yaml.load instead of json.load
-
run: split assignments
-
Update dvc/stage/exceptions.py
Co-authored-by: Ruslan Kuprieiev kupruser@gmail.com](1937527) by skshetry
π οΈ refactor: dvc/dependency class names unification (#3688) by nik123
π οΈ refactor: dvc/remotes name unification (#3684) Partially fixes #2089 by nik123
CONTRIBUTORS
Last week there were 8 contributors.
π€ casperdcl
π€ pared
π€ efiop
π€ skshetry
π€ pmrowla
π€ shcheklein
π€ dchichkov
π€ nik123
STARGAZERS
Last week there were 44 stagazers.
β SuryaThiru
β hoisee
β tinyRatP
β rlalpha
β SergeevVladislav
β agis85
β twang96
β HtutLynn
β VictorGuedes
β homutov
β deshraj
β ms-sharma
β sagewhocodes
β virendrasuryavanshi
β neutrinus
β Mordin13
β hademircii
β oke-aditya
β eyehattaya
β ravila4
β oiotoxt
β kaka-lin
β stefanocoretta
β alheio
β polololya
β Pachec0o0
β jayvasantjv
β Shastick
β nilsdebruin
β conraddd
β SAr2r
β MHarland
β mattc-eostar
β DanHugoDanHugo
β akanz1
β phamquiluan
β germz01
β courentin
β Oktosha
β GiulioRossetti
β bassrehab
β achicha
β MsMandelbrot
β ghrahul
You all are the stars! π
RELEASES
Last week there were no releases.
That's all for last week, please π Watch and β Star the repository iterative/dvc to receive next weekly updates. π
You can also view all Weekly Digests by clicking here.
Your Weekly Digest bot. π