Skip to content

Back-fill metrics #4210

@jonilaserson

Description

@jonilaserson

Say I am maintaining some data using dvc, and at some point decide I want to have a metric showing some data statistics (i.e. track how many positive samples I have). So I create a pipeline that computes this metric. How do I back-fill it to previous commits? The goal is to plot a graph showing this metric at different stages of the project.

Specifically: If the commit history is A->B->master, I know I can checkout data-commit 'A' and run the pipeline, but I won't be able to save the metric output for commit 'A' in the context of commit 'A', right? At most I will be able to commit it in a new commit (A') whose parent is 'A'. It would have been better if I could have committed it to 'A' directly. Why? Because A' is not an ancestor of 'master', so it's not naturally included in the development of my data.

Metadata

Metadata

Assignees

No one assigned

    Labels

    feature requestRequesting a new featurep2-mediumMedium priority, should be done, but less importantresearch

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions