gc: parallelize garbage collection #5961

@isidentical

Description

It seems like nothing is blocking this, and for cloud providers it could mean up to a 16-20x speedup (dvc gc -c). For some numbers as motivation: removing ~1000 cache files from S3 currently takes about 20-25 minutes.
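
To make the idea concrete, here is a minimal sketch of what parallel removal could look like. It is not DVC's actual implementation: `remove_parallel`, `remove_one`, and the `jobs` parameter are hypothetical names used only for illustration, and `remove_one` stands in for whatever call deletes a single object on the remote. Since each deletion is a network round trip, a thread pool is assumed to be a reasonable fit.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed
from typing import Callable, Iterable, List, Tuple


def remove_parallel(
    paths: Iterable[str],
    remove_one: Callable[[str], None],  # hypothetical per-object delete call
    jobs: int = 16,                     # hypothetical worker count
) -> Tuple[List[str], List[Tuple[str, Exception]]]:
    """Call remove_one(path) for every path using a pool of worker threads.

    Returns (removed, failed), where failed holds (path, exception) pairs,
    so one failing deletion does not abort the rest.
    """
    removed: List[str] = []
    failed: List[Tuple[str, Exception]] = []
    with ThreadPoolExecutor(max_workers=jobs) as pool:
        # Map each future back to its path so results can be reported.
        futures = {pool.submit(remove_one, path): path for path in paths}
        for future in as_completed(futures):
            path = futures[future]
            try:
                future.result()
            except Exception as exc:
                failed.append((path, exc))
            else:
                removed.append(path)
    return removed, failed
```

The actual speedup would depend on per-request latency and on any rate limits the remote enforces, so the worker count would likely need to be configurable.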

Metadata

Assignees

No one assigned

    Labels

    A: gc (Related to garbage collection), performance (improvement over resource / time consuming tasks)

