Add cache invalidation to OnDiskCacheHolder #1111

@bilelomrani1

🚀 The feature

Add the ability to automatically invalidate a cached sub-graph when the remote files change after being cached locally.

Motivation, pitch

Say I have multiple files stored in remote object storage. These files are fed into a datapipe using FSSpecFileLister and cached locally using .on_disk_cache. I want the cache to be invalidated and the datapipe re-computed whenever one or more of the remote files change, with changes probably detected by comparing file hashes.
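To make the request concrete, here is a rough sketch of the hash-based check I have in mind. It is not torchdata API: `file_digest`, `stale_sources`, `record_sources` and the JSON manifest file are hypothetical names made up for illustration, and local paths stand in for the remote files. For real object storage, a cheaper fingerprint (for example an ETag, or whatever fsspec's `checksum` returns for the file) would probably be preferable to downloading and hashing the content.

```python
import hashlib
import json
from pathlib import Path
from typing import Iterable, List


def file_digest(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    h = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def stale_sources(sources: Iterable[Path], manifest_path: Path) -> List[Path]:
    """Return the sources whose digest differs from the one in the manifest.

    A source that has never been recorded counts as stale.
    """
    recorded = {}
    if manifest_path.exists():
        recorded = json.loads(manifest_path.read_text())
    return [p for p in sources if recorded.get(str(p)) != file_digest(p)]


def record_sources(sources: Iterable[Path], manifest_path: Path) -> None:
    """Write the current digest of every source to the manifest (hypothetical format)."""
    digests = {str(p): file_digest(p) for p in sources}
    manifest_path.write_text(json.dumps(digests, indent=2))
```

The idea would be that the cache holder runs something like `stale_sources` before serving cached results, drops the cached outputs derived from any stale source, re-computes that part of the datapipe, and then calls something like `record_sources` once the new results are written.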

Alternatives

No response

Additional context

This feature request originated from this conversation on the PyTorch forum.
