Skip to content

How is the MD5 hash of entire directories in the .dvc files calculated? #9732

Answered by pmrowla
mblsf asked this question in Help
Discussion options

You must be logged in to vote

.dir files are JSON files that contain a listing of the files in the directory. The hash is the MD5 of the JSON content. This hash would mainly only be useful to know whether or not you have downloaded the correct JSON content for a given .dir file.

How are you using the API to download files?

Also, just to be clear, if you are using DVC 2.x, the MD5 hash computed by DVC is not always the MD5 of the original file content, and you cannot compare the DVC MD5 hash to the results of something like md5sum (or python's hashlib.md5)

Replies: 1 comment 3 replies

Comment options

You must be logged in to vote
3 replies
@mblsf
Comment options

@pmrowla
Comment options

Answer selected by mblsf
@mblsf
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Help
Labels
None yet
3 participants