Make pinset sharding deterministic #3640

Merged
whyrusleeping merged 1 commit into master from fix/pinset-obj-explosion on Feb 12, 2017

Conversation

whyrusleeping (Member)

Making this deterministic keeps us from creating an exponential amount
of objects as the number of pins in the set increases.

closes #3621 for the most part
License: MIT
Signed-off-by: Jeromy jeromyj@gmail.com

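For context on what "deterministic" means here, below is a minimal sketch, not the PR's actual code: the child bucket an entry lands in is derived purely from a hash of the entry salted with the tree depth, rather than from a freshly generated random seed, so rebuilding the pinset reproduces the same intermediate objects instead of minting new ones each time. The fanout constant, key format, and FNV hash are assumptions for illustration.

```go
package main

import (
	"encoding/binary"
	"fmt"
	"hash/fnv"
)

const fanout = 256 // assumed fanout; the real constant lives in pin/set.go

// bucket deterministically maps a pinned key (e.g. a CID's bytes) to one of
// fanout child buckets, salting the hash with the tree depth so the same key
// can land in different buckets at different levels of the shard tree.
func bucket(depth uint32, key []byte) uint32 {
	var salt [4]byte
	binary.LittleEndian.PutUint32(salt[:], depth)

	h := fnv.New32a()
	h.Write(salt[:])
	h.Write(key)
	return h.Sum32() % fanout
}

func main() {
	key := []byte("example-pin-key") // hypothetical key for illustration
	// Re-running this always prints the same bucket indices: the layout no
	// longer depends on a per-write random seed, so rewriting the pinset
	// reuses the same intermediate objects instead of creating new ones.
	for depth := uint32(0); depth < 3; depth++ {
		fmt.Printf("depth %d -> bucket %d\n", depth, bucket(depth, key))
	}
}
```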
@whyrusleeping whyrusleeping added the status/in-progress In progress label Jan 28, 2017
@whyrusleeping whyrusleeping added this to the ipfs 0.4.6 milestone Jan 29, 2017
@rht (Contributor) commented Jan 31, 2017

Confirmed there is no longer an exponential explosion, but pinning still reverses the benefit of block-level dedup.

@Kubuxu (Member) commented Jan 31, 2017

but pinning still reverses the benefit of block-level dedup.

What do you mean by that?

@Kubuxu (Member) commented Jan 31, 2017

From that graph it seems to me that we should start sharding the pinset a bit earlier.

@rht (Contributor) commented Jan 31, 2017

I meant: the storage savings from block-level dedup are overshadowed by the storage requirements of the pinset. In short, files in an IPFS repo end up taking more space than their UnixFS representation alone.

@Kubuxu (Member) commented Feb 1, 2017

I don't know if we should go with a fully static seed; it allows the same attacks that most languages protect their hash maps against (precalculating which items would go to which buckets and crafting requests that pile many items into one bucket). In our case that would cause the tree to have one deep branch.

@whyrusleeping (Member, Author)

@Kubuxu I don't think this is an issue. You would have to find 257 items that all share a hash prefix of length n, where the hash function changes at each byte index, and then convince another node to pin each of them individually.
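To make the bar for such an attack concrete, here is a hedged sketch reusing the hypothetical depth-salted bucket helper from the earlier snippet: all of the colliding items (257 in the comment above, presumably one more than the fanout) would have to land in the same bucket at every successive depth, and because the depth is part of the hash input, each level is an independent collision problem rather than a single reusable precomputation.

```go
package main

import (
	"encoding/binary"
	"fmt"
	"hash/fnv"
)

// bucket is the same hypothetical depth-salted helper sketched above.
func bucket(depth uint32, key []byte) uint32 {
	var salt [4]byte
	binary.LittleEndian.PutUint32(salt[:], depth)
	h := fnv.New32a()
	h.Write(salt[:])
	h.Write(key)
	return h.Sum32() % 256
}

// allCollide reports whether every key falls into the same bucket as keys[0]
// at every depth up to maxDepth. Because the depth is mixed into the hash,
// a collision at one level gives no head start at the next: an attacker who
// wants to force one deep branch must find keys that collide at every level.
func allCollide(keys [][]byte, maxDepth uint32) bool {
	for depth := uint32(0); depth < maxDepth; depth++ {
		want := bucket(depth, keys[0])
		for _, k := range keys[1:] {
			if bucket(depth, k) != want {
				return false
			}
		}
	}
	return true
}

func main() {
	keys := [][]byte{[]byte("pin-a"), []byte("pin-b"), []byte("pin-c")}
	fmt.Println("collide at depths 0..2:", allCollide(keys, 3))
}
```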

@Kubuxu (Member) commented Feb 1, 2017

Yeah, the split into sub-buckets helps in comparison to hash maps.

@whyrusleeping (Member, Author)

@Kubuxu 👍 here?

@Kubuxu (Member) left a comment

It is a quite inefficient way of doing it, but it will have to do. Let's switch to a HAMT as soon as it is viable.

@whyrusleeping whyrusleeping merged commit 4028e89 into master Feb 12, 2017
@whyrusleeping whyrusleeping deleted the fix/pinset-obj-explosion branch February 12, 2017 20:05
@whyrusleeping whyrusleeping removed the status/in-progress In progress label Feb 12, 2017
Successfully merging this pull request may close these issues.

0.4.5: repo does not scale beyond ~ 8000 files