Implement compatible bitfield #466

austinabell · 2020-05-31T02:28:49Z

Summary of changes
Changes introduced in this pull request:

Needed for the miner actor, roughly matches functionality of https://github.com/filecoin-project/go-bitfield (but is not optimized yet) and abstractions in https://github.com/filecoin-project/specs-actors/blob/master/actors/abi/bitfield.go are included (I don't know why they have seperate)

Was previously setup because it was optimistic that a basic bitvec would be sufficient and the API would have all the necessary functionality. Because of inconsistencies in how the bitvectors are decoded and when they error, this functionality was setup.

It doesn't match the go impl 1:1 because to match their iterators is quite tedious and going to PR the functionality in first and I'll open an issue to benchmark and optimize. There are some benefits to having it the way this is rather than those iterators, so will wait for benchmarks to draw conclusions. (This impl does less encoding/decoding but requires more memory in an actual environment probably)

cc @dutterbutter let me know if this interface fits your needs, happy to tweak things if needed. I sometimes require mutable reference and flush to avoid inefficiencies and unnecessary decoding but I can change if needed.

Reference issue to close (if applicable)

Closes

Other information and links

utils/bitfield/tests/bitfield_tests.rs

utils/bitfield/src/rleplus.rs

vm/actor/src/builtin/miner/state.rs

timvermeulen

This is really cool!

I haven't looked too much into the Go implementation but I'm curious what the exact trade-off is for flushing the bitfield on operations that otherwise only need read-only access. Surely something like a single get call on an RLE+-encoded bitvec can be done more efficiently than decoding it first. So I suppose the idea is that you usually do more than one get (or other operations that are more efficient on decoded bitvecs than encoded ones), and at some point it pays off to decode it up front?

We could always expose the flush operation and have the user of the type decide whether to do this or not (if they have a mutable reference to it). Then we can make get/first/count etc work on shared references as well.

utils/bitfield/tests/bitfield_tests.rs

utils/bitfield/src/lib.rs

utils/bitfield/tests/bitfield_tests.rs

utils/bitfield/src/lib.rs

austinabell · 2020-06-01T14:49:30Z

I haven't looked too much into the Go implementation but I'm curious what the exact trade-off is for flushing the bitfield on operations that otherwise only need read-only access.

Lookups are O(1) after the first flush, instead of checking the set and unset hashsets as well as iterating over the RLE encoded bits.

Surely something like a single get call on an RLE+-encoded bitvec can be done more efficiently than decoding it first.

Yes, I was planning on doing this in a seperate issue which would include benchmarking, but you can iterate over the RLE encoded bits to see if the index bit is set (that is the only other thing preventing) but waiting for benchmarking because checking the hash of the index against the set and unset cache as well as iterating over the RLE bits isn't trivial in extreme cases

So I suppose the idea is that you usually do more than one get (or other operations that are more efficient on decoded bitvecs than encoded ones), and at some point it pays off to decode it up front?

Yeah, that's the idea. I don't particularly like it, but it solves our need and matches the go impl (and where it would error, which is important)

We could always expose the flush operation and have the user of the type decide whether to do this or not (if they have a mutable reference to it). Then we can make get/first/count etc work on shared references as well.

Yes, that was what I wanted to do, but this being drop in functionality for the specs-actors, invariants of different error handling and inconsistent functionality is not something worth this change right now, especially since this will most likely be refactored. The main goal is to match functionality and have a consistent API

utils/bitfield/src/lib.rs

timvermeulen

This should be good enough for now

austinabell · 2020-06-01T18:53:46Z

Applied f94217d @timvermeulen @AshantiMutinta Sorry to do after your review, but I liked Ashanti's suggestion and it was a very small change and not functional difference to make in another PR

timvermeulen · 2020-06-01T19:48:30Z

@austinabell Oh I missed that but lgtm 🙂

austinabell added 8 commits May 29, 2020 19:45

Setup core bitfield operations and replace usages

b91bf08

Setup union and fix merge

8ea3903

Implement other specs-actors bitfield methods

3d74e16

Update and port over slice and union tests

3a4ced7

Setup more tests pre refactor

9a2e342

Refactor merge

aeaf5ab

Port over other go tests

a206b6c

Test untested functionality

cec3450

austinabell added Status: Needs Review VM labels May 31, 2020

austinabell requested review from ansermino, dutterbutter and ec2 as code owners May 31, 2020 02:28

austinabell requested review from timvermeulen, StaticallyTypedAnxiety, flodesi and RajarupanSampanthan May 31, 2020 02:29

valid point, clippy. Conceptually it seemed the same though

a856454

StaticallyTypedAnxiety reviewed May 31, 2020

View reviewed changes

utils/bitfield/tests/bitfield_tests.rs Outdated Show resolved Hide resolved

StaticallyTypedAnxiety reviewed May 31, 2020

View reviewed changes

utils/bitfield/src/rleplus.rs Outdated Show resolved Hide resolved

StaticallyTypedAnxiety reviewed May 31, 2020

View reviewed changes

utils/bitfield/src/rleplus.rs Outdated Show resolved Hide resolved

StaticallyTypedAnxiety reviewed May 31, 2020

View reviewed changes

vm/actor/src/builtin/miner/state.rs Show resolved Hide resolved

Comment changes

eab02cf

timvermeulen suggested changes Jun 1, 2020

View reviewed changes

timvermeulen reviewed Jun 1, 2020

View reviewed changes

utils/bitfield/src/lib.rs Show resolved Hide resolved

Addr comments

6ea081b

Update contains_all functionality and update test

6078431

timvermeulen reviewed Jun 1, 2020

View reviewed changes

utils/bitfield/src/lib.rs Outdated Show resolved Hide resolved

oops

26911f6

Fix check

afdb4b7

timvermeulen approved these changes Jun 1, 2020

View reviewed changes

StaticallyTypedAnxiety approved these changes Jun 1, 2020

View reviewed changes

austinabell added 2 commits June 1, 2020 14:49

Apply Ashanti's suggestion

f94217d

Merge branch 'master' into austin/bitfield

2a3503e

dutterbutter approved these changes Jun 1, 2020

View reviewed changes

austinabell mentioned this pull request Jun 1, 2020

BitField optimization and benchmarking #468

Closed

austinabell merged commit 6308313 into master Jun 1, 2020

austinabell deleted the austin/bitfield branch June 1, 2020 20:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement compatible bitfield #466

Implement compatible bitfield #466

austinabell commented May 31, 2020

timvermeulen left a comment

austinabell commented Jun 1, 2020

timvermeulen left a comment

austinabell commented Jun 1, 2020

timvermeulen commented Jun 1, 2020

Implement compatible bitfield #466

Implement compatible bitfield #466

Conversation

austinabell commented May 31, 2020

timvermeulen left a comment

Choose a reason for hiding this comment

austinabell commented Jun 1, 2020

timvermeulen left a comment

Choose a reason for hiding this comment

austinabell commented Jun 1, 2020

timvermeulen commented Jun 1, 2020