feat: Switch from SHA3-256 to BLAKE3-256 #306

matheus23 · 2023-07-10T19:28:22Z

Summary

Switched any sha3 dependencies out for blake3 with the traits-preview feature. That feature made the switch pretty seamless.
Switched prime_hash impl to using little-endian big integer encoding, otherwise some tests would run out of stack space. BigUInt::from_bytes_be actually simply copies over the bytes into a new array that it reverses and runs BigUint::from_bytes_le on, which causes quite a lot of allocation.
Added a test to make sure future changes to prime_hash don't unintentionally change hash results.
Adjusted the way private file blocks are referenced to make it easier to construct the names
Added base_name to ExternalFileContent. This allows users with snapshot access only to derive the necessary block labels (they wouldn't be able to access the header.name).
Did some minor changes to keys.rs, making the API slimmer (the SnapshotKey and TemporalKey newtypes are now private), and removing From<> impls in favor of documented, (hopefully hard to misuse?) special-purpose functions.

Still having the "tests run out of stack" issue every now and then. After a couple of complicated debugging sessions with lldb, it turns out the issue is mostly having functions with lots of await points (e.g. unit tests!). All of the Futures are allocated on the stack, blowing it. There's no good solution that I'm aware of, except for RUST_MIN_STACK=2500000 cargo test.

codecov · 2023-07-10T19:35:50Z

Codecov Report

Merging #306 (05529f5) into main (c17f6bb) will decrease coverage by 0.04%.
The diff coverage is 86.07%.

@@            Coverage Diff             @@
##             main     #306      +/-   ##
==========================================
- Coverage   56.25%   56.22%   -0.04%     
==========================================
  Files          43       43              
  Lines        3180     3189       +9     
  Branches      769      770       +1     
==========================================
+ Hits         1789     1793       +4     
- Misses        900      904       +4     
- Partials      491      492       +1

Impacted Files	Coverage Δ
wnfs-hamt/src/diff.rs	`19.29% <ø> (ø)`
wnfs-hamt/src/hamt.rs	`27.50% <ø> (ø)`
wnfs-hamt/src/node.rs	`46.66% <ø> (ø)`
wnfs-hamt/src/pointer.rs	`6.25% <ø> (ø)`
wnfs/examples/mnemonic_based.rs	`0.00% <0.00%> (ø)`
wnfs/src/private/forest/traits.rs	`100.00% <ø> (ø)`
wnfs/src/private/keys/access.rs	`42.10% <0.00%> (-4.96%)`	⬇️
wnfs/src/private/node/keys.rs	`78.94% <85.71%> (-3.11%)`	⬇️
wnfs/src/private/file.rs	`78.37% <88.00%> (-0.32%)`	⬇️
wnfs-nameaccumulator/src/name.rs	`82.51% <91.66%> (-0.29%)`	⬇️
... and 9 more

expede · 2023-07-10T19:56:04Z

I haven't actually like plotted the benchmarks or anything, but just looking at the raw numbers it looks like this PR mostly runs as-fast-or-faster than the previous PR 🙌

expede · 2023-07-10T19:56:56Z

Actually, do we have a tool that plots the benches set up? Save me from tabbing back and forth between the plaintext results 😛

matheus23 · 2023-07-10T19:58:27Z

We have this: https://wnfs-wg.github.io/rs-wnfs/dev/bench/

But I don't think we have anything like that for branches.

expede · 2023-07-10T19:58:43Z

Oh these benchmarks are for native code though, right? Do we have Wasm benches?

matheus23 · 2023-07-10T19:58:59Z

No WASM benches currently :/

expede · 2023-07-10T20:00:43Z

(I really dislike the lack of threading in main thread PR comments)

We should at minimum run some manual tests for this PR I think, especially given that GH Issue on the BLAKE3 repo. Setting up Wasm benches would be nice in general (we have wasm-js tests, so maybe extend that for rough microbenches?)

expede · 2023-07-11T04:40:12Z

Was chatting with @zeeshanlakhani about this, and he says that the right way to bench Wasm is via JS wrappers. It would be nice to have them for all targets (so... two for now, but who knows down the road)

matheus23 · 2023-07-12T13:24:41Z

Remaining TODOs:

Manual testing for Wasm performance
Wait for feat: Switch from SHA3-256 to BLAKE3-256 rs-skip-ratchet#32 to be merged & released
Resolve conflicts

This ensures you can re-generate all block labels, even if you don't have access to the PrivateNodeHeader`, e.g. when you only have snapshot access.

matheus23 · 2023-07-19T14:13:09Z

I set up this very crude benchmark:

Code

import { PrivateForest, PrivateDirectory } from "./pkg/wnfs_wasm.js";
import { CID } from "multiformats/cid";
import { sha256 } from "multiformats/hashes/sha2";
import { webcrypto } from "one-webcrypto";

class MemoryBlockStore {
    /** Creates a new in-memory block store. */
    constructor() {
        this.store = new Map();
    }

    /** Stores an array of bytes in the block store. */
    async getBlock(cid) {
        const decoded_cid = CID.decode(cid);
        return this.store.get(decoded_cid.toString());
    }

    /** Retrieves an array of bytes from the block store with given CID. */
    async putBlock(bytes, codec) {
        const hash = await sha256.digest(bytes);
        const cid = CID.create(1, codec, hash);
        this.store.set(cid.toString(), bytes);
        return cid.bytes;
    }
}

const rng = {
    /** Returns random bytes of specified length */
    randomBytes(count) {
        const array = new Uint8Array(count);
        webcrypto.getRandomValues(array);
        return array;
    }
}


async function test() {
    const something = new Uint8Array(1024 * 1024);

    const store = new MemoryBlockStore();
    const forest = new PrivateForest(rng);
    const dir = new PrivateDirectory(forest.emptyName(), new Date(), rng);

    const reps = 100;
    const before = performance.now();
    
    for (let i = 0; i < reps; i++) {
        await dir.write(["Hello", "World", "file"], false, something, new Date(), forest, store, rng);
        await dir.store(forest, store, rng);
    }

    const time = performance.now() - before;
    console.log("Time per write & store: ", time / reps);


    await forest.store(store);
}

test();

And the variance between runs is bigger than the variance between main and this PR.
So no performance regression @expede :)

EDIT: LOL should've added the benchmark results:

Two runs on c17f6bb (main):

$ node test.js 
Time per write & store:  351.0440690800175

$ node test.js 
Time per write & store:  338.9089437599853

Two runs on 05529f5 (this PR):

$ node test.js 
Time per write & store:  338.4276692800038

$ node test.js 
Time per write & store:  315.1771848800033

matheus23 · 2023-07-19T14:33:52Z

@appcypher This is ready for review :)

appcypher

LGTM! 🎉

feat: Switch from SHA3-256 to BLAKE3-256

5d2cf33

matheus23 self-assigned this Jul 10, 2023

matheus23 requested a review from a team as a code owner July 10, 2023 19:28

Also use Blake3 by default in BlockStore::create_cid

bbf1291

matheus23 mentioned this pull request Jul 11, 2023

Data Format Stability Tasks #303

Open

6 tasks

Also use Blake3 in skip ratchet key derivation

f35f738

matheus23 added 12 commits July 17, 2023 17:08

Make use of blake3::derive_key algorithm

d17451a

Merge remote-tracking branch 'origin/main' into matheus23/sha3-to-blake3

635c754

Un-expose temporal key bytes

8a9adc7

Update domain separation string

dc9109e

Update prime hash fixture

2dea9e6

Fix block naming consistency

b413bfc

Dedicated APIs for key structs & cleanup

bfee036

Store a base_name in ExternalFileContent

07be022

This ensures you can re-generate all block labels, even if you don't have access to the PrivateNodeHeader`, e.g. when you only have snapshot access.

Lint

f889315

Give tests more stack space

6154b8e

Fix wasm-wnfs

7fbb07c

Make external file content encoding more spec-adhering

ddc2fa7

matheus23 mentioned this pull request Jul 18, 2023

Symlink Metadata and Double Packing Avoidance #256

Closed

Add a hiding segment to base_name

6e911f8

Depend on released skip ratchet crate

05529f5

appcypher approved these changes Jul 21, 2023

View reviewed changes

matheus23 merged commit e164a1f into main Jul 21, 2023
10 checks passed

matheus23 deleted the matheus23/sha3-to-blake3 branch July 21, 2023 11:46

github-actions bot mentioned this pull request Jul 21, 2023

chore: release main #301

Merged

matheus23 mentioned this pull request Jul 25, 2023

PrivateFile::basic_mv and PrivateFile::cp load full file content in memory #242

Closed

matheus23 mentioned this pull request Aug 1, 2023

feat: Implement public directory cp & more efficient copy for PrivateFile #319

Merged

github-actions bot mentioned this pull request Sep 1, 2023

chore: release main #344

Closed

github-actions bot mentioned this pull request Sep 10, 2023

chore: release main #349

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Switch from SHA3-256 to BLAKE3-256 #306

feat: Switch from SHA3-256 to BLAKE3-256 #306

matheus23 commented Jul 10, 2023 •

edited

Loading

codecov bot commented Jul 10, 2023 •

edited

Loading

expede commented Jul 10, 2023

expede commented Jul 10, 2023

matheus23 commented Jul 10, 2023

expede commented Jul 10, 2023

matheus23 commented Jul 10, 2023

expede commented Jul 10, 2023 •

edited

Loading

expede commented Jul 11, 2023 •

edited

Loading

matheus23 commented Jul 12, 2023 •

edited

Loading

matheus23 commented Jul 19, 2023 •

edited

Loading

matheus23 commented Jul 19, 2023 •

edited

Loading

appcypher left a comment

feat: Switch from SHA3-256 to BLAKE3-256 #306

feat: Switch from SHA3-256 to BLAKE3-256 #306

Conversation

matheus23 commented Jul 10, 2023 • edited Loading

Summary

codecov bot commented Jul 10, 2023 • edited Loading

Codecov Report

expede commented Jul 10, 2023

expede commented Jul 10, 2023

matheus23 commented Jul 10, 2023

expede commented Jul 10, 2023

matheus23 commented Jul 10, 2023

expede commented Jul 10, 2023 • edited Loading

expede commented Jul 11, 2023 • edited Loading

matheus23 commented Jul 12, 2023 • edited Loading

matheus23 commented Jul 19, 2023 • edited Loading

matheus23 commented Jul 19, 2023 • edited Loading

appcypher left a comment

Choose a reason for hiding this comment

LGTM! 🎉

matheus23 commented Jul 10, 2023 •

edited

Loading

codecov bot commented Jul 10, 2023 •

edited

Loading

expede commented Jul 10, 2023 •

edited

Loading

expede commented Jul 11, 2023 •

edited

Loading

matheus23 commented Jul 12, 2023 •

edited

Loading

matheus23 commented Jul 19, 2023 •

edited

Loading

matheus23 commented Jul 19, 2023 •

edited

Loading