Skip to content

erskingardner/notepack

Β 
Β 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

54 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

notepack

Docs.rs Crates.io

notepack is a compact binary note format for nostr notes, with a specification and reference implementation in Rust.

It ships with:

  • πŸ“¦ A Rust crate β€” for embedding notepack logic into apps, relays, or tooling.
  • πŸ’» A CLI tool β€” for piping JSON ↔ notepack_… strings in scripts.

πŸ“œ See SPEC.md for the full format specification.


✨ Features

  • Copy‑pasteable string starting with notepack_ + Base64 (RFC 4648, no padding).
  • Compact: Every integer is ULEB128 varint. hex strings in tags are encoded as bytes.
  • 50% size reduction: Many large events like contact lists see a 50% reduction in size
  • Simple: So simple its a candidate for nostr's canonical binary representation

Benchmarks

$ cargo bench

Contact list note with 1022 tags:

  • notepack-decode: 2GB/s, 15 microseconds
  • json-decode: 711MB/s, 100 microseconds

The numbers start to count when you are decoding lots of notes: 1000 notes would take 100ms with json, 15ms with notepack.

If not iterating tags, notepack gets up to 1TB/s at 30 nanoseconds. That's only 0.03ms for 1000 notes if you're not verifying and just want to check a few fields as they stream in.

...but didn't feel like that was a fair comparison since you'll likely need to iterate the tags to verify the note.

I have lots of hacks in nostrdb to do incremental json parsing for note de-duplication/note rejection. with notepack you can get the ID and skip it in less than 15 microseconds.

Example

$ notepack <<<'{"id": "f1e7bc2a9756453fcc0e80ecf62183fa95b9a1278a01281dbc310b6777320e80","pubkey": "7fe437db5884ee013f701a75f8d1a84ecb434e997f2a31411685551ffff1b841","created_at": 1753900182,"kind": 1,"tags": [],"content": "hi","sig": "75507f84d78211a68f2f964221f5587aa957a66c1941d01125caa07b9aabdf5a98c3e63d1fe1e307cbf01b74b0a1b95ffe636eb6746c00167e0d48e5b11032d5"}'

notepack_AfHnvCqXVkU/zA6A7PYhg/qVuaEnigEoHbwxC2d3Mg6Af+Q321iE7gE/cBp1+NGoTstDTpl/KjFBFoVVH//xuEF1UH+E14IRpo8vlkIh9Vh6qVembBlB0BElyqB7mqvfWpjD5j0f4eMHy/AbdLChuV/+Y262dGwAFn4NSOWxEDLVlsmpxAYBAmhpAA
  • json string: 363 bytes
  • notepack string: 124 bytes raw, 196 base64-encoded

For large contact lists, you can crunch them down from 74kb to about 36kb.

πŸ“¦ Usage (Library)

Encoding from Hex Strings

use notepack::{NoteBuf, pack_note_to_string};

let note = NoteBuf {
    id: "aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa".into(),
    pubkey: "bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb".into(),
    created_at: 1753898766,
    kind: 1,
    tags: vec![vec!["tag".into(), "value".into()]],
    content: "Hello, world!".into(),
    sig: "cccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc".into(),
};

let encoded = pack_note_to_string(&note).unwrap();
println!("{encoded}"); // => notepack_AAECAw...

Fast Binary Encoding (2-3x faster)

When you already have binary data (from crypto libraries or databases), use NoteBinary to skip hex decoding:

use notepack::NoteBinary;

let id = [0xaa; 32];
let pubkey = [0xbb; 32];
let sig = [0xcc; 64];
let tags = vec![vec!["t".into(), "nostr".into()]];

let note = NoteBinary {
    id: &id,
    pubkey: &pubkey,
    sig: &sig,
    created_at: 1720000000,
    kind: 1,
    tags: &tags,
    content: "Hello, Nostr!",
};

let bytes = note.pack();           // Returns Vec<u8>
// Or reuse a buffer:
// let mut buf = Vec::new();
// note.pack_into(&mut buf);

Streaming Decode

use notepack::{NoteParser, ParsedField};

let b64 = "notepack_..."; // from wire
let bytes = NoteParser::decode(b64).unwrap();
let parser = NoteParser::new(&bytes);

for field in parser {
    match field.unwrap() {
        ParsedField::Id(id) => println!("id: {}", hex::encode(id)),
        ParsedField::Content(c) => println!("content: {}", c),
        _ => {}
    }
}

Fast Field Access (Zero-Copy Filtering)

For relay workloads that filter millions of events, read specific fields without full deserialization:

use notepack::NoteParser;

// Filter events by kind and author without parsing tags/content
for event_bytes in database.scan() {
    let parser = NoteParser::new(event_bytes);

    // O(1) access to fixed-offset fields
    let pubkey = parser.read_pubkey().unwrap();
    let kind = parser.read_kind().unwrap();

    if kind == 1 && pubkey == &target_pubkey {
        // Only deserialize matching events
        let note = parser.into_note().unwrap();
        results.push(note);
    }
}

Available fast accessors:

  • read_id() - O(1), fixed offset at byte 1
  • read_pubkey() - O(1), fixed offset at byte 33
  • read_sig() - O(1), fixed offset at byte 65
  • read_created_at() - parses 1 varint
  • read_kind() - parses 2 varints
  • read_kind_and_pubkey() - combined for common filter pattern
  • read_created_at_and_kind() - combined for time+type filtering

πŸ’» CLI Usage

The binary is also called notepack.

Encode JSON β†’ notepack string

echo '{"id":"...","pubkey":"...","created_at":123,"kind":1,"tags":[],"content":"Hi","sig":"..."}' \
  | notepack

Decode notepack string β†’ JSON

echo 'notepack_AAECA...' | notepack

πŸ“‚ Project Structure

src
β”œβ”€β”€ SPEC.md         # Full binary format spec
β”œβ”€β”€ error.rs        # Unified error type for encoding/decoding
β”œβ”€β”€ lib.rs          # Crate entrypoint
β”œβ”€β”€ main.rs         # CLI tool: JSON ↔ notepack
β”œβ”€β”€ note.rs         # `Note` struct (Nostr event model)
β”œβ”€β”€ parser.rs       # Streaming `NoteParser`
β”œβ”€β”€ stringtype.rs   # String vs raw byte tags
└── varint.rs       # LEB128 varint helpers

πŸ“œ License

MIT β€” do whatever you want, but attribution is appreciated.


πŸ§ͺ Fuzzing

This repo includes a cargo-fuzz setup under fuzz/ with targets that stress the decoder/parser with arbitrary event content.

Install tooling

cargo install cargo-fuzz
rustup toolchain install nightly
rustup component add llvm-tools-preview --toolchain nightly

Run fuzzers

cd fuzz
cargo +nightly fuzz run notepack_parser

You can also fuzz Base64 decode/prefix handling:

cd fuzz
cargo +nightly fuzz run notepack_decode_string

And you can fuzz encoding (structured NoteBuf generation) + encode/decode roundtrips:

cd fuzz
cargo +nightly fuzz run notepack_encoder

About

A compact binary format and Rust library for encoding and decoding Nostr notes

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Rust 99.3%
  • Just 0.7%