Skip to content

Conversation

@LNSD
Copy link
Contributor

@LNSD LNSD commented Jan 13, 2026

Undo the premature optimization of using only the Hash to deduplicate datasets in the dataset-store cache. This enables future simplifications in the data plane (query engine, data store).

  • Change Dataset.manifest_hash field to Dataset.reference (HashReference)
  • Update dataset cache from HashMap<Hash, Dataset> to HashMap<HashReference, Dataset>
  • Update eth_call cache to also use HashReference as key
  • Update all extractors to receive HashReference in dataset() functions
  • Simplify tracing statements to use reference.short_display()

Undo the premature optimization of using only the Hash to deduplicate
datasets in the dataset-store cache. This enables future simplifications
in the data plane (query engine, data store).

- Change `Dataset.manifest_hash` field to `Dataset.reference` (HashReference)
- Update dataset cache from `HashMap<Hash, Dataset>` to `HashMap<HashReference, Dataset>`
- Update eth_call cache to also use `HashReference` as key
- Update all extractors to receive `HashReference` in `dataset()` functions
- Simplify tracing statements to use `reference.short_display()`

Signed-off-by: Lorenzo Delgado <lorenzo@edgeandnode.com>
@LNSD LNSD requested a review from Theodus January 13, 2026 11:14
@LNSD LNSD self-assigned this Jan 13, 2026
Copy link
Member

@Theodus Theodus left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@LNSD LNSD merged commit 11d2abb into main Jan 13, 2026
9 checks passed
@LNSD LNSD deleted the lnsd/refactor-dataset-store-hash-refs branch January 13, 2026 13:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants