[spec] data schema #126

tzemanovic · 2021-04-12T17:02:48Z

the rendered file: https://github.com/heliaxdev/anoma-prototype/blob/63bae79238ec98b91937833395d5447c3760efcb/tech-specs/src/explore/design/data-schema.md

cwgoes

Generally looks well thought-out; a few questions.

cwgoes · 2021-04-12T21:34:31Z

tech-specs/src/explore/design/data-schema.md

+
+At high level, all the data in the [accounts' dynamic sub-spaces](accounts.md#dynamic-storage-sub-space) is just keys associated with arbitrary bytes. To help the processes that read and write this data (transactions, validity predicates, intents) interpret it and implement interesting functionality on top it, the ledger could provide a way to describe the schema of the stored data.
+
+For storage data encoding, we're currently using the borsh library, which provides a way to derive schema for data that can describe its structure in a very generic way that can easily be consumed in different data-exchange formats such as JSON. In Rust code, the data can be composed with Rust native ADTs (`struct` and `enum`) and basic collection structures (fixed and dynamic sized array, hash map, hash set). Borsh already has a decent coverage of different implementations in e.g. JS and TypeScript, JVM based languages and Go, which we'll hopefully be able to support in wasm in near future too.


Should we implement borsh in Juvix? cc @mariari

cwgoes · 2021-04-12T21:34:54Z

tech-specs/src/explore/design/data-schema.md

+}
+```
+
+When the transaction is applied, the data is stored together with a reference to the derived data schema, e.g.:


Where is the schema stored?

In particular, is this efficient? (we don't want to store the schema along with each copy of the data)

I'm not sure that a transaction can recognize its type as MultiSig when the transaction reads an account.
I think it can be stored with a key like <height>/schema/<address>/.../<var_name> to the DB.
When a transaction reads the var of the account (the key would be <address>/.../<var_name>), it can read the schema entry with the same key.

In this case, I suppose it has to be stored along with each copy because, for example, another MultiSig which has different members for other accounts exists.

Yeah, I think it should be possible to reuse schema definitions by e.g. storing the schemas in storage outside of accounts sub-spaces with some unique identifiers (hash of the schema). Then accounts sub-spaces data could store the identifier to the schema (if specified).

With the MultiSig example, we would store the schema at <height>/schema/<schema_id> and there could be many accounts using this schema with e.g.:

MultiSig encoded data at <height>/subspace/<address>/<sub_key...>

<height>/subspace/<address>/<sub_key...>/schema = <height>/schema/<schema_id>

A possible slight variation would be to have a dedicated special account for schemas, so we could add a validity predicate to it instead of having its logic in the ledger.

I think it would be nice if we could somehow split the storage fee for schemas between its users, so that most commonly used schemas would be very cheap.

yito88

LGTM 👍

tzemanovic added 2 commits April 12, 2021 18:59

[spec] data schema ideas

f57b1dc

fixup! [spec] data schema ideas

63bae79

tzemanovic added the spec label Apr 12, 2021

tzemanovic requested review from yito88 and cwgoes April 12, 2021 17:02

cwgoes reviewed Apr 12, 2021

View reviewed changes

yito88 approved these changes Apr 13, 2021

View reviewed changes

tzemanovic merged commit 7151a36 into master Apr 14, 2021

tzemanovic deleted the tomas/data-schema-design branch April 14, 2021 07:40

tzemanovic mentioned this pull request Apr 15, 2021

Tracking: base ledger prototype version 3 #125

Closed

tzemanovic mentioned this pull request Jan 10, 2022

implement storage data schema #780

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[spec] data schema #126

[spec] data schema #126

tzemanovic commented Apr 12, 2021 •

edited

Loading

cwgoes left a comment

cwgoes Apr 12, 2021

cwgoes Apr 12, 2021

cwgoes Apr 12, 2021

yito88 Apr 13, 2021

tzemanovic Apr 13, 2021

yito88 left a comment


		At high level, all the data in the [accounts' dynamic sub-spaces](accounts.md#dynamic-storage-sub-space) is just keys associated with arbitrary bytes. To help the processes that read and write this data (transactions, validity predicates, intents) interpret it and implement interesting functionality on top it, the ledger could provide a way to describe the schema of the stored data.

		For storage data encoding, we're currently using the borsh library, which provides a way to derive schema for data that can describe its structure in a very generic way that can easily be consumed in different data-exchange formats such as JSON. In Rust code, the data can be composed with Rust native ADTs (`struct` and `enum`) and basic collection structures (fixed and dynamic sized array, hash map, hash set). Borsh already has a decent coverage of different implementations in e.g. JS and TypeScript, JVM based languages and Go, which we'll hopefully be able to support in wasm in near future too.

[spec] data schema #126

[spec] data schema #126

Conversation

tzemanovic commented Apr 12, 2021 • edited Loading

cwgoes left a comment

Choose a reason for hiding this comment

cwgoes Apr 12, 2021

Choose a reason for hiding this comment

cwgoes Apr 12, 2021

Choose a reason for hiding this comment

cwgoes Apr 12, 2021

Choose a reason for hiding this comment

yito88 Apr 13, 2021

Choose a reason for hiding this comment

tzemanovic Apr 13, 2021

Choose a reason for hiding this comment

yito88 left a comment

Choose a reason for hiding this comment

tzemanovic commented Apr 12, 2021 •

edited

Loading