Add initial specification for block stream data. #342

jsync-swirlds · 2024-04-19T22:10:57Z

Update specification to current standards

Corrected bad copy/paste of copyright header
Added file prefix comment with keyword section.
Adjusted names to remove redundant suffix
Moved files from streams/v7 to block/stream.
Commented out unused imports
Fixed java_package and pbj.java_package to both be
com.hedera.hapi.block.stream. There is no legacy code
that needs a different package for block stream.
Changed package directive to com.hedera.hapi.block.stream as well.
- "main" messages prefixed with proto due to poor package choice in legacy
  design. Hopefully we can fix that soon.
Started on service outputs
- consensus_service.proto
- util_service.proto
- crypto_service.proto
- file_service.proto
- network_service.proto -- renamed from misc_output.proto
- schedule_service.proto
- smart_contract_Service.proto
- token_service.proto -- Much of this is yet to be designed
Cleaned up related items
- transaction_result.proto
Cleaned up transaction_output.proto.
- Removed all currently unused options.
- Adjusted ordering for better efficiency and clarity.
- Reserved field numbers of all remaining transaction types
  for future use.
- Documented, in an HTML comment, field types, names, and numbers
  for all remaining outputs.
Cleaned up remaining items in several other block stream files.
- This is a general "editing" update to clean up and clarify
  the text. This may include small structural changes for
  both clarity and data efficiency.

Adjust and progress the design.

Updated proto design and documentation
- state_changes.proto
- system_transaction.proto
- block.proto
- block_item.proto
- block_header.proto
- block_state_proof.proto
- block_service.proto
Added block_info_state_proof.proto for the block info state proof
which is intended to replace the running hashes state proof in
block_state_proof.proto.
- Old version removed and new version renamed to block_state_proof.proto
  after discussion on 2024-04-11.
Changed most int64 values to uint64 as they are never
permitted to be negative.
Changed one int32 value to uint32 as it is never negative.
Removed substantially everything from services outputs
- These are in state changes, and outputs should not
  include anything from state changes.
Added a server status request to block_service.proto.
- This provides clients a mechanism to learn what blocks are
  available on a block node, whether the node offers
  historical snapshots, and the current price list for that
  block node server.

Remove payment from block_service.proto.

Block service payment and authentication are separate from the block service API
and not normative. We will define a recommended authentication and payment
process for a block node separately from the block stream specifications.

Non-Normative Documents

Added processed markdown output in documents/api/block/ folder for ease of review
- These documents will be generated by a build process in the future, and removed
  from the codebase before merging this PR.

Work completed by Nick Poorman

Document Block Node Messages.

add block stream protos
add new lines at the end of files
add v7 to package path
use relative path for proto imports
comment out empty proto messages
change the property to match for other token calls
Move all Block level props into BlockItem so it can be streamed. Add rpc for blocksPutIfAbsent.
Import BlockHeader into BlockItem
Import BlockStateProof into BlockItem
Fixed imports
Maybe if they are in a different file?
That didn't work. Going to have to look at the template code
I suspect it's because we don't have any streaming requests yet
Let's try making it the same as the others
Found an issue saying maybe change the file name
Adding to java_multiple_files option to see if that works
Needs to be a stream of BlockItems
Fixed typo for running hashes
Updated streams v7 protos from working notion doc
Added start running hash to the BlockHeader
Added versions of block, hapi, services, and platform to the BlockHeader
Extracted sidecar output to match current outputs which are multiple sidecars
Using the already created TransactionSidecarRecord
Added SystemTransaction to BlockItem
Renaming System transaction messages to avoid collision
Fix StateSignatureSystemTransaction to remove duplicate hash value
Optimized by including BlockHashAlgorithm and BlockSignatureAlgorithm in the BlockHeader for the whole block
Optimized by removing SignatureObject and replacing with hash and signature bytes property
Guess protobuf doesn't like having the same enum values
Fixed comment for birth_round to be accurate
Updated state change protos to surface state change block items
Add the delete operations
Fix numbers
Change type to plural version
Prefix enum values
We seem to be serializing string values in tests
Need to verify if AccountID should be a key type or not. Putting it in for now so I can get tests passing
Worked out all the protos needed for various state changes
import exchange_rate.proto
import recordcache.proto
add ProtoString to store types
fix typo
Make slot_value consistent
Fix QueuePushChange types
fixed change_operation
Switched to using element instead of value for the elements in a queue
Add missing type to SingletonUpdateChange
Add the round which the state proof was constructed for
Working on block node API design
Cleaned it up

jsync-swirlds · 2024-04-19T22:17:58Z

This draft PR is not intended to be merged. This is a vehicle to gather more detailed and extensive feedback and review, which will then be incorporated into a future PR with a much wider audience.

Nana-EC

1st pass comments.
Thanks for the cleanup and I like the idea to add MD files

Nana-EC · 2024-05-16T01:40:02Z

block/block_service.proto

+     * The block node SHALL send one `ItemAcknowledgement` for each `BlockItem`
+     * received and verified.<br/>
+     */
+    message ItemAcknowledgement {


Q: What's the use case for an item level acknowledgement?
Doesn't a block contain multiple BlockItems? So shouldn't a single Block acknowledgement be sufficient?

We are sending individual block items in a stream, so logically the acknowledgement should be per item. If we wait to only acknowledge the entire block, then both sides are required to retain the full block in memory until it's fully sent, and that imposes costs and requirements we may not want to impose in all cases.
We should also have block level acknowledgement, but that should not be the only acknowledgement.
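To illustrate the reply above, here is a minimal Python sketch of a receiver that acknowledges every item as it arrives and adds a block-level acknowledgement once the proof is seen. The class and field names are hypothetical stand-ins rather than the messages defined in `block_service.proto`, and hashing the payload only stands in for real verification.

```python
import hashlib
from dataclasses import dataclass
from typing import Iterable, Iterator, Union


# Hypothetical stand-ins for the protobuf messages; field names are assumptions.
@dataclass(frozen=True)
class BlockItem:
    kind: str       # "header", "proof", or any other value for block content
    payload: bytes


@dataclass(frozen=True)
class ItemAcknowledgement:
    item_hash: bytes


@dataclass(frozen=True)
class BlockAcknowledgement:
    items_in_block: int


def acknowledge(
    items: Iterable[BlockItem],
) -> Iterator[Union[ItemAcknowledgement, BlockAcknowledgement]]:
    """Yield one acknowledgement per item, plus one per completed block.

    Only a counter survives between items, so the receiver never needs to
    hold a whole block in memory while it is being sent.
    """
    count = 0
    for item in items:
        if item.kind == "header":
            count = 0  # a header starts a new block
        count += 1
        # Hashing is a placeholder for whatever verification the node performs.
        yield ItemAcknowledgement(hashlib.sha384(item.payload).digest())
        if item.kind == "proof":
            yield BlockAcknowledgement(count)
```

With per-item acknowledgements the sender can release each item as soon as it is acknowledged; the block-level acknowledgement is then an extra signal rather than the only one.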

Nana-EC · 2024-05-16T01:49:56Z

block/stream/transaction_result.proto

+     *   <li>Any transfers caused by the creation of threshold records</li>
+     * </ul>
+     */
+    proto.TransferList transfer_list = 7;


transfer_list seems like it would be better placed in a general TransactionOutput
Same for token_transfer_lists and automatic_token_associations.

The only issue I see with the transfer lists is that it's present for every transaction, as that's where charged fees are recorded. Now, with state changes in separate block items, it's possible we won't have that (though certain huge accounts will be in every transaction state changes instead, which has its own issues in the short term).

This is something that was undecided at the last discussion I was invited to.

Nana-EC · 2024-05-16T01:54:46Z

block/stream/transaction_result.proto

+ * >> TokenTransfer output would also need custom fees, and we may wish
+ * >> to add custom fees to other transactions in the future.
+ */
+message TransactionResult {


TransactionOutput vs TransactionResult.
It's not clear what each should contain.
TransactionResult feels like it's transaction metadata from the network's point of view at the time of processing.
TransactionOutput feels like it's newly created state as a result of the transaction that is not in the body.

Neither should be state, as we have state changes in a separate structure.
The rest is design that's in flux. This PR is far from final; it's just a step along the road to get to a final specification.

It is entirely possible that all, or nearly all, of TransactionOutput is subsumed in StateChanges and no longer relevant.

Nana-EC · 2024-05-16T02:00:05Z

documents/api/block/stream/block.md

+### Block
+A single complete Hedera block chain block.
+
+This is a single block structure and SHALL NOT represent the primary


Is the goal here to say that BlockItems are streamed, not Block objects?
So a streaming client wouldn't know it has a complete block without the BlockHeader and BlockStateProof delimiters?

Is the reasoning size or something else?

The impact is that clients need to keep track of the separators to create a meaningful Block aggregation.

Absolutely.
The stream is individual block items, not entire blocks.
Jasper has a good description for why we chose that approach, but it comes down to efficiency and resilience, for the most part.

A client that is streaming will only handle full blocks if the client chooses to do so, but if it does it will require both header and proof to delimit the block.

Clients that handle full blocks need to handle validating the block proof and may need to maintain an entire mirror state as well (if it's a stateful node), so tracking when a block starts and ends isn't hard in context.

Note, for "casual" clients, who only want full blocks, the expectation is that the client will just request a single block at a time from a block node, rather than requesting a stream from the block node.
(We expect push streams to be restricted to specific consensus nodes sending to specific block nodes, at least initially).
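As a rough illustration of the delimiting described above, the following Python sketch groups a stream of items into blocks, treating a header as the start of a block and a proof as its end. The types are hypothetical simplifications, not the actual `BlockItem` message, and proof validation is left out entirely.

```python
from dataclasses import dataclass, field
from typing import Iterable, Iterator, List, Optional


# Hypothetical simplification of a block item; "kind" is an assumed field.
@dataclass(frozen=True)
class BlockItem:
    kind: str            # "header", "proof", or any other value for content
    payload: bytes = b""


@dataclass
class Block:
    items: List[BlockItem] = field(default_factory=list)


def assemble_blocks(stream: Iterable[BlockItem]) -> Iterator[Block]:
    """Group streamed items into blocks delimited by header and proof."""
    current: Optional[Block] = None
    for item in stream:
        if item.kind == "header":
            if current is not None:
                raise ValueError("header received before the previous proof")
            current = Block()
        elif current is None:
            raise ValueError("item received outside of a block")
        current.items.append(item)
        if item.kind == "proof":
            yield current  # the proof closes the block
            current = None
    # A trailing block without a proof is incomplete and is not yielded.
```

A client that only cares about individual items can skip this step and process each item as it arrives.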

Nana-EC · 2024-05-16T02:32:06Z

block/block_service.proto

+     * Block node implementations SHOULD charge increased fees for such
+     * "future" streams.
+     */
+    uint64 end_block_number = 2;


We should consider adding a limit field. A subscriber would define one or the other, if at all.

What would a limit look like, if not the block number at which to end the stream?

Nana-EC · 2024-05-16T02:36:35Z

block/block_service.proto

+     * The reported status SHALL reflect the success of the request, or
+     * a detailed reason the request failed.
+     */
+    SingleBlockResponse.ResponseCode status = 1;


In SubscribeStreamResponse you had a oneof here. Should the two responses be similar in this case, either both a oneof or both separate fields?

Stream responses are sent many times for a single request, and there are a couple different response types possible because we need to send data items and result codes, but cannot join the two together (they're sent at different times).

The Single block response is sent exactly once per request, and has only one possible response type which includes both data and result code.
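The difference in shape can be sketched in Python as follows. This is only a model of the idea: the enum values, the `StreamItem` and `StreamStatus` names, and the choice to send the status as the last stream response are assumptions for illustration, not part of the proto files under review.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Iterator, List, Optional, Union


class ResponseCode(Enum):
    SUCCESS = 0
    NOT_AVAILABLE = 1  # hypothetical value


@dataclass(frozen=True)
class StreamItem:      # the "data" arm of the oneof
    payload: bytes


@dataclass(frozen=True)
class StreamStatus:    # the "result code" arm of the oneof
    status: ResponseCode


# Each stream response is exactly one of the two arms.
SubscribeStreamResponse = Union[StreamItem, StreamStatus]


@dataclass(frozen=True)
class SingleBlockResponse:
    # Sent once per request, so status and data travel together.
    status: ResponseCode
    block: Optional[List[bytes]] = None


def subscribe(items: List[bytes]) -> Iterator[SubscribeStreamResponse]:
    """Many responses per request: data items first, then a status."""
    for payload in items:
        yield StreamItem(payload)
    yield StreamStatus(ResponseCode.SUCCESS)


def single_block(items: Optional[List[bytes]]) -> SingleBlockResponse:
    """Exactly one response per request, carrying both status and data."""
    if items is None:
        return SingleBlockResponse(ResponseCode.NOT_AVAILABLE)
    return SingleBlockResponse(ResponseCode.SUCCESS, items)
```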

Nana-EC · 2024-05-16T02:40:54Z

block/block_service.proto

+     * of the request as possible in the event payment is not sufficient to
+     * complete the request.
+     */
+    rpc subscribeBlockStream(stream SubscribeStreamRequest) returns (stream SubscribeStreamResponse);


Q: Why a bi-directional stream for the subscribe?
Wouldn't a client make a single request and then receive the stream? Why would it keep sending SubscribeStreamRequest messages if blocks are delivered in order?

The subscriber is receiving BlockItems, not Blocks and, though design is not complete for this, will likely need to acknowledge items as they're received. Also, a requester might request blocks 1-5, then request 6-9, and so forth, continuing a stream as long as needed.

Streams aren't sent all at once, and can last a rather long time (perhaps hours).
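A small Python sketch of the "request 1-5, then 6-9" pattern on one long-lived stream is shown below. It is a toy model under stated assumptions: `start_block_number` is a hypothetical field, the end of the range is assumed to be inclusive, and plain block numbers stand in for the BlockItems a real node would send.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator


@dataclass(frozen=True)
class SubscribeStreamRequest:
    start_block_number: int  # hypothetical field name
    end_block_number: int    # assumed inclusive in this sketch


def serve(requests: Iterable[SubscribeStreamRequest]) -> Iterator[int]:
    """Answer each request on the same stream, in the order received."""
    for request in requests:
        if request.end_block_number < request.start_block_number:
            raise ValueError("end_block_number precedes start_block_number")
        # Block numbers stand in for the items a real node would stream.
        yield from range(
            request.start_block_number, request.end_block_number + 1
        )
```

Because `serve` pulls requests lazily, a caller can pass a generator that decides on the next range only after consuming the previous one, which is the behavior a bi-directional stream makes possible.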

jsync-swirlds · 2024-04-25T23:48:18Z

block/block_service.proto

+     * the latest available, but MUST clearly document this behavior.
+     */
+    rpc stateSnapshot(StateSnapshotRequest) returns (StateSnapshotResponse);
+


Note: We probably need an API here to request a single item or subtree from "current" state (do we need "historical state"?).
Likely also need an API for "how big is block stream from block G to block Y".


Work In Progress

* Updates to design _and_ other details
* `transaction_output`
* Added three options for communicating "sibling" hash _order_ in the state proof (`block_state_proof.proto`).
* Rework of `event_metdata`
* Platform is creating protobufs for events, and we need to reuse those here as they are completed.

Temporary Changes for review purposes

* Committed generated documentation for review purposes

Signed-off-by: Joseph Sinclair <joseph.sinclair@swirldslabs.com>



* Modified all copyright notices to remove unnecessary text and match general guidelines.
* Fixed a few items noted in offline review prior to DevCon-24a
* Reformat to match current guidelines for line length and field descriptions.

Signed-off-by: Joseph Sinclair <joseph.sinclair@swirldslabs.com>

* Moved "input" item to "input" folder and package.
* Moved "output" items to "output" folder and package.
* Specified the "merkle" approach to generating the block hash in the block header.
* Added Jasper's diagram for this process in documents.

Signed-off-by: Joseph Sinclair <joseph.sinclair@swirldslabs.com>

* Updated block header and items with review changes
* Updated state changes to support add and remove for named states
* Updated block service with review changes
* Major rebuild of block state proof to work with both the "block merkle tree" concept and a single TSS-BLS signature instead of 30+ individual RSA signatures.

Signed-off-by: Joseph Sinclair <joseph.sinclair@swirldslabs.com>

jsync-swirlds force-pushed the continue-block-node branch from ab5cec7 to c49a233 on April 19, 2024 22:14

jsync-swirlds self-assigned this Apr 19, 2024

jsync-swirlds force-pushed the continue-block-node branch 4 times, most recently from 7c15751 to af0ddd0 on May 14, 2024 15:37

Nana-EC reviewed May 16, 2024

jsync-swirlds commented May 16, 2024

jsync-swirlds force-pushed the continue-block-node branch 3 times, most recently from 20f1c85 to 2ad0221 on May 22, 2024 20:16

jsync-swirlds force-pushed the continue-block-node branch 5 times, most recently from 4a16bf1 to 5028483 on June 5, 2024 22:59

jsync-swirlds force-pushed the continue-block-node branch 2 times, most recently from c9bc6df to efdafef on June 18, 2024 00:26

nickpoorman and others added 7 commits June 18, 2024 16:33

jsync-swirlds force-pushed the continue-block-node branch from efdafef to ef61ded on June 18, 2024 23:43
