Separate storage and compute #12216

petrosagg · 2022-05-03T11:40:33Z

Motivation

This PR decouples STORAGE and COMPUTE by transiting all source and table data through persist.

On the storage side, when a CreateSources command is received the corresponding ingestion dataflow is immediately instantiated and datat starts flowing into persist. This is in contrast with previous behaviour where storaged process were waiting for RenderSources messages to instantiate ingestion dataflows and publish the data to the TCP boundary. Both the RenderSources command and the TCP boundary are removed. Additionally, storaged processes are no longer responsible for handling table data and the associated Append command is removed. Instead, the storage controller directly appends the data requested to be appended by ADAPTER into the appropriate persist shard.

On the compute side, the DataflowDescription struct has been augmented with an additional storage metadata type which carries all the information required by the storage fat client running in computed processes to reach out to persist and read the correct data. Currently this type (named CollectionMetadata) carries the shard ids for the final storage collection and also the persist location details.

Even though all data is transited through persist, the shard id used is ephemeral. In some sense this PR only goes as far as using persist as a glorified TCP protocol. There will be follow up work that will make the storage controller durably record its choices for persist shard for a particular storage collection id and also change ADAPTER to anticipate storage collection being already there (adapter/storage controller reconciliation).

Fixes #12593

Testing

This PR has adequate test coverage / QA involvement has been duly considered.

Release notes

This PR includes the following user-facing behavior changes:

The previous API required that the user came up with an iterator of references to values but that meant that every called that had owned values had to write unecessary boilerplate. For example if a user had a `Vec<((K, V), T, D)` ready to go they would need to first make this an iterator over `&((K, V), T, D)` and then use a map adaptor to convert that to an iterator over `((&K, &V), &T, &D)`. This patch changes the `append` APIs to accept any combination of owned and borrowed values (a total of 32 combinations) which makes calling these APIs much nicer. The usecase for this change isn't visible in this PR because it is part of the larger MaterializeInc#12216 PR, but I thought I'd break it up for easier review. Signed-off-by: Petros Angelatos <petrosagg@gmail.com>

The previous API required that the user came up with an iterator of references to values but that meant that every called that had owned values had to write unecessary boilerplate. For example if a user had a `Vec<((K, V), T, D)>` ready to go they would need to first make this an iterator over `&((K, V), T, D)` and then use a map adaptor to convert that to an iterator over `((&K, &V), &T, &D)`. This patch changes the `append` APIs to accept any combination of owned and borrowed values (a total of 32 combinations) which makes calling these APIs much nicer. The original usecase for this change isn't visible in this PR because it is part of the larger MaterializeInc#12216 PR, but I thought I'd break it up for easier review. You can see however how the usage becomes nicer in `persist_open_loop_benchmark.rs` and also in `src/persist-client/src/lib.rs`. Signed-off-by: Petros Angelatos <petrosagg@gmail.com>

philip-stoev · 2022-05-08T13:27:07Z

@petrosagg I understand this is the PR to end all PRs, to provide table persistence among other things. Please let me know when I can jump in and start testing it.

src/dataflow-types/src/client/controller/storage.rs

Signed-off-by: Petros Angelatos <petrosagg@gmail.com>

…ble data through `persist`. NOTE: The original PR is at MaterializeInc#12216 On the storage side, when a `CreateSources` command is received the corresponding ingestion dataflow is immediately instantiated and datat starts flowing into `persist`. This is in contrast with previous behaviour where `storaged` process were waiting for `RenderSources` messages to instantiate ingestion dataflows and publish the data to the TCP boundary. Both the `RenderSources` command and the TCP boundary are removed. Additionally, `storaged` processes are no longer responsible for handling table data and the associated `Append` command is removed. Instead, the storage controller directly appends the data requested to be appended by ADAPTER into the appropriate persist shard. On the compute side, the `DataflowDescription` struct has been augmented with an additional storage metadata type which carries all the information required by the storage fat client running in `computed` processes to reach out to `persist` and read the correct data. Currently this type (named `CollectionMetadata`) carries the shard ids for the final storage collection and also the persist location details. Even though all data is transited through `persist`, the shard id used is **ephemeral**. In some sense this PR only goes as far as using persist as a glorified TCP protocol. There will be follow up work that will make the storage controller durably record its choices for persist shard for a particular storage collection id and also change ADAPTER to anticipate storage collection being already there (adapter/storage controller reconciliation). Fixes MaterializeInc#12593

…ble data through `persist`. NOTE: The original PR is at #12216 On the storage side, when a `CreateSources` command is received the corresponding ingestion dataflow is immediately instantiated and datat starts flowing into `persist`. This is in contrast with previous behaviour where `storaged` process were waiting for `RenderSources` messages to instantiate ingestion dataflows and publish the data to the TCP boundary. Both the `RenderSources` command and the TCP boundary are removed. Additionally, `storaged` processes are no longer responsible for handling table data and the associated `Append` command is removed. Instead, the storage controller directly appends the data requested to be appended by ADAPTER into the appropriate persist shard. On the compute side, the `DataflowDescription` struct has been augmented with an additional storage metadata type which carries all the information required by the storage fat client running in `computed` processes to reach out to `persist` and read the correct data. Currently this type (named `CollectionMetadata`) carries the shard ids for the final storage collection and also the persist location details. Even though all data is transited through `persist`, the shard id used is **ephemeral**. In some sense this PR only goes as far as using persist as a glorified TCP protocol. There will be follow up work that will make the storage controller durably record its choices for persist shard for a particular storage collection id and also change ADAPTER to anticipate storage collection being already there (adapter/storage controller reconciliation). Fixes #12593

petrosagg · 2022-05-25T23:07:41Z

superseded by #12715 to get over CLA assistant being stuck

Since MaterializeInc#12216 separated storage and compute by transiting all data via the persist library, the compute layer no longer needs to communicate directly with the storage layer. So remove the configuration parameters that pertained to the old storage-compute network protocol.

Since #12216 separated storage and compute by transiting all data via the persist library, the compute layer no longer needs to communicate directly with the storage layer. So remove the configuration parameters that pertained to the old storage-compute network protocol.

Since MaterializeInc#12216, tables are now entirely handled by the controller. There is no longer a need to send a `CreateSourceCommand` to the `storaged` process for table sources. We should eventually adjust the types here to enforce this statically, but this quick fix will unblock MaterializeInc#12770.

Since #12216, tables are now entirely handled by the controller. There is no longer a need to send a `CreateSourceCommand` to the `storaged` process for table sources. We should eventually adjust the types here to enforce this statically, but this quick fix will unblock #12770.

PR MaterializeInc#12082 converted source tokens to thread-safe `Arc`s to be compatible with the TCP storage/compute boundary, but since MaterializeInc#12216 replaced the TCP boundary with persist we can go back to Rcs.

PR #12082 converted source tokens to thread-safe `Arc`s to be compatible with the TCP storage/compute boundary, but since #12216 replaced the TCP boundary with persist we can go back to Rcs.

…ble data through `persist`. NOTE: The original PR is at MaterializeInc#12216 On the storage side, when a `CreateSources` command is received the corresponding ingestion dataflow is immediately instantiated and datat starts flowing into `persist`. This is in contrast with previous behaviour where `storaged` process were waiting for `RenderSources` messages to instantiate ingestion dataflows and publish the data to the TCP boundary. Both the `RenderSources` command and the TCP boundary are removed. Additionally, `storaged` processes are no longer responsible for handling table data and the associated `Append` command is removed. Instead, the storage controller directly appends the data requested to be appended by ADAPTER into the appropriate persist shard. On the compute side, the `DataflowDescription` struct has been augmented with an additional storage metadata type which carries all the information required by the storage fat client running in `computed` processes to reach out to `persist` and read the correct data. Currently this type (named `CollectionMetadata`) carries the shard ids for the final storage collection and also the persist location details. Even though all data is transited through `persist`, the shard id used is **ephemeral**. In some sense this PR only goes as far as using persist as a glorified TCP protocol. There will be follow up work that will make the storage controller durably record its choices for persist shard for a particular storage collection id and also change ADAPTER to anticipate storage collection being already there (adapter/storage controller reconciliation). Fixes MaterializeInc#12593

petrosagg force-pushed the separate-compute-storage branch from 98bcd18 to ed261ce Compare May 3, 2022 11:57

benesch mentioned this pull request May 4, 2022

dataflow-types: split into STORAGE and COMPUTE #12211

Closed

petrosagg force-pushed the separate-compute-storage branch from 6084c8a to 8897a41 Compare May 4, 2022 10:06

petrosagg mentioned this pull request May 4, 2022

persist: make append functions accept a lot more types #12253

Merged

nmeagan11 mentioned this pull request May 6, 2022

[Epic] De-couple STORAGE and COMPUTE #11434

Closed

petrosagg force-pushed the separate-compute-storage branch 8 times, most recently from f4a581a to eb2d4fb Compare May 10, 2022 12:51

danhhz reviewed May 11, 2022

View reviewed changes

src/dataflow-types/src/client/controller/storage.rs Outdated Show resolved Hide resolved

petrosagg force-pushed the separate-compute-storage branch 10 times, most recently from 665e3aa to 534a49d Compare May 16, 2022 13:33

This comment was marked as resolved.

Sign in to view

petrosagg enabled auto-merge May 25, 2022 17:34

petrosagg disabled auto-merge May 25, 2022 17:34

petrosagg enabled auto-merge (squash) May 25, 2022 17:35

petrosagg force-pushed the separate-compute-storage branch from 5c3a94f to 7112454 Compare May 25, 2022 17:51

petrosagg added 6 commits May 25, 2022 23:48

retire storage/compute boundary

fc54d0a

Signed-off-by: Petros Angelatos <petrosagg@gmail.com>

aljoscha code review

2e3d5f1

attempt to reduce persist consensus contention

1706ba9

Signed-off-by: Petros Angelatos <petrosagg@gmail.com>

slow down default timestamp frequency

7f600a6

Signed-off-by: Petros Angelatos <petrosagg@gmail.com>

rebase fixups

c5bf2fe

ignore no_block test

b33b650

petrosagg force-pushed the separate-compute-storage branch from 7112454 to b33b650 Compare May 25, 2022 21:56

umanwizard mentioned this pull request May 25, 2022

resubmit of 12216 #12715

Merged

petrosagg closed this May 25, 2022

auto-merge was automatically disabled May 25, 2022 23:07
Pull request was closed

benesch mentioned this pull request May 29, 2022

compute: remove storage_addr configuration parameter #12742

Merged

1 task

benesch mentioned this pull request May 31, 2022

storage: don't render sources for tables #12779

Merged

1 task

benesch mentioned this pull request Jun 1, 2022

storage: convert Arc source tokens back to Rcs #12825

Merged

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Separate storage and compute #12216

Separate storage and compute #12216

petrosagg commented May 3, 2022 •

edited

philip-stoev commented May 8, 2022

This comment was marked as resolved.

This comment was marked as resolved.

petrosagg commented May 25, 2022

Separate storage and compute #12216

Separate storage and compute #12216

Conversation

petrosagg commented May 3, 2022 • edited

Motivation

Testing

Release notes

philip-stoev commented May 8, 2022

This comment was marked as resolved.

This comment was marked as resolved.

petrosagg commented May 25, 2022

petrosagg commented May 3, 2022 •

edited