
Send DDL commands in parallel to worker nodes. #131

Closed
sumedhpathak opened this issue Feb 3, 2016 · 4 comments
@sumedhpathak (Contributor)

DDL commands can be slow, so users with a high number of shards may have to wait a long time for the DDL commands to complete.

Consider sending these commands in parallel to the worker nodes.

@ozgune (Contributor) commented Aug 1, 2016

I'm copy/pasting @aamederen's notes to this issue.

This logic is in the multi_utility.c file. The main call hierarchy is:

ExecuteDistributedDDLCommand
  → ExecuteCommandOnWorkerShards
    → ExecuteCommandOnShardPlacements
      → PQexec

For this change, I think we need to use PQsendQuery instead of PQexec in ExecuteCommandOnShardPlacements and add a result-collecting function, similar to the SendCommandToWorkersInParallel function in MX.
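
For illustration, here is a minimal sketch of that approach, assuming an array of already-open libpq connections (one per shard placement); the function name and parameters are hypothetical and not existing Citus code:

```c
/*
 * Hypothetical sketch: dispatch the same DDL command to every shard
 * placement connection with PQsendQuery, then collect the results
 * afterwards, instead of blocking on each placement with PQexec.
 */
#include <stdbool.h>
#include <libpq-fe.h>

static bool
SendDDLCommandInParallel(PGconn **connections, int connectionCount,
                         const char *ddlCommand)
{
    bool allSucceeded = true;
    int i = 0;

    /* phase 1: dispatch the command on every placement connection without waiting */
    for (i = 0; i < connectionCount; i++)
    {
        if (!PQsendQuery(connections[i], ddlCommand))
        {
            allSucceeded = false;
        }
    }

    /* phase 2: collect results; PQgetResult returns NULL once a command is done */
    for (i = 0; i < connectionCount; i++)
    {
        PGresult *result = NULL;

        while ((result = PQgetResult(connections[i])) != NULL)
        {
            if (PQresultStatus(result) != PGRES_COMMAND_OK)
            {
                allSucceeded = false;
            }

            PQclear(result);
        }
    }

    return allSucceeded;
}
```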

@ozgune (Contributor) commented Aug 1, 2016

@aamederen @marcocitus -- I have a question on the DDL propagation changes. Let's say the user has a table with 10K shards. They then run an ALTER TABLE. Do we open 10K connections to worker nodes? If the user wanted to use 2PC, do we then need to have max_prepared_transactions set to at least 10K?

If we do, could you open another issue to run a few safety checks before we propagate DDL changes? That way, if we think the cluster doesn't have the resources needed to propagate these changes, we error out early. (What happens today if we don't?)
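
As an illustration of the kind of early check such an issue could cover, here is a rough sketch, assuming we can query the worker's max_prepared_transactions setting over an existing connection before propagating; the function name is hypothetical:

```c
/*
 * Hypothetical sketch: before propagating a 2PC DDL command, verify that the
 * worker's max_prepared_transactions setting can accommodate the number of
 * prepared transactions we are about to create, so we can error out early.
 */
#include <stdbool.h>
#include <stdlib.h>
#include <libpq-fe.h>

static bool
WorkerHasEnoughPreparedTransactionSlots(PGconn *workerConnection,
                                        int requiredSlotCount)
{
    bool hasEnoughSlots = false;
    PGresult *result = PQexec(workerConnection, "SHOW max_prepared_transactions");

    if (PQresultStatus(result) == PGRES_TUPLES_OK && PQntuples(result) == 1)
    {
        int configuredSlotCount = atoi(PQgetvalue(result, 0, 0));

        hasEnoughSlots = (configuredSlotCount >= requiredSlotCount);
    }

    PQclear(result);
    return hasEnoughSlots;
}
```

In practice such a check would presumably also have to account for prepared transaction slots that are already in use on the worker.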

@marcocitus (Member) commented Aug 1, 2016

It would end up opening a connection for each shard placement, and enough connection slots and prepared transaction slots would need to be available on the worker for that to succeed. Otherwise, it would error out on the worker and roll back, potentially causing some queries to error out as well. A safety check would probably be worth adding. An alternative approach would be to open a fixed number of connections per worker and run multiple DDL commands per connection. A drawback of that approach is that it significantly complicates transaction and connection management, since it creates a somewhat arbitrary mapping between sessions and the locks held by each session.
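
A rough sketch of that fixed-pool alternative, with hypothetical names, deliberately ignoring the transaction and lock bookkeeping that makes it complicated in practice:

```c
/*
 * Hypothetical sketch of the fixed-pool alternative: execute per-shard DDL
 * commands over a small, fixed number of worker connections in rounds, so a
 * table with 10K shards does not require 10K connections.
 */
#include <stdbool.h>
#include <libpq-fe.h>

static bool
ExecuteShardCommandsOverFixedPool(PGconn **poolConnections, int poolSize,
                                  char **shardDdlCommands, int shardCount)
{
    bool allSucceeded = true;
    int roundStart = 0;

    /* process the commands in rounds of at most poolSize parallel statements */
    for (roundStart = 0; roundStart < shardCount; roundStart += poolSize)
    {
        int remaining = shardCount - roundStart;
        int roundSize = (remaining < poolSize) ? remaining : poolSize;
        int i = 0;

        /* dispatch one shard command per pooled connection */
        for (i = 0; i < roundSize; i++)
        {
            if (!PQsendQuery(poolConnections[i], shardDdlCommands[roundStart + i]))
            {
                allSucceeded = false;
            }
        }

        /* wait for the round to finish before reusing the connections */
        for (i = 0; i < roundSize; i++)
        {
            PGresult *result = NULL;

            while ((result = PQgetResult(poolConnections[i])) != NULL)
            {
                if (PQresultStatus(result) != PGRES_COMMAND_OK)
                {
                    allSucceeded = false;
                }

                PQclear(result);
            }
        }
    }

    return allSucceeded;
}
```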

@ozgune added this to the 5.3 Release milestone on Aug 9, 2016
@sumedhpathak (Contributor, Author)

We will open up a separate PR after PR #855 is merged, as that sets up the infrastructure for this change.
