
Consider issuing DDL commands on the master before the worker nodes #357

Closed
4 of 7 tasks
samay-sharma opened this issue Feb 23, 2016 · 3 comments

samay-sharma commented Feb 23, 2016

@anarazel mentioned:

"Executing the DDL on the workers before doing so on the master strikes me as rather fragile. There's always going to be problems without using 2PC, but if we were to do this on the master first, we'd at least have a chance to mark the shards where the statement failed as broken. This way round there's no way of doing that."

The rationale for doing this on the worker nodes first was to verify that the command can go through on at least one shard before we commit to doing it on the master.

However, since we can roll back on the master node if the command fails on the first worker node, we could run it on the master first. There don't seem to be any advantages to doing it on the workers before the master. Executing it on the master first gives us the ability to roll back, which handles certain failure cases better.

Plan

  • Reorganize multi_ProcessUtility to better separate DDL mode from other modes
  • Determine call sites which modify parse tree
  • Refactor modifications into separate methods, as needed
  • Change call order of standard_ProcessUtility to happen first
  • Evaluate locking semantics
  • Verify no unit tests break
  • Run against PostgreSQL tests

lithp commented Mar 29, 2016

I think this is the real way to fix #350. Its PR is just a quick fix for this underlying problem.

jasonmp85 commented

Finally getting around to doing a brain dump here. Basically, as @anarazel has pointed out, it would be nicer to execute the master's DDL command first as this allows some piggybacking on the locking semantics already provided by PostgreSQL.

The existing ways we handle DDL commands in the utility hook are:

  • ProcessIndexStmt
  • ProcessDropIndexStmt
  • ProcessAlterTableStmt
  • ProcessAlterObjectSchemaStmt
  • WorkerProcessAlterTableStmt (runs on worker against distributed table for certain commands)

Each of these returns a possibly-modified parse tree which is subsequently executed by a call to standard_ProcessUtility. Other non-DDL commands are also processed by multi_ProcessUtility:

  • FORMAT = 'transmit' statements
  • ProcessCopyStmt (performs copy, then potentially runs local copy)
  • EXPLAIN EXECUTE
  • ProcessVacuumStmt

After poking around the various Process- functions, the scope of this is:

  1. Refactor multi_ProcessUtility to separate concerns (verification, caveats, utilities, DDL). This function is presently nearly 300 lines, and breaking it up will make the modes it operates in easier to understand (some functions will still need standard_ProcessUtility to run after certain removal operations)
  2. Determine parse modifications performed by each DDL-related Process- function
  3. Break up parse modifications into separate functions
  4. Change flow to: modify, execute (local), execute (remote)
  5. Evaluate removal of any shard-specific locking in favor of PostgreSQL's built-in locking

I'm putting a checklist up top with what I've done and am doing. Estimated time: PR out today.

aamederen commented

@jasonmp85 I think the execute (remote) part can be split into execute (worker distributed table) and execute (shards).
