Change Default Write Distribution Mode #6679

Closed
RussellSpitzer opened this issue Jan 27, 2023 · 12 comments · Fixed by #6828

@RussellSpitzer (Member) commented Jan 27, 2023

Feature Request / Improvement

Merge writes, as well as some inserts, end up generating many files with our default write distribution mode of None. While this is the cheapest method and matches our old default behavior, we now have several reasons to default to Range (or Hash):

  1. Spark AQE now has both skew handling and adaptive partition coalescing.
  2. With merge operations, None is never the correct mode to request since we are always shuffling anyway.
  3. More users are coming to Iceberg who don't understand how Spark partitioning works (an understanding that is required to get good performance with the default of None).

I suggest we change the default distribution mode to Range and add some documentation on configuring AQE to the Spark docs. I think this will be better behavior for most first-time users, and power users can still manually configure a different mode for their specific requirements.
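
For illustration, a minimal sketch of the knobs involved, assuming a hypothetical Iceberg table db.tbl (the AQE settings and the write.distribution-mode table property already exist; only the table name is made up):

-- AQE settings relevant to the proposed default (enabled by default in recent Spark releases)
SET spark.sql.adaptive.enabled = true;
SET spark.sql.adaptive.coalescePartitions.enabled = true;
SET spark.sql.adaptive.skewJoin.enabled = true;

-- Power users can still override the mode for a specific table
ALTER TABLE db.tbl SET TBLPROPERTIES ('write.distribution-mode' = 'none');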

Query engine

Spark

@RussellSpitzer (Member, Author) commented Jan 27, 2023

@aokolnychyi + @danielcweeks + @rdblue + @jackye1995 + @szehon-ho

Please ping anyone else who would have strong opinions about this change as well.

@dramaticlly (Contributor) commented:

Thank you @RussellSpitzer. I understand where this change is coming from, but some GDPR-like deletions on V1 tables benefit from the None write distribution mode (to avoid shuffle where possible). I am aware that we can currently configure this by setting table properties such as write.delete.distribution-mode or write.update.distribution-mode, but I am wondering whether there is any way to configure it at the per-Spark-job level (deletes are done via SQL only, which makes this harder).
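
As a concrete sketch of the per-table workaround mentioned above (the property names are existing Iceberg table properties; the table name db.tbl is hypothetical):

-- Keep deletes and updates shuffle-free on this table, independent of the global default
ALTER TABLE db.tbl SET TBLPROPERTIES (
  'write.delete.distribution-mode' = 'none',
  'write.update.distribution-mode' = 'none'
);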

@RussellSpitzer (Member, Author) commented:

The "none" mode in GDPR cases still only helps in case in which the data has already been aligned with the partitioning of the table. This is rarely the case in my experience.

@jackye1995 (Contributor) commented:

+1 for using range as the default. Overall, we probably need a dedicated section in the Iceberg Spark documentation about how to configure these parameters so people can make informed decisions.

@singhpk234 (Contributor) commented:

+1 on changing the default from none and having a dedicated doc section for configuring these. Happy to contribute to this if possible.


Side note: I also see write.merge.distribution-mode and write.update.distribution-mode missing from the table properties section of the docs: https://iceberg.apache.org/docs/latest/configuration/

@dramaticlly (Contributor) commented Jan 28, 2023

> Side note: I also see write.merge.distribution-mode and write.update.distribution-mode missing from the table properties section of the docs: https://iceberg.apache.org/docs/latest/configuration/

Yeah @singhpk234, I noticed that before and made an attempt to fix it in #5280, but I need some help with the merge case to provide a better narrative.

@rdblue (Contributor) commented Jan 28, 2023

+1 for range as the default.

@aokolnychyi (Contributor) commented:

I would be careful with range as it may cause performance regressions, especially for MERGE. The range distribution requires sampling, which leads to double scanning and re-evaluating particular nodes in the plan. This would cause the same issue we have today: a default that performs poorly.

The upcoming Spark 3.4 supports rebalancing partitions via AQE for hash distributions requested by v2 writes. That means we can request a hash distribution without worrying about having too much data per task and hitting OOMs. I'd rather switch to hash as the default and let users reconfigure if it fails. I don't know a single use case where the range distribution performs well in MERGE at any reasonable scale.
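
For reference, a sketch of what requesting a hash distribution for MERGE looks like today via table properties (write.merge.distribution-mode is an existing Iceberg table property; the table name db.tbl is hypothetical):

-- Request hash distribution for MERGE writes on this table;
-- with AQE, Spark can rebalance the requested distribution to avoid oversized tasks
ALTER TABLE db.tbl SET TBLPROPERTIES ('write.merge.distribution-mode' = 'hash');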

@aokolnychyi (Contributor) commented:

We have examples in TestSparkDistributionAndOrderingUtil that should become a section in the docs.

@RussellSpitzer (Member, Author) commented:

@dramaticlly Did you want to write up another issue for specifying the write distribution mode as a Spark SqlConf option?

@JunchengMa commented:

+1 on @dramaticlly's comment. Changing the write distribution mode affects Spark job performance (it causes heavy shuffle) when using Spark SQL such as

DELETE FROM db_name.tbl_name WHERE date < '20220801'

or

UPDATE db_name.tbl_name SET col_a = NULL WHERE date <= '20220801'

Setting write.delete.distribution-mode=none and write.update.distribution-mode=none as table properties would reduce shuffle, but could affect other normal jobs writing to the same table.
So having a per-job option for specifying the write distribution mode would be ideal.

@aokolnychyi (Contributor) commented:

I will submit a PR to change the default distribution modes for insert and merge. I'll also be happy to review a PR for #6741.
