[MINOR][SQL] Update the DataFrameWriter.bucketBy comment #27930

maropu · 2020-03-17T02:07:47Z

What changes were proposed in this pull request?

This PR intends to update the DataFrameWriter.bucketBy comment for clearly describing that the bucketBy scheme follows a Spark "specific" one.

I saw the questions about the current bucketing compatibility with Hive in SPARK-31162 and SPARK-17495 from users and IMHO the comment is a bit confusing to users about the compatibility

Why are the changes needed?

To make users understood smoothly.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

N/A

sql/core/src/main/scala/org/apache/spark/sql/DataFrameWriter.scala

SparkQA · 2020-03-17T06:50:12Z

Test build #119902 has finished for PR 27930 at commit 022bb72.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2020-03-17T07:05:01Z

Test build #119910 has finished for PR 27930 at commit a1bfa96.

This patch fails due to an unknown error code, -9.
This patch merges cleanly.
This patch adds no public classes.

dongjoon-hyun · 2020-03-17T07:50:09Z

Retest this please.

dongjoon-hyun

+1, LGTM. Merged to master/3.0/2.4.
Thank you, @maropu and @cloud-fan .

### What changes were proposed in this pull request? This PR intends to update the `DataFrameWriter.bucketBy` comment for clearly describing that the bucketBy scheme follows a Spark "specific" one. I saw the questions about the current bucketing compatibility with Hive in [SPARK-31162](https://issues.apache.org/jira/browse/SPARK-31162?focusedCommentId=17060408&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17060408) and [SPARK-17495](https://issues.apache.org/jira/browse/SPARK-17495?focusedCommentId=17059847&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17059847) from users and IMHO the comment is a bit confusing to users about the compatibility ### Why are the changes needed? To make users understood smoothly. ### Does this PR introduce any user-facing change? No. ### How was this patch tested? N/A Closes #27930 from maropu/UpdateBucketByComment. Authored-by: Takeshi Yamamuro <yamamuro@apache.org> Signed-off-by: Dongjoon Hyun <dongjoon@apache.org> (cherry picked from commit 124b4ce) Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>

maropu · 2020-03-17T08:01:13Z

Thanks, guys!

### What changes were proposed in this pull request? This PR intends to update the `DataFrameWriter.bucketBy` comment for clearly describing that the bucketBy scheme follows a Spark "specific" one. I saw the questions about the current bucketing compatibility with Hive in [SPARK-31162](https://issues.apache.org/jira/browse/SPARK-31162?focusedCommentId=17060408&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17060408) and [SPARK-17495](https://issues.apache.org/jira/browse/SPARK-17495?focusedCommentId=17059847&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17059847) from users and IMHO the comment is a bit confusing to users about the compatibility ### Why are the changes needed? To make users understood smoothly. ### Does this PR introduce any user-facing change? No. ### How was this patch tested? N/A Closes apache#27930 from maropu/UpdateBucketByComment. Authored-by: Takeshi Yamamuro <yamamuro@apache.org> Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>

Fix

022bb72

maropu commented Mar 17, 2020

View reviewed changes

sql/core/src/main/scala/org/apache/spark/sql/DataFrameWriter.scala Outdated Show resolved Hide resolved

Fix

a1bfa96

dongjoon-hyun added the SQL label Mar 17, 2020

dongjoon-hyun changed the title ~~[SQL][MINOR] Update the DataFrameWriter.bucketBy comment~~ [MINOR][SQL] Update the DataFrameWriter.bucketBy comment Mar 17, 2020

dongjoon-hyun approved these changes Mar 17, 2020

View reviewed changes

dongjoon-hyun closed this in 124b4ce Mar 17, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MINOR][SQL] Update the DataFrameWriter.bucketBy comment #27930

[MINOR][SQL] Update the DataFrameWriter.bucketBy comment #27930

maropu commented Mar 17, 2020

SparkQA commented Mar 17, 2020

SparkQA commented Mar 17, 2020

dongjoon-hyun commented Mar 17, 2020

dongjoon-hyun left a comment •

edited

maropu commented Mar 17, 2020

[MINOR][SQL] Update the DataFrameWriter.bucketBy comment #27930

[MINOR][SQL] Update the DataFrameWriter.bucketBy comment #27930

Conversation

maropu commented Mar 17, 2020

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

SparkQA commented Mar 17, 2020

SparkQA commented Mar 17, 2020

dongjoon-hyun commented Mar 17, 2020

dongjoon-hyun left a comment • edited

Choose a reason for hiding this comment

maropu commented Mar 17, 2020

dongjoon-hyun left a comment •

edited