Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-10434] [SQL] Fixes Parquet schema of arrays that may contain null #8586

Closed

Conversation

liancheng
Copy link
Contributor

To keep full compatibility of Parquet write path with Spark 1.4, we should rename the innermost field name of arrays that may contain null from "array_element" to "array".

Please refer to SPARK-10434 for more details.

@SparkQA
Copy link

SparkQA commented Sep 3, 2015

Test build #41974 has finished for PR 8586 at commit 4e57d11.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@davies
Copy link
Contributor

davies commented Sep 4, 2015

LGTM

@liancheng
Copy link
Contributor Author

Thanks for the review! I'm merging this to master and branch-1.5.

asfgit pushed a commit that referenced this pull request Sep 5, 2015
To keep full compatibility of Parquet write path with Spark 1.4, we should rename the innermost field name of arrays that may contain null from "array_element" to "array".

Please refer to [SPARK-10434] [1] for more details.

[1]: https://issues.apache.org/jira/browse/SPARK-10434

Author: Cheng Lian <lian@databricks.com>

Closes #8586 from liancheng/spark-10434/fix-parquet-array-type.

(cherry picked from commit bca8c07)
Signed-off-by: Cheng Lian <lian@databricks.com>
@asfgit asfgit closed this in bca8c07 Sep 5, 2015
ashangit pushed a commit to ashangit/spark that referenced this pull request Oct 19, 2016
To keep full compatibility of Parquet write path with Spark 1.4, we should rename the innermost field name of arrays that may contain null from "array_element" to "array".

Please refer to [SPARK-10434] [1] for more details.

[1]: https://issues.apache.org/jira/browse/SPARK-10434

Author: Cheng Lian <lian@databricks.com>

Closes apache#8586 from liancheng/spark-10434/fix-parquet-array-type.

(cherry picked from commit bca8c07)
Signed-off-by: Cheng Lian <lian@databricks.com>
(cherry picked from commit 640000b)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
3 participants