[SPARK-14081][SQL] - Preserve DataFrame column types when filling nulls. by traviscrawford · Pull Request #11967 · apache/spark

traviscrawford · 2016-03-25T20:59:53Z

What changes were proposed in this pull request?

This change resolves an issue where DataFrameNaFunctions.fill changes a FloatType column to a DoubleType. We also clarify the contract that replacement values will be cast to the column data type, which may change the replacement value when casting to a lower precision type.

How was this patch tested?

This patch has associated unit tests.

traviscrawford · 2016-03-25T21:02:56Z

sql/core/src/main/scala/org/apache/spark/sql/DataFrameNaFunctions.scala

The data type pattern match has been removed because I believe its unnecessary. I'm happy to put this back, along with a comment describing what special-case it covers, if someone can clue me into why its needed.

I think this is used to convert NaN values into null. Otherwise we won't fill NaN values.

JoshRosen · 2016-03-25T21:17:28Z

Jenkins, this is ok to test.

SparkQA · 2016-03-25T22:43:32Z

Test build #54216 has finished for PR 11967 at commit 834ee69.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

rxin · 2016-03-26T00:40:09Z

@traviscrawford can you remove the part about this is your first contribution from the pr description? The pr description will actually become part of the commit.

rxin · 2016-03-26T00:43:22Z

Looks like there is a test failure.

…a types

SparkQA · 2016-03-30T15:51:32Z

Test build #54523 has finished for PR 11967 at commit 9dfe2dd.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

rxin · 2016-03-30T22:07:27Z

@traviscrawford can you update the pr title to have a full sentence? github is cutting it off.

rxin · 2016-03-30T22:08:04Z

LGTM - we should merge this as soon as you update the title.

traviscrawford · 2016-03-30T23:54:28Z

Title updated.

rxin · 2016-03-30T23:59:51Z

Thanks - I've merged this in master.

traviscrawford reviewed Mar 25, 2016
View reviewed changes

SPARK-14081 - Update DataFrameNaFunctions.fill to preserve column dat…

9dfe2dd

…a types

traviscrawford force-pushed the SPARK-14081-dataframena branch from 834ee69 to 9dfe2dd Compare March 30, 2016 14:18

traviscrawford changed the title ~~[SPARK-14081][SQL] - Update DataFrameNaFunctions.fill to preserve column dat…~~ [SPARK-14081][SQL] - Preserve DataFrame column types when filling nulls. Mar 30, 2016

asfgit closed this in da54abf Mar 31, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-14081][SQL] - Preserve DataFrame column types when filling nulls.#11967

[SPARK-14081][SQL] - Preserve DataFrame column types when filling nulls.#11967
traviscrawford wants to merge 1 commit intoapache:masterfrom
traviscrawford:SPARK-14081-dataframena

traviscrawford commented Mar 25, 2016

Uh oh!

traviscrawford Mar 25, 2016

Uh oh!

rxin Mar 26, 2016

Uh oh!

JoshRosen commented Mar 25, 2016

Uh oh!

SparkQA commented Mar 25, 2016

Uh oh!

rxin commented Mar 26, 2016

Uh oh!

rxin commented Mar 26, 2016

Uh oh!

SparkQA commented Mar 30, 2016

Uh oh!

rxin commented Mar 30, 2016

Uh oh!

rxin commented Mar 30, 2016

Uh oh!

traviscrawford commented Mar 30, 2016

Uh oh!

rxin commented Mar 30, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

traviscrawford commented Mar 25, 2016

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

traviscrawford Mar 25, 2016

Choose a reason for hiding this comment

Uh oh!

rxin Mar 26, 2016

Choose a reason for hiding this comment

Uh oh!

JoshRosen commented Mar 25, 2016

Uh oh!

SparkQA commented Mar 25, 2016

Uh oh!

rxin commented Mar 26, 2016

Uh oh!

rxin commented Mar 26, 2016

Uh oh!

SparkQA commented Mar 30, 2016

Uh oh!

rxin commented Mar 30, 2016

Uh oh!

rxin commented Mar 30, 2016

Uh oh!

traviscrawford commented Mar 30, 2016

Uh oh!

rxin commented Mar 30, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants