[SPARK-31492][ML] flatten the result dataframe of FValueTest #28268

Closed
wants to merge 1 commit into from

zhengruifeng
Contributor

What changes were proposed in this pull request?

Add a new method to FValueTest: def test(dataset: DataFrame, featuresCol: String, labelCol: String, flatten: Boolean): DataFrame

Why are the changes needed?

Similar to the new test method in ChiSquareTest, it will:
1. support DataFrame operations on the returned DataFrame;
2. keep the driver from becoming a bottleneck when numFeatures is large.
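
To illustrate the difference in result shape, here is a minimal sketch in plain Scala (no Spark dependency). WideResult, FlatResult, and FlattenSketch are hypothetical stand-ins rather than Spark classes, the field names are assumptions that loosely mirror the test output, and the numbers are made up. It only shows why one row per feature is easier to work with than a single row of per-feature arrays.

```scala
// Hypothetical stand-ins for the two result shapes; these are NOT Spark classes.
// Assumed shape with flatten = false: one wide record holding per-feature arrays.
final case class WideResult(
    pValues: Array[Double],
    degreesOfFreedom: Array[Long],
    fValues: Array[Double])

// Assumed shape with flatten = true: one narrow record per feature.
final case class FlatResult(
    featureIndex: Int,
    pValue: Double,
    degreesOfFreedom: Long,
    fValue: Double)

object FlattenSketch {
  // Expand the single wide record into one record per feature.
  def flatten(wide: WideResult): Seq[FlatResult] =
    wide.pValues.indices.map { i =>
      FlatResult(i, wide.pValues(i), wide.degreesOfFreedom(i), wide.fValues(i))
    }

  def main(args: Array[String]): Unit = {
    // Made-up numbers, for illustration only.
    val wide = WideResult(
      pValues = Array(0.01, 0.40, 0.03),
      degreesOfFreedom = Array(18L, 18L, 18L),
      fValues = Array(8.2, 0.7, 5.6))

    val flat = flatten(wide)

    // With one row per feature, feature selection is an ordinary filter.
    val significant = flat.filter(_.pValue < 0.05).map(_.featureIndex)
    println(significant.mkString(", ")) // prints "0, 2"
  }
}
```

In a flattened DataFrame the same filter can run as a distributed operation, so the per-feature results need not be gathered into a single row first.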

Does this PR introduce any user-facing change?

Yes, it adds a new method.

How was this patch tested?

Existing test suites.

nit

@SparkQA

SparkQA commented Apr 20, 2020

Test build #121496 has finished for PR 28268 at commit 3caf7d1.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@zhengruifeng
Contributor Author

Merged to master

@zhengruifeng zhengruifeng deleted the flatten_fvalue branch June 18, 2022 11:10