Skip to content

Comments

[SPARK-39147][SQL] Code simplification, use count() instead of filter().size, etc.#36507

Closed
morvenhuang wants to merge 5 commits intoapache:masterfrom
morvenhuang:SPARK-39147
Closed

[SPARK-39147][SQL] Code simplification, use count() instead of filter().size, etc.#36507
morvenhuang wants to merge 5 commits intoapache:masterfrom
morvenhuang:SPARK-39147

Conversation

@morvenhuang
Copy link
Contributor

What changes were proposed in this pull request?

Use count() instead of filter().size, use df.count() instead of df.collect().size.

Why are the changes needed?

Code simplification.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Pass GA.

Copy link
Member

@srowen srowen left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks fine

@srowen
Copy link
Member

srowen commented May 11, 2022

Oh I see, good catch!

@AmplabJenkins
Copy link

Can one of the admins verify this patch?

@morvenhuang
Copy link
Contributor Author

The GC overhead limit exceeded error in AppVeyor build deesn't seem relevant.

@srowen
Copy link
Member

srowen commented May 11, 2022

Yes I'm confident it is not related. The others passed. Merged to master

@srowen srowen closed this in 09564df May 11, 2022
@morvenhuang
Copy link
Contributor Author

@srowen thank you Sean.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants