Skip to content

[SPARK-46164][PYTHON] Add include/exclude parameters to DataFrame.describe in pandas API on Spark#55748

Open
devin-petersohn wants to merge 1 commit intoapache:masterfrom
devin-petersohn:devin/describe-include-exclude
Open

[SPARK-46164][PYTHON] Add include/exclude parameters to DataFrame.describe in pandas API on Spark#55748
devin-petersohn wants to merge 1 commit intoapache:masterfrom
devin-petersohn:devin/describe-include-exclude

Conversation

@devin-petersohn
Copy link
Copy Markdown
Contributor

What changes were proposed in this pull request?

Add include and exclude parameters to DataFrame.describe() in the pandas API on Spark

Why are the changes needed?

Missing API coverage

Does this PR introduce any user-facing change?

Yes

How was this patch tested?

Unit tests

Was this patch authored or co-authored using generative AI tooling?

Generated-by: Claude Code (claude-opus-4-7)

…cribe in pandas API on Spark

Add include and exclude parameters to DataFrame.describe() in the
pandas API on Spark, matching pandas behavior. Supports dtype
filtering via select_dtypes, include="all" for mixed-type DataFrames,
and proper validation of parameter combinations.

Co-Authored-By: Claude <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants