[SPARK-44815][CONNECT]Cache df.schema to avoid extra RPC #42499

grundprinzip · 2023-08-15T13:17:49Z

What changes were proposed in this pull request?

This patch caches the result of the `df.schema` call on the DataFrame, avoiding an extra roundtrip to the Spark Connect service each time the columns or the schema are requested. Since a DataFrame is immutable, its schema cannot change, so the cached value stays valid.
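The caching idea can be illustrated with a minimal, self-contained sketch. This is not the code from this patch: `FakeConnectClient`, `CachedSchemaFrame`, `fetch_schema`, and `_cached_schema` are hypothetical names made up for illustration, and the fake client only counts calls in place of a real RPC to the Spark Connect service.

```python
from typing import List, Optional


class FakeConnectClient:
    """Hypothetical stand-in for a Spark Connect client; it only counts schema lookups."""

    def __init__(self) -> None:
        self.schema_rpc_count = 0

    def fetch_schema(self, plan: str) -> List[str]:
        # Each call here stands for one roundtrip to the server.
        self.schema_rpc_count += 1
        return ["id", "name"]


class CachedSchemaFrame:
    """Hypothetical immutable frame that remembers its schema after the first lookup."""

    def __init__(self, plan: str, client: FakeConnectClient) -> None:
        self._plan = plan
        self._client = client
        self._cached_schema: Optional[List[str]] = None

    @property
    def schema(self) -> List[str]:
        # Only the first access triggers a (simulated) RPC.
        if self._cached_schema is None:
            self._cached_schema = self._client.fetch_schema(self._plan)
        # Return a copy so callers cannot mutate the cached value.
        return list(self._cached_schema)

    @property
    def columns(self) -> List[str]:
        # Derived from the cached schema, so no additional roundtrip is needed.
        return self.schema


client = FakeConnectClient()
df = CachedSchemaFrame("SELECT id, name FROM t", client)
first = df.schema
second = df.columns
```

In this sketch, accessing `schema` and then `columns` results in a single simulated roundtrip. Returning a copy is one possible way to keep callers from modifying the cached value; whether the actual change does this is not stated in the description above.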

Why are the changes needed?

Performance / Stability

Does this PR introduce any user-facing change?

No

How was this patch tested?

Existing UT


hvanhovell

LGTM

hvanhovell · 2024-02-02T12:47:21Z

Merging to master.

[SPARK-44815] Cache df.schema to avoid extra RPC

e34dd70

grundprinzip changed the title ~~[SPARK-44815] Cache df.schema to avoid extra RPC~~ [SPARK-44815][CONNECT]Cache df.schema to avoid extra RPC Aug 15, 2023

github-actions bot added SQL PYTHON CONNECT labels Aug 15, 2023

grundprinzip closed this Aug 15, 2023

hvanhovell reopened this Jan 24, 2024

Merge remote-tracking branch 'apache/master' into SPARK-44815

94c09e2

# Conflicts: # python/pyspark/sql/connect/dataframe.py

hvanhovell approved these changes Feb 1, 2024


hvanhovell added 3 commits February 1, 2024 16:36

Add documentation/warning.

dd77a45

Wrong version

7dd36ca

Remove redundant error checks.

7787967

hvanhovell closed this in 6f87fe2 Feb 2, 2024
