[SPARK-52812][SQL] Make Spark Connect Catalog.createTable eager#56062

Closed
rishav23 wants to merge 3 commits into apache:master from rishav23:fix-spark-52812-createtable-eager
@rishav23
What changes were proposed in this pull request?

This PR makes Spark Connect Catalog.createTable eager. Previously, createTable() only constructed a lazy DataFrame, requiring users to explicitly trigger an action such as .collect() for the table creation to actually execute. This change eagerly executes the command internally while preserving the existing return type. A regression test has also been added to verify that tables are created immediately without requiring an explicit action.
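The difference between the old and new behavior can be sketched with a toy model. This is not Spark code and not the patch itself: `ToyServer`, `LazyFrame`, `create_table_lazy`, and `create_table_eager` are hypothetical names invented for illustration, and the assumption that the eager variant returns a frame whose plan does not re-run the command is mine, not something the PR states.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Set


@dataclass
class ToyServer:
    """Hypothetical stand-in for the remote catalog; tracks existing tables."""
    tables: Set[str] = field(default_factory=set)


@dataclass
class LazyFrame:
    """Hypothetical stand-in for a lazy DataFrame: runs its plan on collect()."""
    plan: Callable[[], List[str]]

    def collect(self) -> List[str]:
        return self.plan()


def create_table_lazy(server: ToyServer, name: str) -> LazyFrame:
    # Old behavior: only build the plan; nothing reaches the server yet.
    def plan() -> List[str]:
        server.tables.add(name)
        return []
    return LazyFrame(plan)


def create_table_eager(server: ToyServer, name: str) -> LazyFrame:
    # New behavior: execute the command internally, then return a frame of
    # the same type so the caller-facing signature is unchanged.
    create_table_lazy(server, name).collect()
    # Assumption: the returned frame does not re-issue the create command.
    return LazyFrame(lambda: [])


server = ToyServer()

pending = create_table_lazy(server, "t_lazy")
exists_before_action = "t_lazy" in server.tables   # plan not run yet
pending.collect()                                   # explicit action
exists_after_action = "t_lazy" in server.tables

create_table_eager(server, "t_eager")               # return value ignored
exists_without_action = "t_eager" in server.tables
```

In this model the lazy variant leaves the server untouched until `collect()` is called, while the eager variant has already created the table by the time it returns.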

Why are the changes needed?

Catalog.createTable() is a side-effecting operation and should execute eagerly to match expected Catalog API semantics.

Does this PR introduce any user-facing change?

Yes. Previously, spark.catalog.createTable(...) did not create the table in Spark Connect until an action was triggered. Now the table is created eagerly.

How was this patch tested?

  • Added a regression test in CatalogSuite
  • Ran build/sbt compile
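The shape of such a regression test might look like the sketch below. It is not the test added to CatalogSuite: `InMemoryCatalog` and its methods are hypothetical stand-ins so the example runs without a Spark Connect server. The point is only the assertion pattern: call the create method, ignore the return value, and check that the table exists.

```python
import unittest


class InMemoryCatalog:
    """Hypothetical eager catalog, used only to make the test runnable."""

    def __init__(self):
        self._tables = {}

    def create_table(self, name, columns):
        if name in self._tables:
            raise ValueError(f"table {name} already exists")
        # Eager: the table is registered before the method returns.
        self._tables[name] = list(columns)
        return list(columns)

    def table_exists(self, name):
        return name in self._tables

    def drop_table(self, name):
        self._tables.pop(name, None)


class CreateTableIsEagerTest(unittest.TestCase):
    def test_table_exists_without_action(self):
        catalog = InMemoryCatalog()
        try:
            self.assertFalse(catalog.table_exists("eager_tbl"))
            # Deliberately discard the result: no collect() or other action.
            catalog.create_table("eager_tbl", ["id", "name"])
            self.assertTrue(catalog.table_exists("eager_tbl"))
        finally:
            # Clean up so the test leaves no state behind.
            catalog.drop_table("eager_tbl")
        self.assertFalse(catalog.table_exists("eager_tbl"))
```

Saved to a file, the test can be run with `python -m unittest <file>`. A lazy implementation would fail the second assertion, which is the regression the PR description says the new test guards against.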

Was this patch authored or co-authored using generative AI tooling?

No

@rishav23 rishav23 marked this pull request as draft May 22, 2026 13:02
@rishav23 rishav23 closed this May 22, 2026
@rishav23 rishav23 deleted the fix-spark-52812-createtable-eager branch May 22, 2026 13:07