[Spark] Support in-place migration from unpartitioned table to clustered table #3174

Merged
vkorukanti merged 5 commits into delta-io:master from zedtang:in-place-migration
May 31, 2024

Conversation

@zedtang zedtang commented May 29, 2024

Collaborator

Which Delta project/connector is this regarding?

  • Spark
  • Standalone
  • Flink
  • Kernel
  • Other (fill in here)

Description

Support in-place migration from an unpartitioned table to a clustered table. If users run ALTER TABLE CLUSTER BY on an unpartitioned table, the command now converts it into a clustered table with a ClusteringMetadataDomain.
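As a rough illustration of the user-facing flow described above, here is a minimal Python sketch. The helper names `cluster_by_statement` and `migrate_in_place`, and the table and column names in the usage note, are hypothetical and not part of this PR; the sketch assumes a SparkSession already configured with Delta Lake and an existing unpartitioned Delta table.

```python
def cluster_by_statement(table: str, columns: list[str]) -> str:
    """Build an ALTER TABLE ... CLUSTER BY statement (hypothetical helper)."""
    if not columns:
        raise ValueError("at least one clustering column is required")
    return f"ALTER TABLE {table} CLUSTER BY ({', '.join(columns)})"


def migrate_in_place(spark, table: str, columns: list[str]) -> None:
    """Run the statement on an existing unpartitioned Delta table.

    Assumes `spark` is a SparkSession with the Delta Lake extensions enabled.
    """
    spark.sql(cluster_by_statement(table, columns))
```

For example, `migrate_in_place(spark, "events", ["user_id"])` would issue `ALTER TABLE events CLUSTER BY (user_id)`, which, per the description above, now turns the unpartitioned table into a clustered one without rewriting it into a new table.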

Resolves #2460

How was this patch tested?

New unit tests.

Does this PR introduce any user-facing changes?

No

@zedtang zedtang changed the title from "Support in-place migration from unpartitioned table to clustered table." to "[Spark] Support in-place migration from unpartitioned table to clustered table" May 29, 2024
@zedtang zedtang self-assigned this May 29, 2024

@dabao521 dabao521 left a comment

Thanks for the work! Left a few comments, and let's add more validations in the test.

@zedtang zedtang requested a review from dabao521 May 30, 2024 19:31

@dabao521 dabao521 left a comment

LGTM!

@vkorukanti vkorukanti merged commit 56ce212 into delta-io:master May 31, 2024
@zedtang zedtang deleted the in-place-migration branch May 31, 2024 15:33
richardc-db pushed a commit to richardc-db/delta that referenced this pull request Jun 5, 2024
…red table (delta-io#3174)

Development

Successfully merging this pull request may close these issues.

[Feature Request] In-place migration from unpartitioned table to clustered table

3 participants