Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request] Reading and Preserving Row Tracking information #2111

Closed
2 of 8 tasks
johanl-db opened this issue Sep 27, 2023 · 1 comment
Closed
2 of 8 tasks

[Feature Request] Reading and Preserving Row Tracking information #2111

johanl-db opened this issue Sep 27, 2023 · 1 comment
Labels
enhancement New feature or request
Milestone

Comments

@johanl-db
Copy link
Collaborator

johanl-db commented Sep 27, 2023

Feature request

Which Delta project/connector is this regarding?

  • Spark
  • Standalone
  • Flink
  • Kernel
  • Other (fill in here)

Overview

This feature request tracks the remaining effort to fully implement Row Tracking.
The feature required changes in Spark that are being released with Spark 3.5. With Spark 3.5 being released, fully implementing the feature is now unblocked.

Further details

Row tracking specification was added with 8dfa534 and #1747. See Row ID design document for the initial design.

Support for writing fresh row IDs / row commit versions was introduced in the following PRs and didn't require any changes in Spark:

Reading and preserving row IDs / row commit versions on the other does require changes that are shipping with Spark 3.5 (see e.g. apache/spark#40677 and apache/spark#40545). With Spark 3.5 we can now implement reading and preserving row tracking information

Willingness to contribute

The Delta Lake Community encourages new feature contributions. Would you or another member of your organization be willing to contribute an implementation of this feature?

  • Yes. I can contribute this feature independently.
  • Yes. I would be willing to contribute this feature with guidance from the Delta Lake community.
  • No. I cannot contribute this feature at this time.
@johanl-db johanl-db added the enhancement New feature or request label Sep 27, 2023
@johanl-db johanl-db modified the milestone: 3.1.0 Sep 27, 2023
@johanl-db johanl-db added this to the 3.2.0 milestone Feb 29, 2024
@johanl-db johanl-db assigned johanl-db and unassigned johanl-db Apr 26, 2024
@johanl-db
Copy link
Collaborator Author

All the changes that allow reading and preserving row tracking information are in 3.2 thanks to @longvu-db!
Some of the more relevant PRs part of that effort:
Read support:

Preserving row tracking information:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant