Commit
[DELTA-OSS-EXTERNAL] Added the SymlinkTextInputFormat manifest generation for Presto/Athena support

## What changes were proposed in this pull request?

This PR is the first in the sequence of PRs to add manifest file generation (SymlinkInputFormat) to OSS Delta for Presto/Athena read support (issue #76). Specifically, this PR adds the core functionality for manifest generation and rigorous tests to verify the contents of the manifest. Future PRs will add the public APIs for on-demand generation.

- Added post-commit hooks to run tasks after a successful commit.
- Added GenerateSymlinkManifest implementation of post-commit hook to generate the manifests.
- Each manifest contains the name of data files to read for querying the whole table or partition
- Non-partitioned table produces a single manifest file containing all the data files.
- Partitioned table produces partitioned manifest files; same partition structured like the table, each partition directory containing one manifest file containing data files of that partition. This allows Presto/Athena partition-pruned queries to read only manifest files of the necessary partitions.
- Each attempt to generate manifest will atomically (as much as possible) overwrite the manifest files in the directories (if they exist) and also delete manifest files of partitions that have been deleted from the table.

Closes #250

Co-authored-by: Tathagata Das <tathagata.das1565@gmail.com>
Co-authored-by: Rahul Mahadev <rahul.mahadev@databricks.com>

Author: Tathagata Das <tathagata.das1565@gmail.com>
Author: Rahul Mahadev <rahul.mahadev@databricks.com>

#6910 is resolved by tdas/SC-25511.

GitOrigin-RevId: a3e04f2fcdafb6ac29c3adcfb791a3d0611583dc
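
The manifest layout described in the commit message can be illustrated with a small sketch. This is not the code from the commit (which is not shown on this page); it is a hypothetical Python model that assumes data files are given as table-relative paths with Hive-style `key=value` partition directories. The directory name `_symlink_format_manifest`, the file name `manifest`, and the helper names `partition_dir` and `plan_manifests` are assumptions made for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Set, Tuple

# Assumed names; the commit message does not state where manifests are written.
MANIFEST_DIR = "_symlink_format_manifest"
MANIFEST_FILE = "manifest"


def partition_dir(file_path: str) -> str:
    """Return the Hive-style partition prefix of a relative path ('' if unpartitioned)."""
    parent_parts = file_path.split("/")[:-1]
    return "/".join(p for p in parent_parts if "=" in p)


def plan_manifests(
    data_files: Iterable[str], existing_manifests: Iterable[str]
) -> Tuple[Dict[str, List[str]], Set[str]]:
    """Plan one manifest per partition and find stale manifests to delete.

    Returns (manifests, stale): `manifests` maps each manifest path to the data
    files it should list; `stale` holds previously written manifest paths whose
    partition no longer has any data files.
    """
    grouped: Dict[str, List[str]] = defaultdict(list)
    for f in sorted(data_files):
        grouped[partition_dir(f)].append(f)

    manifests: Dict[str, List[str]] = {}
    for part, files in grouped.items():
        # An unpartitioned table has part == '' and yields a single manifest.
        segments = [s for s in (MANIFEST_DIR, part, MANIFEST_FILE) if s]
        manifests["/".join(segments)] = files

    stale = set(existing_manifests) - set(manifests)
    return manifests, stale
```

In this model an unpartitioned table maps to one manifest listing every data file, while a partitioned table maps to one manifest per partition directory, so a partition-pruned query only needs the manifests of the partitions it touches. The `stale` set corresponds to the manifests of partitions that have been deleted from the table. The sketch only plans the files; writing them with an atomic overwrite is left out.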
1 parent 48f5185 commit b18ffba
Showing 6 changed files with 920 additions and 0 deletions.