Create a public dataset and guideline/playbook for use by public #14680
Open
Labels
- area:usability (Usability enhancements)
- from-jira
- priority:high (Significant impact; potential bugs)
- type:improvement (Improvements to existing functionality)
Description
Expose a public dataset, with schema details and instructions on how to use it.
For example:
- We could host a Parquet dump somewhere that users can read from to generate their own Hudi tables.
- We could have a playbook for creating the different types of Hudi tables (COW/MOR) by reading from this source.
- We could add a playbook that uses DeltaStreamer to read from this source one file at a time and ingest it into a Hudi table.
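As a rough illustration of what the COW/MOR playbook could contain, here is a minimal PySpark-style sketch. It assumes a Spark session with the Hudi Spark bundle on the classpath. The helper names, the column names (`uuid`, `ts`, `partitionpath`) and the paths are hypothetical, since no dataset or schema has been chosen yet; the `hoodie.*` option keys are the ones used in the Hudi Spark quick-start guides and should be checked against the Hudi version in use.

```python
from typing import Dict

# Short names used in this issue mapped to Hudi's table type values.
TABLE_TYPES = {"COW": "COPY_ON_WRITE", "MOR": "MERGE_ON_READ"}


def hudi_write_options(
    table_name: str,
    table_type: str,
    record_key: str,
    precombine_field: str,
    partition_field: str,
) -> Dict[str, str]:
    """Build Hudi datasource write options (hypothetical helper).

    The column arguments are assumptions; the real values depend on
    the schema of whichever public dataset is eventually published.
    """
    if table_type not in TABLE_TYPES:
        raise ValueError(f"table_type must be one of {sorted(TABLE_TYPES)}")
    return {
        "hoodie.table.name": table_name,
        "hoodie.datasource.write.table.type": TABLE_TYPES[table_type],
        "hoodie.datasource.write.operation": "upsert",
        "hoodie.datasource.write.recordkey.field": record_key,
        "hoodie.datasource.write.precombine.field": precombine_field,
        "hoodie.datasource.write.partitionpath.field": partition_field,
    }


def write_parquet_as_hudi(spark, source_path, target_path, options, mode="append"):
    """Read a Parquet file or directory and write it out as a Hudi table.

    `spark` is an existing SparkSession with the Hudi bundle available.
    Calling this once per source file approximates the "one file at a
    time" ingestion idea, without using DeltaStreamer itself.
    """
    df = spark.read.parquet(source_path)
    df.write.format("hudi").options(**options).mode(mode).save(target_path)


# Example usage (paths are placeholders, not a real dataset location):
# opts = hudi_write_options("trips_mor", "MOR", "uuid", "ts", "partitionpath")
# write_parquet_as_hudi(spark, "/data/public/part-0001.parquet",
#                       "/tmp/hudi/trips_mor", opts)
```

Keeping the options in one helper means the same playbook can produce either table type by changing a single argument, which makes it easier to compare COW and MOR behavior on identical input.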
JIRA info
- Link: https://issues.apache.org/jira/browse/HUDI-2125
- Type: Improvement