Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add external schema mappings for data without field IDs #71

Closed
rdblue opened this issue Oct 18, 2018 · 3 comments
Closed

Add external schema mappings for data without field IDs #71

rdblue opened this issue Oct 18, 2018 · 3 comments

Comments

@rdblue
Copy link
Contributor

rdblue commented Oct 18, 2018

Files written by Iceberg writers contain Iceberg field IDs that are used for column projection. Iceberg doesn't currently support tracking data files that were written by other systems and added to Iceberg tables with the API because the field IDs are missing. To support files written by non-Iceberg writers, Iceberg could support a table-level mapping from a source schema to Iceberg IDs.

For example, a table with 2 columns might have an Avro schema mapping like this one, encoded as JSON in table properties:

[ {"field-id": 1, "names": ["id"]},
  {"field-id": 2, "names": ["data"]} ]

When reading an Avro file, the read schema would be produced using the file's schema and the field IDs from the mapping. The names in each field mapping is a list to handle aliasing.

@rdblue rdblue changed the title Add schema mappings for data files without Iceberg field IDs Add external schema mappings for data without field IDs Oct 18, 2018
@govi20
Copy link

govi20 commented Oct 20, 2018

I would like to work on this issue.

@YuvalItzchakov
Copy link

@govi20 I have already started working on this issue, I'd love to pair up if you want :)

@rdblue
Copy link
Contributor Author

rdblue commented Dec 8, 2018

I've moved this to apache/iceberg#40

@rdblue rdblue closed this as completed Dec 8, 2018
Parth-Brahmbhatt pushed a commit to Parth-Brahmbhatt/iceberg that referenced this issue Apr 12, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants