Labels
bug (Something isn't working)
Description
Describe the bug
The field name used in the schema lookup is currently hardcoded as "element", which is the Parquet convention; Arrow's convention is "item". When the data is actually read, however, the constructed name uses the list's own field name. This means that for "normal" list arrays (where the inner field is named "item"), the lookup will never match.
This also goes against the Avro behaviour, where list items do not have a name at all, and it might cause issues in the future as the lookup evolves (for example, to use refs).
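To make the mismatch concrete, here is a minimal sketch of the failure mode. It is only a model of the behaviour described above: `build_lookup`, `reader_key`, and the dotted-path key format are hypothetical and are not taken from the actual code.

```python
# Hypothetical model of the mismatch; these names do not come from the real code base.

def build_lookup(list_field: str, child_fields: list[str]) -> dict[str, int]:
    """Index child fields under a hardcoded "element" segment (Parquet convention)."""
    return {
        f"{list_field}.element.{child}": position
        for position, child in enumerate(child_fields)
    }


def reader_key(list_field: str, item_field_name: str, child: str) -> str:
    """Key built at read time from the list's actual inner field name."""
    return f"{list_field}.{item_field_name}.{child}"


lookup = build_lookup("items", ["id", "name"])

arrow_key = reader_key("items", "item", "id")       # Arrow's conventional inner name
parquet_key = reader_key("items", "element", "id")  # Parquet's conventional inner name

print(arrow_key in lookup)    # False: "items.item.id" was never indexed
print(parquet_key in lookup)  # True: only the "element" spelling matches
```

Under these assumptions, any list whose inner field is not literally named "element" misses the lookup, which matches the behaviour reported here.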
To Reproduce
Create a reader with the following schema:
{
  "type": "record",
  "name": "root",
  "fields": [
    {
      "name": "items",
      "type": {
        "type": "array",
        "items": {
          "type": "record",
          "name": "item_record",
          "fields": [
            {
              "name": "id",
              "type": "long"
            },
            {
              "name": "name",
              "type": "string"
            }
          ]
        }
      }
    }
  ]
}
Reading fails because the schema lookup uses "element".
Expected behavior
List items should not contribute a sub-name to the lookup at all, and this schema should be read successfully.
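One possible shape for the expected behaviour, sketched in Python against the schema from the reproduction: array levels add no path segment, so the lookup no longer depends on whether the inner field is called "item" or "element". The `lookup_paths` helper and the dotted-path format are assumptions made for illustration, not the project's implementation.

```python
import json

# Schema from the reproduction steps.
SCHEMA = json.loads("""
{
  "type": "record",
  "name": "root",
  "fields": [
    {
      "name": "items",
      "type": {
        "type": "array",
        "items": {
          "type": "record",
          "name": "item_record",
          "fields": [
            {"name": "id", "type": "long"},
            {"name": "name", "type": "string"}
          ]
        }
      }
    }
  ]
}
""")


def lookup_paths(avro_type, prefix=""):
    """Yield a dotted path per record field; arrays add no segment (hypothetical scheme)."""
    if not isinstance(avro_type, dict):
        return  # primitive type names such as "long" carry no nested fields
    if avro_type.get("type") == "record":
        for field in avro_type["fields"]:
            path = f"{prefix}.{field['name']}" if prefix else field["name"]
            yield path
            yield from lookup_paths(field["type"], path)
    elif avro_type.get("type") == "array":
        # Avro array items are unnamed, so recurse without extending the path.
        yield from lookup_paths(avro_type["items"], prefix)


paths = list(lookup_paths(SCHEMA))
print(paths)  # ['items', 'items.id', 'items.name']
```

With paths built this way, the reader would need to construct its keys by the same rule (skipping the list's inner field name) for the two sides to agree.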
Additional context
No response