Skip to content

Support RecordBatch.flatten #6369

@kszlim

Description

@kszlim

Is your feature request related to a problem or challenge? Please describe what you are trying to do.
I want to write flattened parquet files, as not everything has support for structs.

Describe the solution you'd like
Recursively flatten all struct columns in a recordbatch (similar to pandas json normalize), alternatively, a solution via datafusion might be acceptable.

Describe alternatives you've considered
Running pyarrow.Table.flatten in a loop until there are no more top level struct columns, though this requires you to go through python.

Metadata

Metadata

Assignees

Labels

enhancementAny new improvement worthy of a entry in the changelog

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions