Skip to content

ParquetFile.schema.to_arrow_schema() output is truncated with "(... N chars omitted)" for long lines #48087

@SaraPrager

Description

@SaraPrager

Describe the bug, including details regarding any error messages, version, and platform.

When running the following code:

import pyarrow.parquet as pq
parquet_file = pq.ParquetFile('<my file with long schema lines>.parquet')
parquet_file.schema.to_arrow_schema()

the output for long schema lines is truncated and replaced with (... N chars omitted).

For example, a schema line like the following will be truncated:

config: list<element: struct<default_value: string, name: string, keywords: list<element: string>, type: s (... 7 chars omitted)

The issue did not occur in pyarrow version 20, but is present in versions 21 and 22.

Component(s)

Parquet

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions