[Doc] tzdata error due to a lack of a discoverable system timezone database #49172

@shr3yas-k

Describe the enhancement requested

Some timezone-dependent PyArrow tests require a discoverable system timezone database (tzdata / zoneinfo).

In environments without a standard filesystem layout (e.g. non-FHS-compliant systems or minimal containers), these tests can fail with errors like:

ArrowInvalid: Cannot locate or parse timezone 'CET' discover_tz_dir failed to find zoneinfo
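
As a rough way to check whether an environment is likely to hit this, the sketch below looks for a compiled timezone file in a few conventional locations. The directory list and the helper name `find_tz_dir` are assumptions for illustration only; they are not taken from Arrow and may not match the paths Arrow actually searches.

```python
from pathlib import Path

# Assumed candidate locations (conventional FHS-style paths).
# This is NOT necessarily the list Arrow's tz discovery uses.
CANDIDATE_TZ_DIRS = (
    "/usr/share/zoneinfo",
    "/usr/lib/zoneinfo",
    "/usr/share/lib/zoneinfo",
)


def find_tz_dir(candidates=CANDIDATE_TZ_DIRS, zone="CET"):
    """Return the first directory holding a TZif file for `zone`, else None.

    Hypothetical helper: a directory counts as a usable timezone database
    if `<dir>/<zone>` is readable and starts with the TZif magic bytes.
    """
    for directory in candidates:
        zone_file = Path(directory) / zone
        try:
            with open(zone_file, "rb") as handle:
                if handle.read(4) == b"TZif":
                    return Path(directory)
        except OSError:
            # Missing directory, missing zone file, or unreadable path.
            continue
    return None


if __name__ == "__main__":
    tz_dir = find_tz_dir()
    if tz_dir is None:
        print("No timezone database found in the assumed locations")
    else:
        print(f"Timezone database found at {tz_dir}")
```

On a non-FHS system, passing that environment's actual zoneinfo location as `candidates` shows whether the database exists but simply lives somewhere unexpected.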

Several mitigations have landed over time, including older interim fixes for Windows (#36996), PR #45425, and recent Windows CI fixes such as commit ab2c0ad.

Windows-specific failures appear to be largely handled now, but the underlying requirement and expected behavior are not documented for contributors on other platforms (e.g. non-FHS systems and hermetic environments).

Happy to follow up with a small documentation PR if this sounds reasonable. The goal would be to clarify that these failures are environment-related and expected in certain setups, rather than indicating a bug in Arrow itself.

Component(s)

Documentation
