Skip to content

table: add opt-in pyiceberg-core arrow reader#2

Draft
abnobdoss wants to merge 1 commit into
aba-156-157-core-adaptersfrom
aba-158-opt-in-rust-arrow-scan
Draft

table: add opt-in pyiceberg-core arrow reader#2
abnobdoss wants to merge 1 commit into
aba-156-157-core-adaptersfrom
aba-158-opt-in-rust-arrow-scan

Conversation

@abnobdoss
Copy link
Copy Markdown
Owner

Stack position: PyIceberg PR after #1 (ABA-158).

Adds an env-gated DataScan.to_arrow_batch_reader() path using pyiceberg_core ArrowReader when PYICEBERG_RUST_ARROW_SCAN is enabled. Existing PyArrow behavior remains the default. The native path is skipped for limits and for projections that would require reading filter-only fields; unsupported native shapes fall back with a warning.

Validation:

  • python3 -m py_compile pyiceberg/io/pyiceberg_core.py pyiceberg/table/init.py tests/io/test_pyiceberg_core.py
  • uv run pytest tests/io/test_pyiceberg_core.py -q
  • commit hooks: ruff, format, mypy, pydocstyle, codespell

Challenger loop: red-team found limit handling and missing-core fallback issues; both were fixed. Final review green.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant