Skip to content

feat: Add support for writing bloom filters#3265

Draft
renaudb wants to merge 2 commits intoapache:mainfrom
renaudb:renaudb-add-bloom-filters-write
Draft

feat: Add support for writing bloom filters#3265
renaudb wants to merge 2 commits intoapache:mainfrom
renaudb:renaudb-add-bloom-filters-write

Conversation

@renaudb
Copy link
Copy Markdown

@renaudb renaudb commented Apr 21, 2026

Closes #850

Note: This PR is currently held back by boto requiring pyarrow<=23.1 as bloom filter write support was added in pyarrow 24.

Rationale for this change

Add support for writing bloom filters to parquet files. This changes leverages the new bloom_filter_options write_parquet argument in pyarrow 24.

Are these changes tested?

Added tests for the metadata parsing. Added a very basic test for the writing path (there is currently no way to test for the existence of a bloomfilter in a parquet file using pyarrow).

Are there any user-facing changes?

N/A

@renaudb renaudb marked this pull request as draft April 21, 2026 21:40
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

support bloom-filter writing

1 participant