-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Parquet list columns read incorrectly #2557
Comments
Thanks for reporting, we will have a look |
Hello everyone, this bug impacted one of the prototypes we were exploring using DuckDB + Node for, so I took a stab at fixing the problem myself. I will add some regression tests to ensure that this return in some nasty form, but in the meantime if there is anything else that needs doing to help this get merged, please let me know :) |
Mytherin
added a commit
that referenced
this issue
Nov 23, 2021
Fix bug in parquet reader causing list columns to be parsed incorrectly (#2557)
This should be fixed now. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
What happens?
When scanning parquet files with list columns, results with incorrect offsets can be returned.
To Reproduce
This should return a list-column of length 100 starting at 90,000; instead it starts with 912.
returns the correct result.
Environment (please complete the following information):
Before Submitting
master
branch?pip install duckdb --upgrade --pre
install.packages("https://github.com/duckdb/duckdb/releases/download/master-builds/duckdb_r_src.tar.gz", repos = NULL)
The text was updated successfully, but these errors were encountered: