New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
SQL query filter not working with pyarrow dataset with timestamp index #9371
Comments
Probably similar to #8856 |
Indeed it does look like it's the same issue (or very similar at least). I tried running the reproducible script with latest main
Perhaps that is the real bug (instead of working around and passing |
Yea the PR I mentioned is included in 0.9.1, I didn't mean it was already fixed, sorry for the confusion. I had a look and it seems the ArrowDateTimeType is DAYS, which is not something we're expecting there currently An unfortunate detail about your reproduction is that it only works on pyarrow 13+, and that makes it almost impossible to properly debug because when I attach lldb and pyarrow13+ gets imported it causes lldb to crash
|
Actually, got a debugger attached to it with pyarrow 12 and reproducing the issue 👍 |
What happens?
This query does work when you query the parquet file directly (see reproducible). Only when you make a dataset out of it it doesn't work. It's possible that this is inherent to dataset + parquet files with timestamp indices, however, I could find any documentation suggesting this.
To Reproduce
OS:
OSX aarch64
DuckDB Version:
0.9.0
DuckDB Client:
python
Full Name:
Sam VL
Affiliation:
source.ag
Have you tried this on the latest
main
branch?I have tested with a main build
Have you tried the steps to reproduce? Do they include all relevant data and configuration? Does the issue you report still appear there?
The text was updated successfully, but these errors were encountered: