Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Python] Segfault when inspecting dataset.Source with invalid file/partitioning #23888

Closed
asfimport opened this issue Jan 21, 2020 · 1 comment
Assignees
Milestone

Comments

@asfimport
Copy link

Getting a segfault with:

In [1]: import pyarrow.dataset as ds                                                                                                                                                                               

In [2]: !touch test_empty.txt                                                                                                                                                                                      

In [3]: source_factory = ds.source("test_empty.txt", partitioning=ds.partitioning(field_names=['a', 'b']))                                                                                                         

In [4]: source_factory.inspect()                                                                                                                                                                                   
Segmentation fault (core dumped)

Didn't yet further investigate what might be the reason (there are several "wrong" things here: it's an empty file, it's not a valid file for the parquet format, the partitioning does not match the files, etc)

Reporter: Joris Van den Bossche / @jorisvandenbossche
Assignee: Francois Saint-Jacques / @fsaintjacques

PRs and other links:

Note: This issue was originally created as ARROW-7638. Please see the migration documentation for further details.

@asfimport
Copy link
Author

Ben Kietzman / @bkietz:
Issue resolved by pull request 6267
#6267

@asfimport asfimport added this to the 0.16.0 milestone Jan 11, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants