You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This might be a bug in parquet-cpp, I need to spend a bit more time tracking this down but basically given a file with a single row on hdfs, reading it with pyarrow yields this error
import pyarrow
import pyarrow.parquet as pq
fs = pyarrow.hdfs.connect('my-namenode-url', driver='libhdfs3') # fill in namenode information
file_object = fs.open('single-row.parquet') # update for hdfs path of file
pq.read_metadata(file_object) # this works
parquet_file = pq.ParquetFile(file_object)
parquet_file.read_row_group(0) # throws error
I am working on writing a unit test for this. Note that I am using libhdfs3.
This might be a bug in parquet-cpp, I need to spend a bit more time tracking this down but basically given a file with a single row on hdfs, reading it with pyarrow yields this error
The following code causes it:
I am working on writing a unit test for this. Note that I am using libhdfs3.
Reporter: Robbie Gruener / @rgruener
Original Issue Attachments:
Note: This issue was originally created as ARROW-2842. Please see the migration documentation for further details.
The text was updated successfully, but these errors were encountered: