[core] Ray session conflicts with PyArrow+HDFS #36415
Labels
bug
Something that is supposed to be working; but isn't
core
Issues that should be addressed in Ray Core
core-fundamentals
P1
Issue that should be fixed within a few weeks
stability
What happened + What you expected to happen
Using PyArrow fs with HDFS works fine outside a ray session:
However, after
ray.init()
, the same code results in a segmentation fault:Here is the log dump from java:
hs_err_pid9716.log
The segfault occurs almost every time, but not always.
It never occurs when ray is not initialized. Thus there is probably some interference between the ray session/global state and the java/pyarrow/hdfs connection.
Versions / Dependencies
Ray latest master, hadoop 3.2.4, java openjdk version "1.8.0_362"
Reproduction script
./ci/env-install-hdfs.sh
/opt/hadoop-3.2.4/bin/hdfs dfs -put /tmp/somewhere hdfs://[host]:8020/somewhere
Issue Severity
High: It blocks me from completing my task.
The text was updated successfully, but these errors were encountered: