You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Cause
Chainerio uses the pyarrow module to access the HDFS internally, and the pyarrow uses the HDFS Java module. The HDFS connection is pooled inside, and If the connection is first created (implicitly though calls to HDFS, like (1)) and then forked, it breaks the pooling and cause future HDFS calls to freeze.
Solution
Fork before the creation of HDFS connection, i.e. fork before any calls to the HDFS, e.g. (1).
The text was updated successfully, but these errors were encountered:
Problem statement
After forking with prior HDFS calls in the main process, the program freezes at any future HDFS calls.
For example:
Cause
Chainerio uses the pyarrow module to access the HDFS internally, and the pyarrow uses the HDFS Java module. The HDFS connection is pooled inside, and If the connection is first created (implicitly though calls to HDFS, like (1)) and then forked, it breaks the pooling and cause future HDFS calls to freeze.
Solution
Fork before the creation of HDFS connection, i.e. fork before any calls to the HDFS, e.g. (1).
The text was updated successfully, but these errors were encountered: