-
Notifications
You must be signed in to change notification settings - Fork 3.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[R] parallel as.data.frame.Table hangs indefinitely on Windows #26715
Comments
Neal Richardson / @nealrichardson: |
Bruno Tremblay:
|
Neal Richardson / @nealrichardson: |
Bruno Tremblay: The Table is built from a single vector of RAWSXP reprensenting an IPC stream. When this raw vector is saved to disk using saveRDS then reread using readRDS, the resulting Table has no problem behind converted to a data.frame even with multithreading. It is only in the case where the vector stays in memory that the problem occurs on multi-threading. Mind you building the Table itself is not an issue and querying the table for everyrow also yield the expected results. It's pretty hard for me to nail the problem down as I do not have any notion yet of how threads are handled in Cpp.
But I'm pretty sure it has to do with either memory management or the length/capacity of the in memory vector.
Next up is doing a memory dump to compare between in-memory only and memory-disk-memory method. |
Neal Richardson / @nealrichardson: |
Neal Richardson / @nealrichardson: |
Bruno Tremblay:
Tested with the latest R and it still hangs just now. It made me thinks about comparing pointer ownerships.
Should I reinstall arrow too? |
Bruno Tremblay: Comparing memory dump did not yield satisfactory results. I will try to get my hands on a different windows install maybe, a different CPU.
I'll update if I find something.
|
Neal Richardson / @nealrichardson: |
Neal Richardson / @nealrichardson: |
On Windows only
Tested on 2 machines, mingw.
Reprex
Session details
The text was updated successfully, but these errors were encountered: