[BUG]: `num_workers`>0 hangs on Windows and sometimes MacOS. #740

JacksonBurns · 2024-03-22T17:00:48Z

This was originally discovered in #714: setting `num_workers` above 0 causes Chemprop to hang on Windows, both on GitHub Actions and locally, and on MacOS on GitHub Actions.

See these two replies to a highly relevant issue on the PyTorch forum. We may need to refactor our calls to train, or just disallow parallel dataloading based on platform.

davidegraff · 2024-03-23T18:50:06Z

Seems like the default of 8 is a remnant of v1. I don't think it's a bad change to use the torch default (`num_workers=0`), and if we reintroduce caching (#697) then there's really no need to parallelize dataloading. FWIW I think this is due to differences in parallelism implementations across platforms: because of the Python GIL, dataloading is parallelized with worker processes, and Linux starts them with fork() by default, which is significantly faster to spin up and wind down than the spawn method used by default on Windows/MacOS.
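To make the fork/spawn point concrete, here is a minimal stdlib-only sketch for checking which start method a given machine defaults to. The helper names `default_start_method` and `workers_reimport_main` are hypothetical and are not part of Chemprop or PyTorch; the default also depends on the Python version, so treat the result as something to inspect rather than assume.

```python
import multiprocessing as mp


def default_start_method() -> str:
    """Return the name of the default multiprocessing start method here.

    get_all_start_methods() lists the supported methods with the default
    first, and it does not fix the global start method as a side effect.
    """
    return mp.get_all_start_methods()[0]


def workers_reimport_main(method: str) -> bool:
    """Return True if workers begin in a fresh interpreter.

    With "spawn" and "forkserver" the main module is imported again in each
    worker, so process-starting code must sit behind a __main__ guard.
    With "fork" the worker is a copy of the parent and nothing is re-imported.
    """
    return method != "fork"


if __name__ == "__main__":
    method = default_start_method()
    print(f"default start method: {method}")
    print(f"needs __main__ guard: {workers_reimport_main(method)}")
```

If the second line reports that a guard is needed, any script that builds a dataloader with `num_workers` > 0 at module level is a plausible candidate for the kind of hang described in this issue.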

JacksonBurns · 2024-03-23T18:56:03Z

Here's the relevant pytorch docs page as well https://pytorch.org/docs/stable/notes/windows.html#usage-multiprocessing

donerancl · 2024-04-12T17:17:50Z

This has been addressed:

- a message warns that `num_workers` > 0 results in hanging on Windows and MacOS
- the default is now `num_workers` = 0
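A rough sketch of what such a check could look like, assuming a default of 0 and a warning on the affected platforms. This is not Chemprop's actual implementation; the function name `resolve_num_workers` and the warning text are made up for illustration.

```python
import sys
import warnings

# Platforms where this issue reports hangs with num_workers > 0.
# sys.platform is "win32" on Windows and "darwin" on MacOS.
_HANG_PRONE_PLATFORMS = ("win32", "darwin")


def resolve_num_workers(requested: int = 0, platform: str = sys.platform) -> int:
    """Validate a num_workers value and warn where hangs were reported.

    Hypothetical helper: the default is 0 (no worker processes), and a
    positive value on Windows or MacOS is kept but triggers a warning.
    """
    if requested < 0:
        raise ValueError("num_workers must be non-negative")
    if requested > 0 and platform.startswith(_HANG_PRONE_PLATFORMS):
        warnings.warn(
            "num_workers > 0 may hang on Windows and MacOS; "
            "consider num_workers=0",
            RuntimeWarning,
            stacklevel=2,
        )
    return requested
```

Warning instead of silently overriding the value leaves the decision with the user, which matches the "message" described above; forcing the value to 0 on those platforms would be the stricter alternative raised at the top of the issue.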

JacksonBurns added the bug label Mar 22, 2024

JacksonBurns mentioned this issue Mar 22, 2024

[v2]: CI Overhaul (and make v2 actually _pass_ the CI) #714

Merged

kevingreenman added this to the v2.0.0 milestone Mar 28, 2024

KnathanM assigned shihchengli Apr 1, 2024

donerancl closed this as completed Apr 12, 2024
