I recently hit an error with a `SLURMCluster`: a module I thought I had loaded through `job_extra` could not be imported in my Python script.
Specifically, I had put `module load tensorflow-gpu/py3/2.1.0` in my `job_extra` and I couldn't import `tensorflow` (this was on Jean Zay).
It took @lesteve 5 minutes to spot the cause by looking at the generated job script, but it wasn't obvious to me to even look there. So I think it deserves a bit more emphasis, maybe here, that the Python used to run the workers is the one used to launch the scheduler.
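To make the mechanism concrete, here is a minimal sketch of why the `module load` line had no effect on the worker's interpreter. It is a simplified stand-in, not dask-jobqueue's actual template: `render_job_script` and its `setup_lines` argument are hypothetical names, and the worker command line is only an approximation. The point it illustrates is that the interpreter defaults to `sys.executable`, an absolute path, so anything the setup lines add to `PATH` is bypassed.

```python
import sys


def render_job_script(setup_lines, python=sys.executable):
    """Build a simplified, hypothetical job script (not the real template)."""
    lines = ["#!/usr/bin/env bash"]
    # Setup commands that run before the worker starts, e.g. "module load ...".
    lines += list(setup_lines)
    # The worker is launched through an absolute interpreter path, so a module
    # loaded above may change PATH but not which python actually runs.
    lines.append(f"{python} -m distributed.cli.dask_worker <scheduler-address>")
    return "\n".join(lines)


script = render_job_script(["module load tensorflow-gpu/py3/2.1.0"])
print(script)
```

On a real cluster, `print(cluster.job_script())` shows the script that is actually submitted, and as far as I understand the `python` argument of `SLURMCluster` can be used to point the workers at a different interpreter.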
I am planning to submit a small PR.