TracerPythonScript
is stuck when using same work to run different DDP scripts
#14720
Labels
app:lightningwork
lightning_app.LightningWork
app
Generic label for Lightning App package
bug
Something isn't working
Milestone
馃悰 Bug
To Reproduce
Create 2 scripts with this code:
and
app.py
the 2nd script is stuck while creating the processes.
The reason I am using the same Tracer is that
TracerPythonScript
is aLightningWork
and if I create multiple works to run different scripts, it will eventually allocate multiple machines for each work. Ideally it should be flexible enough to run the script and exit that without any issues and users should be able to use the same machine to run different scripts.Also:
Expected behavior
Environment
conda
,pip
, source):torch.__config__.show()
:Additional context
The text was updated successfully, but these errors were encountered: