We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CUDA_VISIBLE_DEVICES seems to contain too many entries.
CUDA_VISIBLE_DEVICES
Example from PR #3261: https://buildkite.com/horovod/horovod/builds/7041#9a807189-938e-491f-9f83-c6bc31420a67
hjob = RayExecutor(setting, num_workers=4, use_gpu=True) hjob.start() all_envs = hjob.execute(lambda _: os.environ.copy()) all_cudas = {ev["CUDA_VISIBLE_DEVICES"] for ev in all_envs} assert len(all_cudas) == 1, all_cudas > assert len(all_envs[0]["CUDA_VISIBLE_DEVICES"].split(",")) == 4 E assert 8 == 4 E +8 E -4
The text was updated successfully, but these errors were encountered:
@maxhgerlach thanks for the report, I'll take a look tonight.
Sorry, something went wrong.
Closing in favor of #3435
ashahab
No branches or pull requests
CUDA_VISIBLE_DEVICES
seems to contain too many entries.Example from PR #3261: https://buildkite.com/horovod/horovod/builds/7041#9a807189-938e-491f-9f83-c6bc31420a67
The text was updated successfully, but these errors were encountered: