Pip installation fails in virtual env and SIGILL on DGX machines #88

d3sm0 · 2022-10-18T22:17:32Z

It seems that pip install -e . prepares the proper directories but does not include the built package. We solved this by adding:

+        include_package_data=False,
+        packages=find_packages(include=['rlmeta', 'rlmeta.*']),

here, in rlmeta/setup.py (line 87 in c43d0f1):

ext_modules=[CMakeExtension("rlmeta", "./rlmeta")],

Nit: It might be useful to provide an easy way to pass a CUDA/cuDNN path to CMake, maybe something like -DCUDNN_LIBRARY_PATH=os.environ.get("CUDA_LIBRARY_PATH", "").
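A minimal sketch of what that could look like, assuming setup.py assembles a list of CMake arguments somewhere in its build step. The helper name cudnn_cmake_args is hypothetical, and the option and variable names simply follow the suggestion above; none of this is taken from rlmeta's actual setup.py.

```python
import os


def cudnn_cmake_args(environ=None):
    """Build extra CMake arguments from environment variables.

    Hypothetical helper: CUDNN_LIBRARY_PATH and CUDA_LIBRARY_PATH follow
    the naming suggested in this issue, not rlmeta's actual build files.
    """
    env = os.environ if environ is None else environ
    args = []
    cuda_lib_path = env.get("CUDA_LIBRARY_PATH", "")
    if cuda_lib_path:
        # Forward the option only when the user set it, so CMake's own
        # detection is left untouched otherwise.
        args.append(f"-DCUDNN_LIBRARY_PATH={cuda_lib_path}")
    return args
```

The returned list could then be appended to whatever argument list the build step already passes to cmake, so that running CUDA_LIBRARY_PATH=/some/path pip install -e . picks up the custom location.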

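Finally, the flag -march=native might cause some issues, especially on HPC clusters: a binary built with it targets the CPU of the build machine and can crash with SIGILL on nodes that lack those instructions. We removed it for our cluster and managed to train reliably on different machines.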
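One possible way to keep the optimization available without making it the default is an opt-in switch. The sketch below assumes the flag is injected from setup.py; the environment variable RLMETA_MARCH_NATIVE and the helper arch_cxx_flags are hypothetical names used only for illustration.

```python
import os


def arch_cxx_flags(environ=None):
    """Return architecture-specific compiler flags, off by default.

    RLMETA_MARCH_NATIVE is a hypothetical opt-in variable, not an
    existing rlmeta option.
    """
    env = os.environ if environ is None else environ
    if env.get("RLMETA_MARCH_NATIVE", "0") == "1":
        # Tune for the build machine's CPU. Only safe when the binary
        # runs on the same CPU model it was built on.
        return ["-march=native"]
    # Portable default: no CPU-specific instructions are requested.
    return []
```

The result could be joined and passed through the standard CMAKE_CXX_FLAGS variable, for example "-DCMAKE_CXX_FLAGS=" + " ".join(arch_cxx_flags()), so that heterogeneous clusters get a portable build unless the user explicitly asks otherwise.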
