Skip to content
This repository has been archived by the owner on Oct 31, 2023. It is now read-only.

Pip installation fails in virtual env and SIGILL on DGX machines #88

Open
d3sm0 opened this issue Oct 18, 2022 · 0 comments
Open

Pip installation fails in virtual env and SIGILL on DGX machines #88

d3sm0 opened this issue Oct 18, 2022 · 0 comments

Comments

@d3sm0
Copy link

d3sm0 commented Oct 18, 2022

it seems that pip install -e . does prepare the proper directories but does not include the built package. We solved by adding:

+        include_package_data=False,
+        packages=find_packages(include=['rlmeta', 'rlmeta.*']),

here:

ext_modules=[CMakeExtension("rlmeta", "./rlmeta")],

Nit: It might be useful to provide an easy way to pass a cuda/cudnn path to cmake, maybe something like DCUDNN_LIBRARY_PATH=os.einviron.get("CUDA_LIBRARY_PATH, "")

Finally the flag --march=native might cause some issues especially for HPC. We removed it for our cluster and managed to reliably train on different machines.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant