Running pair_allegro with Kokkos on multiple GPUs #11

Closed
ryandeng1 opened this issue Jan 6, 2023 · 1 comment

Comments


ryandeng1 commented Jan 6, 2023

Hi, I am trying to run pair_allegro with Kokkos on a system with 4 GPUs. I am creating 4 MPI processes with 1 GPU each.

The command looks something like this:

mpirun -d -x LD_LIBRARY_PATH -np 4 --mca pml ob1 --mca btl ^openib /home/ubuntu/lammps_install/bin/lmp -sf kk -k on g 1 -pk kokkos newton on neigh full < /home/ubuntu/md_allegro.in

and I get an error saying "Specified device cuda:1 does not match device of data cuda:0". This occurs in the model forward call inside pair_allegro. I tried debugging and it seems that the device the model is initially loaded onto matches the data when later calling the compute method.

Not sure if the command I am running is incorrect or if it's some other issue. Thanks!

anjohan (Collaborator) commented Jan 6, 2023

Hi,

The g parameter should be set to the number of GPUs per node, not the number of GPUs per MPI process. On a node with 4 GPUs that means g 4 rather than g 1. See the LAMMPS docs.
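
As a rough sketch of that fix, assuming a single node with 4 GPUs and one MPI rank per GPU, the original command could be adjusted as shown here. The variable names NGPUS_PER_NODE, NRANKS and LMP_CMD are made up for illustration; the paths and MPI options are copied from the original post and are not verified. The likely cause of the device mismatch is that with g 1 Kokkos places every rank's data on GPU 0, but that is an assumption rather than something confirmed in this thread.

```shell
# Sketch only: assumes one node with 4 GPUs and one MPI rank per GPU.
NGPUS_PER_NODE=4   # value for the Kokkos "g" option: GPUs per node, not per rank
NRANKS=4           # total MPI ranks on this single node

# Build the launch command as a string so it can be inspected first.
# Everything except the value after "g" is taken from the original post.
LMP_CMD="mpirun -d -x LD_LIBRARY_PATH -np ${NRANKS} --mca pml ob1 --mca btl ^openib \
/home/ubuntu/lammps_install/bin/lmp -sf kk -k on g ${NGPUS_PER_NODE} \
-pk kokkos newton on neigh full < /home/ubuntu/md_allegro.in"

# Print the command; to launch it, run: eval "${LMP_CMD}"
echo "${LMP_CMD}"
```

Printing the command before launching makes it easy to check that the value after g matches the GPU count on the node. On a multi-node run, NRANKS would grow with the node count while NGPUS_PER_NODE stays at the per-node value.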
