Optimization of MACE in LAMMPS #44

braydenbanks323 · 2025-08-08T22:18:31Z

braydenbanks323
Aug 8, 2025

Hi all,

I am running LAMMPS with the MACE package installed using some pretrained foundational models. I noticed some slow simulation times, so I am trying to parallelize my run to improve performance. I am running on a cluster where I submit jobs through a slurm script.
My slurm script contains

#SBATCH --nodes=1
#SBATCH --ntasks-per-node=2
#SBATCH --cpus-per-task=16
#SBATCH --gres=gpu:2

export PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True
export OMP_NUM_THREADS=16

ulimit -l unlimited
mpirun -np 2 lmp -k on g 2 -sf kk -e both -in water.in

My input file has the lines:
atom_style atomic
atom_modify map yes
pair_style mace/kk no_domain_decomposition
pair_coeff * * /path/to/mace/models/MACE-matpes-r2scan-omat-ft.model-lammps.pt O H

Yet when I try to run this script, I get this error:
Exception: Specified device cuda:1 does not match device of data cuda:0
Exception raised from make_tensor at aten/src/ATen/Functions.cpp:25 (most recent call first):

Has anybody done some optimizing work like this? What have you found to work?

ilyes319 · 2025-08-08T22:20:09Z

ilyes319
Aug 8, 2025
Maintainer

Did you try using cuequivariance with the MLIAP lammps interface ?

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimization of MACE in LAMMPS #44

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Optimization of MACE in LAMMPS #44

Uh oh!

braydenbanks323 Aug 8, 2025

Replies: 1 comment

Uh oh!

ilyes319 Aug 8, 2025 Maintainer

braydenbanks323
Aug 8, 2025

ilyes319
Aug 8, 2025
Maintainer