metatrain
is a command line interface (CLI) to train and evaluate atomistic
models of various architectures. It features a common yaml
option inputs to configure
training and evaluation. Trained models are exported as standalone files that can be
used directly in various molecular dynamics (MD) engines (e.g. LAMMPS
, i-PI
, ASE
...) using the metatensor atomistic interface.
The idea behind metatrain
is to have a general hub that provides a homogeneous
environment and user interface, transforming every ML architecture into an end-to-end
model that can be connected to an MD engine. Any custom architecture compatible with
TorchScript can be integrated into
metatrain
, gaining automatic access to a training and evaluation interface, as well as
compatibility with various MD engines.
Note:
metatrain
does not provide mathematical functionalities per se, but relies on external models that implement the various architectures.
Currently metatrain
supports the following architectures for building an atomistic
model:
Name | Description |
---|---|
GAP | Sparse Gaussian Approximation Potential (GAP) using Smooth Overlap of Atomic Positions (SOAP). |
PET | Point Edge Transformer (PET), interatomic machine learning potential |
NanoPET (experimental) | Re-implementation of the original PET with slightly improved training and evaluation speed |
PET (deprecated) | Original implementation of the PET model used for prototyping, now deprecated in favor of the native metatrain PET implementation. |
SOAP BPNN | A Behler-Parrinello neural network with SOAP features |
For details, tutorials, and examples, please visit our documentation.
Install metatrain
with pip:
pip install metatrain
Install specific models by specifying the model name. For example, to install the SOAP-BPNN model:
pip install metatrain[soap-bpnn]
We also offer a conda installation:
conda install -c conda-forge metatrain
⚠️ The conda installation does not install model-specific dependencies and will only work for architectures without optional dependencies such as NanoPET or PET.
After installation, you can use mtt from the command line to train your models!
To train a model, use the following command:
mtt train options.yaml
Where options.yaml is a configuration file specifying training options. For example, the following configuration trains a SOAP-BPNN model on the QM9 dataset:
# architecture used to train the model
architecture:
name: soap_bpnn
training:
num_epochs: 5 # a very short training run
# Mandatory section defining the parameters for system and target data of the training set
training_set:
systems: "qm9_reduced_100.xyz" # file where the positions are stored
targets:
energy:
key: "U0" # name of the target value
unit: "eV" # unit of the target value
test_set: 0.1 # 10% of the training_set are randomly split for test
validation_set: 0.1 # 10% of the training_set are randomly split for validation
metatrain
comes with completion definitions for its commands for bash and zsh. You
must manually configure your shell to enable completion support.
To make the completions available, source the definitions in your shell’s startup file
(e.g., ~/.bash_profile
, ~/.zshrc
, or ~/.profile
):
source $(mtt --shell-completion)
Having a problem with metatrain? Please let us know by submitting an issue.
Submit new features or bug fixes through a pull request.
Thanks goes to all people who make metatrain possible: