Skip to content

Training md17 on a GPU results in nan #3

@hwidong-na

Description

@hwidong-na

Hi,

Thanks for sharing a great work. I've tried to obtain results for all tasks in this project, while md17 consistantly gives nan when tarining. Even a small learning rate does the same.

$ uv run main.py task=md17 training.epochs=100 algebra.device=cuda training.lr=0.0000001
>>> Starting Task: MD17 (aspirin)
>>> MD17 aspirin: 50000 samples (40000/5000/5000 train/val/test)
>>> Energy: mean=-406737.25, std=5.99
>>> Force: mean=-0.0097, std=26.5588
Loss: nan | E_MAE: nan | F_MAE: nan | Val_E_MAE: nan | Val_F_MAE: nan | LR: 0.0000:   1%| | 1/100 

BTW, I also found the following paper is relavant to your work as well.

@misc{ruhe2023geometriccliffordalgebranetworks,
      title={Geometric Clifford Algebra Networks}, 
      author={David Ruhe and Jayesh K. Gupta and Steven de Keninck and Max Welling and Johannes Brandstetter},
      year={2023},
      eprint={2302.06594},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2302.06594}, 
}

Best,
Hwidong

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions