globus-labs/molecular-design-at-scale

Multi-Property Molecular Design on HPC

This repository contains a tutorial showing how to rapidly design molecules that meet multiple property targets by using Bayesian optimization on HPC.

The objective of this application is to identify which molecules have the largest ionization energies (IE, the amount of energy required to remove an electron) and something else (TBD).

IE can be computed using various simulation packages (here we use MOPAC). However, these simulations are expensive to run, so with a finite compute budget we must carefully select which molecules to evaluate.

In this example, we use machine learning to predict molecules with high IE based on previous computations (a process often called active learning). We iteratively retrain the machine learning model to improve the accuracy of predictions.
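The loop described above can be sketched in a few lines of plain Python. This is only an illustrative sketch, not the tutorial's implementation: `expensive_simulation` is a hypothetical toy function standing in for a MOPAC calculation, each "molecule" is reduced to a single number, the surrogate is a simple k-nearest-neighbour average rather than the model used in the notebooks, and the selection rule is greedy (pick the highest prediction) rather than a full Bayesian optimization acquisition function.

```python
import random


def expensive_simulation(x):
    """Hypothetical stand-in for a costly IE calculation (e.g., one MOPAC run)."""
    return -(x - 0.7) ** 2  # toy property that peaks at x = 0.7


def predict(x, observed, k=3):
    """Toy surrogate model: average the values of the k closest evaluated points."""
    nearest = sorted(observed, key=lambda pair: abs(pair[0] - x))[:k]
    return sum(y for _, y in nearest) / len(nearest)


def active_learning(candidates, n_initial=4, n_rounds=6, seed=0):
    """Spend a fixed budget of simulations, choosing each new point with the surrogate."""
    rng = random.Random(seed)
    pool = list(candidates)
    rng.shuffle(pool)

    # Start from a few randomly chosen candidates so the surrogate has data.
    observed = [(x, expensive_simulation(x)) for x in pool[:n_initial]]
    pool = pool[n_initial:]

    for _ in range(n_rounds):
        # "Retraining" is implicit: the surrogate always reads the latest observations.
        best = max(pool, key=lambda x: predict(x, observed))
        pool.remove(best)
        # Only the selected candidate is sent to the expensive simulation.
        observed.append((best, expensive_simulation(best)))

    return observed


candidates = [i / 50 for i in range(51)]
observed = active_learning(candidates)
best_x, best_y = max(observed, key=lambda pair: pair[1])
print(f"Evaluated {len(observed)} of {len(candidates)} candidates; best x = {best_x:.2f}")
```

The budget here is `n_initial + n_rounds` simulations. A purely greedy rule like this one can get stuck near the initial samples, which is one reason Bayesian optimization methods also account for the uncertainty of each prediction.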

Running the Tutorial

The tutorial is designed to work on Binder so that you can run it without installing anything.

Just click this link: Binder

Running Locally

Running with local resources can be much faster and lets you save the changes you make to the notebooks.

The demo uses a few codes that are easiest to install with Anaconda. Our environment should work on both Linux and macOS (though M1 systems can be problematic) and can be installed with:

conda env create --file environment.yml

Tutorial Guide

TBD

About

Tutorial for multi-objective active learning on HPC
