GitHub

CSU Chemistry Theory Group Masterclass: Artificial Neural Networks

Sabari Kumar, 20250502

This repo is meant to be a template/guide to creating simple ANN models using pytorch.
Also present is boilerplate config for pre-commit and a pyproject.toml file.

The task: use an ANN model to predict the aqueous solubility (logS) of chemical compounds. We'll use the AqSol DB (https://github.com/mcsorkun/AqSolDB/tree/master) as a raw data source. Then, we'll calculate 2D descriptors using Mordred (https://github.com/mordred-descriptor/mordred), and use them to train a simple ANN model.

WARNING: This repo is not meant to be used without modification for research tasks! Several data curation and model validation tasks have been omitted for brevity.

To install:
Fork, then git clone the repo.
Run pip install -e . to install as an editable package.

This project structure is loosely based on CCDS.
Directory structure:

data: Data files, code to process them into model inputs and load them. Data processing workflows should be set up here as scripts that act on files from raw, and write to processed. No bulk data processing in jupyter notebooks!
- raw: Raw data files. These should be treated as read-only.
- processed: Processed input data, ready to feed into a model
model: Files associated with model structure, such as layers, model definitions, etc.
runtime: Scripts used to run model training and inference, such as python training scripts, slurm batch scripts, etc.
sandbox: Exploratory/prototyping code. Jupyter notebooks should live here.
results: Model logs and checkpoint files.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

CSU Chemistry Theory Group Masterclass: Artificial Neural Networks

Sabari Kumar, 20250502

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
data		data
model		model
results		results
runtime		runtime
sandbox		sandbox
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
README.md		README.md
pyproject.toml		pyproject.toml

SabariKumar/Spring2025_ANN_tutorial

Folders and files

Latest commit

History

Repository files navigation

CSU Chemistry Theory Group Masterclass: Artificial Neural Networks

Sabari Kumar, 20250502

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages