sandialabs/BioCompoundML

BioCompoundML

Rapidly screen large numbers of compounds for fuel and chemical properties using machine learning. It's quick: build in minutes, screen in seconds. It's clean: cluster, predict, report and validate in a single interface. It also connects directly to the PubChem API and to a variety of Quantitative Structure-Property and Structure-Activity Relationship (QSPR/QSAR) predictors.

Documentation

See documentation at https://sandialabs.github.io/BioCompoundML/

Build

The most difficult part of the build is getting scikit-learn and beautiful-soup installed and working. It is best to use an existing scientific Python distribution, such as conda or Canopy. Without one, it may take some effort to get BioCompoundML running on your machine. Ultimately, you will need numpy, scipy, scikit-learn, matplotlib and beautiful-soup. Once those are in place, the rest of the setup should be fairly painless.

git clone https://github.com/sandialabs/BioCompoundML.git
cd BioCompoundML
pip install -r requirements.txt
python setup.py install

Dependencies

BioCompoundML is currently tested only under Python 2.6 and 2.7. Python 3 support is being added, but for now the test results differ between Python 2 and Python 3, and certain tests fail under Python 3. At this point, it is advisable to use Python 2.6 or 2.7.

  • numpy==1.10.4
  • scikit-learn==0.17.1
  • scipy==0.17.0
  • beautiful-soup==4.4.1
  • matplotlib==1.4.0
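
Before running the install, it can help to see which of the pinned packages are already present and whether their versions match. The following is a minimal sketch and not part of BioCompoundML: the helper names `check_module` and `report` are hypothetical, and the import names `sklearn` and `bs4` are assumptions about how scikit-learn and beautiful-soup are imported. It only compares version strings for equality and does not judge whether a different version would still work.

```python
# Pinned versions copied from the dependency list above.
# Each entry: (package name as listed, assumed import name, pinned version).
PINNED = [
    ("numpy", "numpy", "1.10.4"),
    ("scikit-learn", "sklearn", "0.17.1"),   # import name is an assumption
    ("scipy", "scipy", "0.17.0"),
    ("beautiful-soup", "bs4", "4.4.1"),      # import name is an assumption
    ("matplotlib", "matplotlib", "1.4.0"),
]


def check_module(import_name, pinned_version):
    """Return (status, installed_version) for one top-level module.

    status is "missing" if the import fails, "unknown" if the module
    exposes no __version__ attribute, "ok" if the version string equals
    the pinned one, and "differs" otherwise.
    """
    try:
        module = __import__(import_name)
    except ImportError:
        return ("missing", None)
    installed = getattr(module, "__version__", None)
    if installed is None:
        return ("unknown", None)
    status = "ok" if installed == pinned_version else "differs"
    return (status, installed)


def report(pinned=PINNED):
    """Return one (package, pinned, installed, status) row per entry."""
    rows = []
    for package, import_name, version in pinned:
        status, installed = check_module(import_name, version)
        rows.append((package, version, installed, status))
    return rows
```

Looping over `report()` and printing each row gives a quick overview. A "differs" status is only a hint to compare against the pinned version, since the sketch does not test compatibility.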

Publication (please cite)

Whitmore, Leanne S., et al. "BioCompoundML: a general biofuel property screening tool for biological molecules using Random Forest Classifiers." Energy & Fuels 30.10 (2016): 8410-8418.

http://pubs.acs.org/doi/abs/10.1021/acs.energyfuels.6b01952

License

BSD 3-Clause. Copyright 2016 Sandia Corporation. Under the terms of Contract DE-AC04-94AL85000 with Sandia Corporation, the U.S. Government retains certain rights in this software.
