Skip to content

Parameter estimation and model calibration using Genetic Algorithm optimization in Python.

License

Notifications You must be signed in to change notification settings

blakeaw/GAlibrate

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GAlibrate

Python version badge license version release anaconda cloud Codacy Badge Code style: black Static Badge DOI

GAlibrate is a python toolkit that provides an easy to use interface for model calibration/parameter estimation using an implementation of continuous genetic algorithm-based optimization. Its functionality and API were designed to be familiar to users of the PyDREAM, simplePSO, and Gleipnir packages.

Although GAlibrate provides a general framework for running continuous genetic algorithm-based optimizations, it was created with systems biology models in mind. It therefore supplies additional tools for working with biological models in the PySB format.

What's new in

version 0.7.0

  • Julia integration - New version of core GA that ports some key funtions to Julia using the PyJulia package.
  • New benchmarks module defining a set of functions used to benchmark and test single objective optimazation routines.
  • Test suite using pytest with 63% overall coverage.
  • Updated profiling and performance benchmarking Jupyter notebooks.
  • Function to resume/continue GAO runs for additional generations: GAO.resume.
  • Several new example cases under examples

version 0.6.0

  • core GA now returns an array with fitness value of the fittest individual from each generation which can be accessed from the GAO property GAO.best_fitness_per_generation.
  • Bug fix in core GA for sorting the population before selection and mating.

version 0.5.0

  • Optional progress bar to monitor passage of generations during GAO run that is only displayed if tqdm is installed
  • Optional multiprocessing based parallelism when evaluating the fitness function over the population during a GAO run.

Table of Contents

  1. Install
    1. pip install
    2. conda install
    3. Recomended additional software
  2. License
  3. Change Log
  4. Documentation and Usage
    1. Quick Overview
    2. Examples
  5. Contact

Install

! Note
GAlibrate is still in version zero development so new versions may not be backwards compatible.

GAlibrate installs as the galibrate package. It is compatible (i.e., tested) with Python 3.10.11.

Note that galibrate has the following core dependencies:

pip install

You can install the latest release of the galibrate package using pip sourced from the GitHub repo -

Fresh install:

pip install https://github.com/blakeaw/GAlibrate/archive/refs/tags/v0.7.2.zip

Or to upgrade from an older version:

pip install --upgrade https://github.com/blakeaw/GAlibrate/archive/refs/tags/v0.7.2.zip

PyPI

galibrate can also be pip installed from PyPI,

pip install galibrate

but this version currently doesn't include the Cython accelerated version of the core GA algorithm.

conda

You can install the galibrate package from the blakeaw channel:

conda install -c blakeaw galibrate

NumPy and SciPy dependencies will be automatically installed with this version.

Recommended additional software

The following software is not required for the basic operation of GAlibrate, but provides extra capabilities and features when installed.

Cython

GAlibrate includes an implementation of the core genetic algorithm that is written in Cython, which takes advantage of Cython-based optimizations and compilation to accelerate the algorithm. This version of genetic algorithm is used if Cython is installed.

Numba

GAlibrate also includes an implementation of the core genetic algorithm that takes advantage of Numba-based JIT compilation and optimization to accelerate the algorithm. This version of genetic algorithm is used if Numba is installed.

Julia

GAlibrate also includes an implementation of the core genetic algorithm that takes advantage of porting some key functions to Julia for JIT compilation and optimization to accelerate the algorithm. This version of genetic algorithm requires Julia and PyJulia; note that the Python-based CLI tool jill is also an option for automating the process of downloading and installing Julia.

tqdm

GAO runs will display a progress bar that tracks the passage of generations when the tqdm package installed.

PySB

PySB is needed to run PySB models, and it is therfore needed if you want to use tools from the `galibrate.pysb`` package.


Testing

Tests and coverage analysis use

Running locally from the GAlibrate repo folder:

coverage run -m pytest

then to see coverage report:

coverage report -m

License

This project is licensed under the MIT License - see the LICENSE file for details


Change Log

See: CHANGELOG


Documentation and Usage

Quick Overview

Principally, GAlibrate defines the GAO (continuous Genetic Algorithm-based Optimizer ) class,

from galibrate import GAO

which defines an object that can be used setup and run a continuous genetic algorithm-based optimization (i.e., a maximization) of a user-defined fitness function over the search space of a given set of (model) parameters.

multiprocessing-based parallelism

The multiprocessing-based parallelism (single node) can be invoked by passing the keyword argument nprocs with a value greater than one when calling the GAO.run function; for example, gao.run(nprocs=2) will use two processes. A full example is provided in this script.

Parallelism is used when evaluating the fitness function across the population (whole population during initialization and half the population during subsequent generations). You can expect the most parallel speedup when the fitness function is expensive to evaluate, such as when evaluating a PySB model. You may also get speedup when the population is very large, depending on how expensive the fitness function is to evaluate. Note however, that if the fitness function is fast to evaluate then the parallel overhead may actually slow down the run.

PySB models

Additionally, GAlibrate has a pysb sub-package that provides the galibrate_it module, which defines the GaoIt and GAlibrateIt classes (importable from the galibrate.pysb package level),

from galibrate.pysb import GaoIt, GAlibrateIt

which create objects that abstract away some of the effort to setup and generate GAO instances for PySB models; examples/pysb_dimerization_model provides some examples for using GaoIt and GAlibrateIt objects. The galibrate_it module can also be called from the command line to generate a template run script for a PySB model,

python -m galibrate.pysb_utils.galibrate_it pysb_model.py output_path

which users can then modify to fit their needs.

Examples

Additional example scripts that show how to setup and launch Genetic Algorithm runs using GAlibrate can be found under examples.


Contact

For Support

Email support inquiries to blakeaw1102@gmail.com

Interested in contributing

See CONTRIBUTING