Gremlin, an adversarial evolutionary algorithm that discovers biases or weaknesses in machine learners

Gremlin learns where a given machine learner (ML) model performs poorly via an adversarial evolutionary algorithm (EA). The EA will find the worst performing feature sets such that a practitioner can then, say, tune the training data to include more examples of those feature sets. Then the ML model can be trained again with the updated training set in the hopes that the additional examples will be sufficient for the ML to train models that perform better for those sets.

Gremlin is a 2022 R&D 100 Award Winner!

Requires

Python >= 3.7.0
[LEAP https://github.com/AureumChaos/LEAP/tree/develop](https://github. com/AureumChaos/LEAP/tree/develop) -- **Note that this is for the LEAP develop branch.

Installation

Activate your conda or virtual environment
cd into top-level gremlin directory
pip install .

Configuration

Gremlin is a thin convenience wrapper around [LEAP] (https://github.com/AureumChaos/LEAP). Instead of writing a script in LEAP, one would instead point the gremlin executable at a YAML file that describes what LEAP classes, subclasses, and functions to use, as well as other salient run-time characteristics. gremlin will parse the YAML file and generate a CSV file containing the individuals from the run. This CSV file should contain information that can be exploited to tune training data.

Example Gremlin configuration YAML:

pop_size: 25
algorithm: async # or bgen
async: # parameters for asynchronous steady-state EA
  init_pop_size: ${pop_size}
  max_births: 2000
  ind_file: inds.csv # optional file for writing individuals as they are evaluated
  ind_file_probe: probe.log_ind # optional functor or function for writing ind_file

pop_file: pop.csv # where we will write out each generation in CSV format
problem: problem.QLearnerBalanceProblem("${oc.env:GREMLIN_QLEARNER_CARTPOLE_MODEL_FPATH}")
representation: representation.BalanceRepresentation()
preamble: |
  import probe # need to import our probe.py so that LEAP sees our probe pipeline operator
pipeline: # isotropic means we mutate all genes with the given stds
  - ops.random_selection
  - ops.clone
  - mutate_gaussian(expected_num_mutations='isotropic', std=[0.1, 0.001, 0.01, 0.001], hard_bounds=representation.BalanceRepresentation.genome_bounds)
  - ops.pool(size=1)

Essentially, you will have to define the following

A LEAP Representation subclass, as denoted in the above config file
- This, in turn, will mean defining a LEAP Decoder and optionally a bespoke LEAP Individual subclass.
  - The latter is mostly to override a __str__() member function for pretty printing.
- We also suggest using a python named tuple to define the genes in the phenotype as a convenience. (See examples/MNIST/representation.py)
A LEAP Problem subclass.
- This is the meat of defining your problem for Gremlin to tackle.
- This subclass may be responsible for loading your models and then applying them as denoted by genes values per individual.
- E.g., the examples/MNIST/representation.py dictates that each individual has a single gene that represents a given digit, and the corresponding problem.py defines an evaluate() function that decodes a given individuals digit as denoted in its gene, and then checks to see how well the MNIST model predicts for that digit across a test dataset.
- Similarly you will need to define a LEAP Problem subclass that is responsible for loading datasets and models and then predicting how effective they are for a given feature represented by a single individual.

Examples

Example code and configuration for a real problem can be found in examples/MNIST. This problem involves Gremlin discovering that one of the digits for the MNIST training data is poorly represented.

This can be run simply by (must be in examples/MNIST directory):

$ gremlin config.yml

Documentation

The wiki has more detailed documentation, particularly on how the YAML config files can be set up for Gremlin runs.

Versions

Note that more detailed explanations for version changes can be found in the CHANGELOG.

v0.6, in progress on develop
v0.5, 2/3/23
- Main installed executable now gremlin and not gremlin.py. Added optional async.with_client config section. Improvements made to setup. py.
v0.4, 9/30/22
- Added config variable async.with_client that allows for interacting with Dask before the EA runs; e.g., client.wait_for_workers() or client.upload_file()
- Replaced imports with preamble in YAML config files thus giving more flexibility for importing dependencies, but also allows for defining functions and variables that may be referred to in, say, the pipeline.
v0.3, 3/9/22
- Add support for config variable algorithm that denotes if using a traditional by-generation EA or an asynchronous steady-state EA
v0.2dev, 2/17/22
- revamped config system and heavily refactored/simplified code
v0.1dev, 10/14/21
- initial raw release

Sub-directories

gremlin/ -- main gremlin code
examples/ -- examples for using gremlin; currently only has MNIST example

Main web site

The gremlin github repository is https://github.com/markcoletti/gremlin.
main is the release branch and active work occurs on the develop branch.

Name		Name	Last commit message	Last commit date
Latest commit History 197 Commits
examples/MNIST		examples/MNIST
gremlin		gremlin
.gitignore		.gitignore
CHANGELOG		CHANGELOG
LICENSE		LICENSE
RD100_2022_Winner_Logo-small.png		RD100_2022_Winner_Logo-small.png
README.md		README.md
pyproject.toml		pyproject.toml
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Gremlin, an adversarial evolutionary algorithm that discovers biases or weaknesses in machine learners

Requires

Installation

Configuration

Examples

Documentation

Versions

Sub-directories

Main web site

About

Releases 6

Packages

Languages

License

leap-ec/gremlin

Folders and files

Latest commit

History

Repository files navigation

Gremlin, an adversarial evolutionary algorithm that discovers biases or weaknesses in machine learners

Requires

Installation

Configuration

Examples

Documentation

Versions

Sub-directories

Main web site

About

Resources

License

Stars

Watchers

Forks

Releases 6

Packages 0

Languages

Packages