Skip to content
More interactive weak supervision with FlyingSquid
Python
Branch: master
Clone or download

Latest commit

Files

Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
examples Update images in tutorials Feb 28, 2020
figs Add logo Feb 26, 2020
flyingsquid Add examples folder, update FSLoss interface, update tutorials README Feb 25, 2020
.gitignore Install instructions Feb 24, 2020
LICENSE Create LICENSE Mar 2, 2020
README.md Update install instructions Mar 3, 2020
environment.yml Install instructions Feb 24, 2020
setup.py Install instructions Feb 24, 2020

README.md

More Interactive Weak Supervision with FlyingSquid

FlyingSquid is a new framework for automatically building models from multiple noisy label sources. Users write functions that generate noisy labels for data, and FlyingSquid uses the agreements and disagreements between them to learn a label model of how accurate the labeling functions are. The label model can be used directly for downstream applications, or it can be used to train a powerful end model:

FlyingSquid can be used to build models for all sorts of tasks, including text applications, video analysis, and online learning. Check out our blog post and paper on arXiv for more details!

Getting Started

  • Quickly install FlyingSquid
  • Check out the examples folder for tutorials and some simple code examples

Sample Usage

from flyingsquid.label_model import LabelModel
import numpy as np

L_train = np.load('...')

m = L_train.shape[1]
label_model = LabelModel(m)
label_model.fit(L_train)

preds = label_model.predict(L_train)

Installation

We recommend using conda to install FlyingSquid:

git clone https://github.com/HazyResearch/flyingsquid.git

cd flyingsquid

conda env create -f environment.yml
conda activate flyingsquid

pip install -e .

cd ..

Alternatively, you can install the dependencies yourself:

  • Pgmpy
  • PyTorch (only necessary for the PyTorch integration)

And then install the actual package:

git clone https://github.com/HazyResearch/flyingsquid.git

cd flyingsquid

pip install -e .

cd ..

Citation

If you use our work or found it useful, please cite our arXiv paper for now:

@article{fu2020fast,
  author = {Daniel Y. Fu and Mayee F. Chen and Frederic Sala and Sarah M. Hooper and Kayvon Fatahalian and Christopher R\'e},
  title = {Fast and Three-rious: Speeding Up Weak Supervision with Triplet Methods},
  journal = {arXiv preprint arXiv:2002.11955}
  year = {2020},
}
You can’t perform that action at this time.