@HazyResearch

HazyResearch

We are a CS research group led by Prof. Chris Ré.

Pinned repositories

  1. snorkel

    A system for quickly generating training data with weak supervision

    Jupyter Notebook 1.3k 303

  2. EmptyHeaded

    Make databases great again!

    C++ 83 14

  • 📰 a knowledge base construction engine for richly formatted data

    Python 49 16 MIT 1 issue needs help Updated Aug 16, 2018
  • A system for quickly generating training data with weak supervision

    Jupyter Notebook 1,296 303 Apache-2.0 4 issues need help Updated Aug 16, 2018
  • Snorkel MeTaL: A framework for training models with multi-task weak supervision

    Python 5 2 Apache-2.0 Updated Aug 16, 2018
  • 📓 a collection of simple tutorials for using Fonduer

    Jupyter Notebook 12 3 MIT Updated Aug 10, 2018
  • 🌲 A tool for parsing PDF documents into a heirarchical, HTML-like tree.

    Python 38 9 MIT 3 issues need help Updated Aug 9, 2018
  • A system for generating training labels via natural language explanations

    30 1 Apache-2.0 Updated Aug 9, 2018
  • HTML 3 4 Updated Jul 28, 2018
  • Hyperbolic Embeddings

    Julia 47 7 Updated Jul 25, 2018
  • Low precision random Fourier features for kernel approximation

    Python 1 Apache-2.0 Updated Jul 17, 2018
  • Weakly Supervised MRI Series Classification for the UK Biobank

    Python 2 Apache-2.0 Updated Jun 16, 2018
  • Python 9 5 Updated Jun 9, 2018
  • Jupyter Notebook 6 5 Updated Jun 1, 2018
  • Numba-based version of DimmWitted Gibbs sampler

    Python 20 8 Updated May 15, 2018
  • Learning to Compose Domain-Specific Transformations for Data Augmentation

    Python 85 19 MIT Updated May 12, 2018
  • Reproducible code for Augmentation paper

    Python 2 Apache-2.0 Updated May 7, 2018
  • Python 3 1 MIT Updated May 3, 2018
  • Automatically labeling training data

    Jupyter Notebook 11 2 Apache-2.0 Updated Apr 14, 2018
  • Table Extraction Tool

    Jupyter Notebook 38 18 Updated Feb 28, 2018
  • DeepDive

    Shell 1,546 473 Apache-2.0 Updated Jan 29, 2018
  • A PCA-based engine for embeddings

    Python 3 680 Apache-2.0 Updated Dec 9, 2017
  • Public materials for the Fall 2016 offering of CS145

    Jupyter Notebook 31 33 Updated Oct 6, 2017
  • Accelerated Stochastic Power Iteration with Momentum

    Jupyter Notebook 2 Updated Oct 4, 2017
  • Tools for iterative knowledge base development with DeepDive

    CoffeeScript 71 20 Updated Sep 13, 2017
  • DeepDive Biomedical Tools

    Python 9 4 Updated Apr 4, 2017
  • Make databases great again!

    C++ 83 14 Updated Mar 16, 2017
  • Compiler for writing DeepDive applications in a Datalog-like language — ⚠️🚧🛑 REPO MOVED TO DEEPDIVE 👇🏿

    Scala 17 3 Updated Jan 24, 2017
  • DimmWitted Gibbs Sampler in C++ — ⚠️🚧🛑 REPO MOVED TO DEEPDIVE 👉🏿

    C++ 14 9 Apache-2.0 Updated Jan 24, 2017
  • imgaug

    Forked from aleju/imgaug

    Image augmentation for machine learning experiments.

    Python 2 757 MIT Updated Jan 1, 2017
  • JavaScript 14 10 Updated Dec 6, 2016
  • Models built with TensorFlow

    Jupyter Notebook 2 23,706 Apache-2.0 Updated Oct 5, 2016