Grow your team on GitHub
GitHub is home to over 28 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.Sign up
Scalable Nucleotide Alignment Program -- a fast and accurate read aligner for high-throughput sequencing data
Enabling queries on compressed data.
Scripts used to setup a Spark cluster on EC2
Drizzle integration with Apache Spark
R Codebase for BISCUIT: Infinite Mixture Model to cluster and impute single cells.
Former GraphX development repository. GraphX has been merged into Apache Spark; please submit pull requests there.
A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.
Code for Ernest
Simplifying robust end-to-end machine learning on Apache Spark.
An efficient updatable key-value store for Apache Spark
A example skeleton for an application built on top of KeystoneML
Distributed Matrix Library
Distributed Neural Networks for Spark
An API for Distributed Machine Learning
Experiments for the Ray backend
Build artifacts for Ray Core
Caffe: a fast open framework for deep learning.
Large scale query engine benchmark
Integration Tests for KeystoneML
Fine-Grained Distributed Computing
Swift Transformations for RegEx queries