• Scalable Nucleotide Alignment Program -- a fast and accurate read aligner for high-throughput sequencing data

    C++ 159 54 Updated Jul 19, 2018
  • Enabling queries on compressed data.

    Java 197 50 Apache-2.0 Updated Jun 22, 2018
  • Cyclades

    C++ 20 6 Apache-2.0 Updated Apr 7, 2018
  • HTML Updated Jan 15, 2018
  • spark-ec2 Archived

    Scripts used to setup a Spark cluster on EC2

    Python 314 278 Apache-2.0 Updated Nov 22, 2017
  • Drizzle integration with Apache Spark

    Scala 93 23 Apache-2.0 Updated Nov 19, 2017
  • R Codebase for BISCUIT: Infinite Mixture Model to cluster and impute single cells.

    R 11 Updated Nov 3, 2017
  • Former GraphX development repository. GraphX has been merged into Apache Spark; please submit pull requests there.

    Scala 306 95 Apache-2.0 Updated Aug 29, 2017
  • A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.

    Python 50 14 Apache-2.0 Updated Jul 3, 2017
  • Code for Ernest

    Python 14 6 Apache-2.0 Updated Jul 1, 2017
  • Simplifying robust end-to-end machine learning on Apache Spark.

    Scala 470 123 Apache-2.0 Updated Apr 19, 2017
  • Scala 108 25 Apache-2.0 Updated Apr 18, 2017
  • Rust 1 Updated Mar 31, 2017
  • An efficient updatable key-value store for Apache Spark

    Scala 242 78 Apache-2.0 Updated Mar 12, 2017
  • A example skeleton for an application built on top of KeystoneML

    Shell 8 5 Updated Mar 6, 2017
  • Distributed Matrix Library

    Scala 67 36 Apache-2.0 Updated Jan 28, 2017
  • Scala 8 7 Apache-2.0 Updated Dec 11, 2016
  • HTML 8 2 Updated Nov 29, 2016
  • Distributed Neural Networks for Spark

    Scala 564 180 MIT Updated Sep 28, 2016
  • An API for Distributed Machine Learning

    Scala 151 60 Updated Sep 23, 2016
  • Experiments for the Ray backend

    C++ 2 2 Updated Aug 7, 2016
  • Build artifacts for Ray Core

    C++ Apache-2.0 Updated Aug 7, 2016
  • Numerical Buffers

    C++ Apache-2.0 Updated Jul 28, 2016
  • C++ 60 Updated Jun 17, 2016
  • caffe

    Forked from BVLC/caffe

    Caffe: a fast open framework for deep learning.

    C++ 3 15,253 Updated May 19, 2016
  • Large scale query engine benchmark

    Python 97 65 Updated Apr 6, 2016
  • Integration Tests for KeystoneML

    Shell 1 1 Apache-2.0 Updated Mar 25, 2016
  • Fine-Grained Distributed Computing

    Python 9 1 Updated Feb 16, 2016
  • Swift Transformations for RegEx queries

    C++ 6 3 Updated Feb 13, 2016
  • Objective-C 3 4 Updated Jan 5, 2016