• Open source formats for scalable genomic processing systems using Avro. Apache 2 licensed.

    Shell 24 32 Updated Mar 27, 2017
  • ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark and Parquet. Apache 2 licensed.

    Scala 602 213 Updated Mar 26, 2017
  • A scalable genome browser. Apache 2 licensed.

    Scala 41 15 Updated Mar 22, 2017
  • General (non-omics) code used across BDG products. Apache 2 licensed.

    Scala 19 21 Updated Mar 21, 2017
  • Conversions to and from Big Data Genomics Avro Formats. Apache 2 licensed.

    Java 2 Updated Mar 14, 2017
  • Scala 2 10 Updated Mar 8, 2017
  • A Variant Caller, Distributed. Apache 2 licensed.

    Scala 56 37 Updated Mar 1, 2017
  • Web Site for the Big Data Genomics Group

    HTML 9 6 Updated Jan 9, 2017
  • A refreshing treatment for all quality control ailments. Apache 2 licensed.

    Scala 2 6 Updated Oct 13, 2016
  • Exemplar API that mediates Toil with a WDL front-end and workflow tracking.

    Java 1 Updated Jul 31, 2016
  • Parallel alignment using SNAP on ADAM. Apache 2 licensed.

    Scala 2 1 Updated Jul 6, 2016
  • An RNA pipeline built on top of ADAM. Apache 2 licensed.

    Scala 17 15 Updated Apr 29, 2016
  • Ready-to-go Parquet-formatted public 'omics datasets

    Python 26 7 Updated Nov 2, 2015
  • Recipes using BDG projects. Apache 2 licensed.

    Shell 4 3 Updated Mar 25, 2015
  • Assembler for PacBio reads. Apache 2 licensed.

    Scala 4 4 Updated Mar 14, 2015
  • Read error correction utilities.

    2 Updated Mar 1, 2015
  • Notebook tools for Big Data Genomics. Apache 2 licensed.

    JavaScript 3 432 Updated Mar 1, 2015
  • Utility classes for wrapping services or other interfaces around a Spark/ADAM cluster. Apache 2 licensed.

    Java 5 7 Updated Nov 17, 2014