@apache @cloudera @jclouds @bigdatagenomics @HadoopGenomics @lasersonlab

Pinned repositories

  1. hadoop-book

    Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White

    Makefile 2.3k 2.1k

  2. broadinstitute/gatk

    Official code repository for GATK versions 4 and up

    Java 459 181

  3. HadoopGenomics/Hadoop-BAM

    Hadoop-BAM is a Java library for the manipulation of files in common bioinformatics formats using the Hadoop MapReduce framework

    Java 62 49

  4. disq-bio/disq

    A library for manipulating bioinformatics sequencing formats in Apache Spark

    Java 2 2

  5. set-game

    Play SET using image recognition and deep learning

    Java 5 2

  6. lasersonlab/single-cell-experiments

    Experiments to run single cell analyses efficiently at scale using Zarr, anndata, Scanpy, and Apache Spark

    Python 3 2

481 contributions in the last year

Contribution activity

September 2018

Created a pull request in disq-bio/disq that received 3 comments

[DISQ-29] Use a separate instance of VCFCodec for each partition

This exposes the bug in #29 by using all available cores on a machine for testing (in BaseTest). (I was also able to reproduce the bug when testing…

+24 −11 3 comments

Created an issue in disq-bio/disq that received 1 comment

HtsjdkVariantsRdd fails when reading with concurrent tasks

Running the unit tests with multiple cores results in exceptions like testReadAndWriteMultiple(HiSeq.10000.vcf.bgz, 131072, VCF_BGZ) [2](org.disq_b…

1 comment
