tomwhite

@apache @cloudera @jclouds @bigdatagenomics @HadoopGenomics @lasersonlab @disq-bio

Pinned repositories

  1. hadoop-book

    Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White

    Makefile, 2.6k stars, 2.2k forks

  2. broadinstitute/gatk

    Official code repository for GATK versions 4 and up

    Java, 560 stars, 223 forks

  3. HadoopGenomics/Hadoop-BAM

    Hadoop-BAM is a Java library for the manipulation of files in common bioinformatics formats using the Hadoop MapReduce framework

    Java, 62 stars, 48 forks

  4. disq-bio/disq

    A library for manipulating bioinformatics sequencing formats in Apache Spark

    Java, 10 stars, 4 forks

  5. set-game

    Play SET using image recognition and deep learning

    Java, 8 stars, 2 forks

  6. lasersonlab/zappy

    Distributed processing with NumPy and Zarr

    Python, 3 stars, 1 fork

572 contributions in the last year

Contribution activity

February 2019

Created a pull request in disq-bio/disq that received 4 comments

Add a method to FileSystemWrapper to get file lengths when doing a directory listing

This saves many calls to the file system when merging part files. On the local filesystem, and even HDFS, the current behaviour is not noticeable, …

+114 −29, 4 comments

Created an issue in broadinstitute/gatk that received 2 comments

ReadsPipelineSpark fails with "Interval not within the bounds of a contig"

Bug Report. Affected tool(s) or class(es): ReadsPipelineSpark. Affected version(s): Latest public release version; Latest master branch as of…

1 of 2 tasks, 2 comments