Engineer and author of "Hadoop in Practice".
Source code to accompany the book "Hadoop in Practice", published by Manning.
Vagrant project to spin up a single virtual machine running current versions of Hadoop, Hive and Spark.
Utility to easily copy files into HDFS.
InputFormat that can split multi-line JSON.
A set of utilities that make working with Hadoop a little easier.