Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White
Branch: 3e (81 commits behind tomwhite:3e)

app3/src/main/sh
avro
book
ch02
ch03
ch04
ch05
ch07
ch08
ch09/src/main
ch11
ch12
ch13
ch14
ch15
ch16
common
experimental
hadoop-examples
hadoop-meta
hbase-examples
hive-examples
input
pig-examples
snippet
sqoop-examples
zookeeper-examples
.gitignore
CHANGES
README
build-examples.xml
build.xml
findbugs-exclude.xml
ivy.xml
pom.xml

README

Example code for "Hadoop: The Definitive Guide, Third Edition" by Tom White.
Copyright (C) 2011 Tom White, ISBN 978-1-449-31152-0

http://www.hadoopbook.com/
http://oreilly.com/catalog/9781449311520/

The code is hosted at http://github.com/tomwhite/hadoop-book/. You can find code
for the first edition at http://github.com/tomwhite/hadoop-book/tree/1e, and
for the second edition at http://github.com/tomwhite/hadoop-book/tree/2e.

This version of the code has been tested with:
 * Hadoop 0.20.2
 * Avro 1.4.1
 * Pig 0.8.3
 * Hive 0.7.1-SNAPSHOT
 * HBase 0.90.3
 * ZooKeeper 3.3.3
 * Sqoop 1.2.0-cdh3u0

Before running the examples you need to install Hadoop, Pig, Hive, HBase,
ZooKeeper, and Sqoop (as appropriate) as explained in the book.

You also need to install Maven.

Then you can build the code with:

% mvn package -DskipTests

You should then be able to run the examples from the book.
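
Before building, it may help to confirm that the required tools are on your
PATH. The following is a minimal sketch, not part of the book's code: the
helper name check_tool is hypothetical, and the launcher names in the loop
are assumptions that you should adjust to the tools you actually installed.

```shell
#!/bin/sh
# check_tool is a hypothetical helper, not part of the book's code.
# It reports whether a command can be found on the PATH.
check_tool() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "found: $1"
  else
    echo "missing: $1"
  fi
}

# Launcher names are assumptions; remove any tools you do not need.
for tool in mvn hadoop pig hive hbase sqoop; do
  check_tool "$tool"
done
```

If a tool you need is reported as missing, install it as explained in the
book before running the mvn command above.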

Chapter names for "Hadoop: The Definitive Guide", Second Edition

ch01 - Meet Hadoop
ch02 - MapReduce
ch03 - The Hadoop Distributed Filesystem
ch04 - Hadoop I/O
ch05 - Developing a MapReduce Application
ch06 - How MapReduce Works
ch07 - MapReduce Types and Formats
ch08 - MapReduce Features
ch09 - Setting Up a Hadoop Cluster
ch10 - Administering Hadoop
ch11 - Pig
ch12 - Hive
ch13 - HBase
ch14 - ZooKeeper
ch15 - Sqoop
ch16 - Case Studies

app1 - Installing Apache Hadoop
app2 - Cloudera's Distribution for Hadoop
app3 - Preparing the NCDC Weather Data