Skip to content

PacktPublishing/Hadoop-Essentials

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Hadoop-Essentials

This is the code repository for Hadoop-Essentials, published by Packt. It contains all the supporting project files necessary to work through the book from start to finish.

Instructions and Navigation:

All of the code is organized into folders. Each folder starts with C followed by the chapter number. For example, HDFS commands.txt is a file from Chap 1, and HBase commands.txt is one from Chap 5.

What you need for the code files:

A prerequisite of Java programming and basics of distributed computing will be very helpful and an interest to understand about Hadoop and its ecosystem components. The code and syntax have been tested in Hadoop 2.4.1 and other compatible ecosystem component versions, but may vary in the newer version.

Software and Hardware requirements:

  1. Apache Hadoop 2.x - Install Ambari 1.7.0 - Atleast 4 node cluster with average configuration and at 16 GBit Ethernet - Linux
  2. Hive, Pig - Install Ambari 1.7.0 - Atleast 4 node cluster with average configuration and at 16 GBit Ethernet - Linux
  3. HBase - Install Ambari 1.7.0 - Atleast 4 node cluster with average configuration and at 16 GBit Ethernet - Linux
  4. Sqoop, Flume - sqoop 1.4.5, Flume 1.5.2 - Atleast 4 node cluster with average configuration and at 16 GBit Ethernet - Linux

Related Hadoop books:

Download a free PDF

If you have already purchased a print or Kindle version of this book, you can get a DRM-free PDF version at no cost.
Simply click on the link to claim your free PDF.

https://packt.link/free-ebook/9781784396688

Releases

No releases published

Packages

No packages published

Languages