Apache Hadoop 3 Quick Start Guide

This is the code repository for Apache Hadoop 3 Quick Start Guide, published by Packt.

Learn about big data processing and analytics

What is this book about?

Apache Hadoop is a widely used distributed data platform. It enables large datasets to be efficiently processed instead of using one large computer to store and process the data. This book will get you started with the Hadoop ecosystem, and introduce you to the main technical topics, including MapReduce, YARN, and HDFS.

This book covers the following exciting features:

Store and analyze data at scale using HDFS, MapReduce and YARN
Install and configure Hadoop 3 in different modes
Use Yarn effectively to run different applications on Hadoop based platform
Understand and monitor how Hadoop cluster is managed
Consume streaming data using Storm, and then analyze it using Spark

If you feel this book is for you, get your copy today!

Instructions and Navigations

All of the code is organized into folders. For example, Chapter02.

The code will look like the following:

  <dependencies>
         <dependency>
             <groupId>org.apache.hadoop</groupId>
             <artifactId>hadoop-client</artifactId>
             <version>3.1.0</version>
         </dependency>
     </dependencies>

Following is what you need for this book: Aspiring Big Data professionals who want to learn the essentials of Hadoop 3 will find this book to be useful. Existing Hadoop users who want to get up to speed with the new features introduced in Hadoop 3 will also benefit from this book. Having knowledge of Java programming will be an added advantage.

With the following software and hardware list you can run all code files present in the book (Chapter 1-8).

Software and Hardware List

Chapter	Software required	OS required
2 to 8	OpenJDK 1.8.0_171 64 bit Apache Hadoop-3.1.0	Ubuntu 16.04.3_LTS

Code in Action

Click on the following link to see the Code in Action:

http://bit.ly/2AznxS3

Related products

Hadoop 2.x Administration Cookbook [Packt] [Amazon]
Hadoop Real-World Solutions Cookbook - Second Edition [Packt] [Amazon]

Get to Know the Author

Hrishikesh Vijay Karambelkar is an innovator and an enterprise architect with 16 years of software design and development experience, specifically in the areas of big data, enterprise search, data analytics, text mining, and databases. He is passionate about architecting new software implementations for the next generation of software solutions for various industries, including oil and gas, chemicals, manufacturing, utilities, healthcare, and government infrastructure. In the past, he has authored three books for Packt Publishing: two editions of Scaling Big Data with Hadoop and Solr and one of Scaling Apache Solr. He has also worked with graph databases, and some of his work has been published at international conferences such as VLDB and ICDE.

Other books by the authors

Suggestions and Feedback

Click here if you have any feedback or suggestions.

Download a free PDF

If you have already purchased a print or Kindle version of this book, you can get a DRM-free PDF version at no cost.
Simply click on the link to claim your free PDF.

https://packt.link/free-ebook/9781788999830

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
Chapter2		Chapter2
Chapter3		Chapter3
Chapter4		Chapter4
Chapter5		Chapter5
Chapter7		Chapter7
Chapter8		Chapter8
RemoteSystemsTempFiles		RemoteSystemsTempFiles
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chapter2

Chapter2

Chapter3

Chapter3

Chapter4

Chapter4

Chapter5

Chapter5

Chapter7

Chapter7

Chapter8

Chapter8

RemoteSystemsTempFiles

RemoteSystemsTempFiles

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

Repository files navigation

Apache Hadoop 3 Quick Start Guide

What is this book about?

Instructions and Navigations

Software and Hardware List

Code in Action

Related products

Get to Know the Author

Other books by the authors

Suggestions and Feedback

Download a free PDF

About

Releases

Packages

Contributors 5

Languages

License

PacktPublishing/Apache-Hadoop-3-Quick-Start-Guide

Folders and files

Latest commit

History

Repository files navigation

Apache Hadoop 3 Quick Start Guide

What is this book about?

Instructions and Navigations

Software and Hardware List

Code in Action

Related products

Get to Know the Author

Other books by the authors

Suggestions and Feedback

Download a free PDF

About

Resources

License

Stars

Watchers

Forks

Languages