Skip to content

agilemobiledev/Essentials

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

39 Commits
 
 
 
 

Repository files navigation

Hadoop Essentials

This GitHub project stores content related to the Hadoop Essentials course offering from Hortonworks Univeristy.

Status: Under Development

Approach & Setup

The Hadoop Essentials course uses demonstrations instead of hands-on labs due to the short duration of the offering. That said, the demos are closely aligned with the publicly available tutorials.

Additionally, to allow participants to recreate the demos performed during the course, the Hortonworks Sandbox is utilized. See Sandbox Setup for specific setup and configuration details regarding this course.

The target audience for this repo is the instructors themselves to provide them with guidance for presenting these demos to a live audience, but all are welcome to utilize and feedback (and fixes via pull requests) is surely appreciated.

The Demonstrations

Operational Overview with Ambari

Loading Data into HDFS

Streaming Data into HDFS << Time Permittting

Foundational Processing with MapReduce << Time Permittting

Data Manipulation with Hive

Risk Analysis with Pig

Risk Analysis with Spark

Data Pipeling with Falcon << Time Permittting

Securing Hive with Ranger << Time Permittting

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Java 100.0%