Toy Hadoop cluster combining various SQL-on-Hadoop variants
-
Updated
Nov 16, 2017 - Shell
Toy Hadoop cluster combining various SQL-on-Hadoop variants
Hadoop3.2 single/cluster mode with web terminal gotty, spark, jupyter pyspark, hive, eco etc.
A storage reference to a comprehensive guide on installing Hadoop on Windows
This repository contains the H1B_Visa Applicants Data Analysis project/case study using Hadoop undertaken during the training at NIIT. MapReduce,Hive,Pig,Scoop and Shell-scripting are the technologies used.
🐘Yet another Hadoop playground
EMR 5.25.0 cluster single node Hadoop docker image. With Amazon Linux, Hadoop 2.8.5 and Hive 2.3.5
Sorting of large dataset files(80GB) using Hadoop(Mapreduce) techniques and Apache Spark in Java and scheduled job on the virtual cluster(using 4 nodes) using a SLURM scheduler with bash scripting
Scaffolding for Map/Reduce applications, leveraging Apache Hadoop.
Virtual Machine with Hadoop environment setup and ready to run map-reduce applications
Apache Hadoop docker image | Running Python MapReduce
Hadoop Hive practice
My Apache Hadoop 3 config files.
Hadoop cluster on Docker (single host)
DCL-700: Big Data Essentials
This repository contains my personal project to generate mapreduce using apache hadoop
Add a description, image, and links to the hadoop-mapreduce topic page so that developers can more easily learn about it.
To associate your repository with the hadoop-mapreduce topic, visit your repo's landing page and select "manage topics."