#

hadoop-mapreduce

Here are 20 public repositories matching this topic...

waltherg / distributable_docker_sql_on_hadoop

Toy Hadoop cluster combining various SQL-on-Hadoop variants

Updated Nov 16, 2017
Shell

hyeonsangjeon / dataplatform

Hadoop3.2 single/cluster mode with web terminal gotty, spark, jupyter pyspark, hive, eco etc.

hive hadoop hadoop-cluster hadoop-mapreduce hadoop-docker pyspark-notebook zeppelin-notebook hadoop-ecosystem

Updated Nov 7, 2019
Shell

Shwetabhdixit / Hadoop-2.7.3-Installation-Guide-for_windows

A storage reference to a comprehensive guide on installing Hadoop on Windows

hadoop-cluster hadoop-mapreduce hadoop-framework

Updated Jun 11, 2018
Shell

NikhilURao / H1B_VisaProject

This repository contains the H1B_Visa Applicants Data Analysis project/case study using Hadoop undertaken during the training at NIIT. MapReduce,Hive,Pig,Scoop and Shell-scripting are the technologies used.

mysql hadoop bigdata apache shell-script sqoop hadoop-filesystem hadoop-mapreduce apache-pig apache-hive

Updated Jun 26, 2019
Shell

arkady-emelyanov / hadoop-playground

🐘Yet another Hadoop playground

hadoop hadoop-mapreduce yarn-hadoop-cluster hadoop-hdfs

Updated May 28, 2018
Shell

alex-ber / docker-hive

EMR 5.25.0 cluster single node Hadoop docker image. With Amazon Linux, Hadoop 2.8.5 and Hive 2.3.5

Updated Jan 6, 2020
Shell

Sabareesh19 / Sort-on-Hadoop-Spark

Sorting of large dataset files(80GB) using Hadoop(Mapreduce) techniques and Apache Spark in Java and scheduled job on the virtual cluster(using 4 nodes) using a SLURM scheduler with bash scripting

java linux spark virtual-machine slurm-job bash-script rdd hadoop-mapreduce virtual-clusters

Updated May 4, 2018
Shell

gmarciani / mapreduce-app

Scaffolding for Map/Reduce applications, leveraging Apache Hadoop.

bigdata mapreduce scaffolding batch-processing hadoop-mapreduce

Updated Jun 19, 2017
Shell

lucasmior / hadoop-vm

Virtual Machine with Hadoop environment setup and ready to run map-reduce applications

vagrant hadoop vagrant-environments hadoop-mapreduce hadoop-hdfs

Updated Jan 22, 2020
Shell

arminZolfaghari / docker-hadoop

Apache Hadoop docker image | Running Python MapReduce

hadoop hadoop-mapreduce docker-hadoop hadoop-hdfs mapreduce-python

Updated May 28, 2023
Shell

darule0 / yarndiff

A rudimentary command line utility for contrasting Apache Yarn container logs.

diff spark yarn hive hadoop log4j pig mapreduce diffing difference hadoop-mapreduce yarn2

Updated Jan 8, 2024
Shell

kenten132 / hadoop-Sandbox

Testing and learning Hadoop.

java hadoop-mapreduce

Updated Jun 15, 2017
Shell

lhuaquisto / hadoop-multicluster

spark ubuntu virtual-machine hadoop-mapreduce

Updated Aug 27, 2019
Shell

s-evsyukov / hadoop_hive

Hadoop Hive practice

yarn aws-s3 hadoop-mapreduce hadoop-hdfs hadoop-hive hive-sql

Updated Aug 9, 2022
Shell

aaa121 / Big-Data-Analytics

python r scala sql hive pyspark pig sparkr hadoop-filesystem hadoop-mapreduce

Updated Jul 22, 2017
Shell

cevheri / hadoop.3-config

My Apache Hadoop 3 config files.

hadoop hadoop-filesystem hadoop-mapreduce hadoop-conf hadoop-hdfs hadoop-core linux-bash pom-xml

Updated Jan 30, 2021
Shell

jbw / hadoop-docker-cluster

Hadoop cluster on Docker (single host)

docker hadoop hadoop-cluster hadoop-mapreduce hadoop-docker

Updated Aug 3, 2018
Shell

deepcloudlabs / big-data-essentials

DCL-700: Big Data Essentials

machine-learning spark spark-streaming hdfs hadoop-mapreduce spark-sql spark-ml hadoop-3

Updated Jun 10, 2020
Shell

Bayunova28 / Spotify_Lyrics

This repository contains my personal project to generate mapreduce using apache hadoop

spotify apache-derby hadoop-mapreduce apache-hive apache-hadoop mapreduce-python

Updated Dec 31, 2022
Shell

xuyinhao / lgpbenchmark

公司内部对接Hadoop 基本正确性测试

hadoop hadoop-mapreduce hadoop-hdfs

Updated Jun 23, 2020
Shell

Improve this page

Add a description, image, and links to the hadoop-mapreduce topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the hadoop-mapreduce topic, visit your repo's landing page and select "manage topics."