# hadoop-cluster

Here are 146 public repositories matching this topic...

akshayavb99 / Ansible-Examples

The repository contains all the Playbooks and other files used to work with different applications for Ansible

docker ansible webserver ansible-playbooks yum hadoop-cluster explanation webservers loadbalancer dynamic-inventory-aws webserver-setup rhel8 linux-scripting

Updated Apr 4, 2023
Python

comoyi / docker-hadoop-cluster

A docker hadoop cluster

docker hadoop hadoop-cluster

Updated Feb 24, 2018
Shell

aogunwoolu / Ethereum-analysis

ETH analysis using big data for the QMUL Big Data Processing module. Intended to promote analysis of data retrieved via big data processing

python big-data hadoop ethereum hadoop-cluster hadoop-filesystem hadoop-mapreduce mrjob big-data-analytics hadoop-hdfs mrjob-dataproc

Updated Dec 25, 2021
Jupyter Notebook

HeliaHashemipour / Hadoop-Spark

Third homework of CloudComputing - Fall 2022

spark hadoop collaborative-filtering hadoop-cluster als cosine-similarity spark-sql

Updated Feb 9, 2023
Jupyter Notebook

dhitaj / bdc-sapienza

Assignments of Big Data course during the Spring 2017 semester at Sapienza

java big-data hadoop hadoop-cluster hadoop-filesystem hadoop-mapreduce hadoop-hdfs

Updated Mar 8, 2018
Java

shreyasshivakumara / Reddit-Analysis-Large-Dataset-Scientific-Application

Architected and developed a horizontally scalable data processing solution for the Reddit dataset, with weak-scalability and strong-scalability tests demonstrated through computational analysis.

github reddit spark python3 master-slave data-analysis hadoop-cluster hadoop-mapreduce large-dataset hadoop-hdfs

Updated Jul 3, 2020
Jupyter Notebook

DanMolenhouse / Distributed-Systems-Project5-Hadoop-and-Spark

In this project, we used both Hadoop MapReduce and Spark for distributed computing. The first task was to perform a series of operations using Mapper and Reducer Java files run on a Hadoop server. The second task was to perform similar operations on Spark instead.

spark apache-spark hadoop hadoop-cluster mapreduce hadoop-mapreduce spark-cluster mapreduce-java hadoop-hdfs

Updated Oct 31, 2022
Java

vineetdcunha / Hadoop_Ecosystem

Processing and transforming data via Hadoop Ecosystem

python hive hadoop python-script hbase pyspark mahout pig hadoop-cluster hadoop-mapreduce hadoop-streaming hadoop-ecosystem hiveql multinode hadoop-hdfs hbase-standalone

Updated Nov 26, 2020
Python

deepakag5 / Cloud-Computing-AWS

Cloud Computing Tutorials for AWS

s3-bucket load-balancer vpc hadoop-cluster aws-rds disaster-recovery hadoop-streaming rds-database iam-users emr-cluster

Updated Nov 14, 2019
Python

uncleislearning / learning-Hadoop

Principles and hands-on practice of HDFS, MapReduce, Hive, and ZooKeeper

hadoop hadoop-cluster hadoop-filesystem hadoop-mapreduce hadoop-ecosystem

Updated Feb 15, 2018

jbw / hadoop-docker-cluster

Hadoop cluster on Docker (single host)

docker hadoop hadoop-cluster hadoop-mapreduce hadoop-docker

Updated Aug 3, 2018
Shell

silencebingo / hadoop-spark-cluster

A Hadoop and Spark Cluster on Docker

hadoop-cluster spark-cluster

Updated Apr 12, 2018
Shell

lk5164 / hadoop-cluster-setup

ubuntu hadoop-cluster fully-distributed

Updated Aug 25, 2019
Shell

shubhambhardwaj007 / Ansible-Hadoop-DataNode-Role

An Ansible role to configure and set up a Hadoop DataNode.

ansible big-data hadoop cluster ansible-role hadoop-cluster ansible-roles ansible-galaxy hadoop-hdfs hadoop-data-platform

Updated May 18, 2021
Jinja

sbathehwx / failhadoop

A framework for running various failure tests against a Hadoop cluster

Updated Dec 4, 2017
Python

huangyueranbbc / Hadoop_MapReduce_Yarn

yarn hadoop-cluster hadoop-mapreduce hadoop-yarn

Updated Apr 29, 2017
Java

kangli914 / hadoopwork

My work and notes covering the Hadoop and Spark ecosystem

spark bigdata hadoop-cluster

Updated May 20, 2019
Scala

davidleiti / Hadoop-Reversed-Index

Simple inverted indexing algorithm implemented with Hadoop

hadoop inverted-index hadoop-cluster hadoop-mapreduce hadoop-hdfs

Updated Jan 7, 2021
Java

mchien15 / nn-Docker-Hadoop-cluster

Docker Hadoop cluster with ecosystem

docker big-data hadoop-cluster

Updated Apr 2, 2024
Shell

rishabhindoria / Big-Data-Hadoop-Pig-Latin

Apache Pig Latin script to count letters in multiple input text files, using the HortonWorks Hadoop Sandbox or Google Cloud Platform

sandbox hadoop-cluster pig-latin hadoop-filesystem googlecloud hadoop-mapreduce hadoop-docker googlecloudplatform

Updated May 10, 2017
PigLatin
