ansible playbook to deploy cloudera hadoop components to the cluster
-
Updated
Sep 8, 2018 - Shell
ansible playbook to deploy cloudera hadoop components to the cluster
Docker image for Cloudera Hadoop components (CDH5)
A quick and dirty CDH cluster skeleton using Docker for Testing
Getting Started with Hadoop and Big Data
💂♂️ Hadoop/MapReduce Streaming
Spark Benchmark suite to evaluate cluster configuration and compare the performance with other big data frameworks.
Otto-von-Guericke Universität Magdeburg - Big Data SoSe 2017
This is my final project for Data Engineer Expert course at Naya College.
The goal of this programming assignment is to compute the PageRanks of an input set of hyperlinked Wikipedia documents using Hadoop MapReduce. The PageRank score of a web page serves as an indicator of the importance of the page. Many web search engines (e.g., Google) use PageRank scores in some form to rank user-submitted queries. The goals of …
This project creates a small local Hadoop cluster using Cloudera CDH and CentOS.
This repository contains the TF-IDF score calculation for the documents in the Canterbury dataset for a user given search query
chatbot for hipchat (cloud or onpremise) that enables you to talk to your cloudera manager
This repository includes two versions of hadoop management tools
Navigator is a data service that prepares the content for travel agencies, ready for exploration in EWNS (East-West-North-South) direction and hence allows them to render content to the end-user based on their desire to travel.
Cloudera commands used for Big Data Analytics
Add a description, image, and links to the cloudera-hadoop topic page so that developers can more easily learn about it.
To associate your repository with the cloudera-hadoop topic, visit your repo's landing page and select "manage topics."