hadoop-cluster

Here are 3 public repositories matching this topic...

This repository contains analysis work I did on the MovieLens dataset using the big data tools Pig and Hive alongside the Hadoop infrastructure

Apache Pig Latin script to count letters in multiple input text files, using the HortonWorks Hadoop Sandbox or Google Cloud Platform

Big Data Analysis of datasets for taking into account the character occurrences.

Add a description, image, and links to the hadoop-cluster topic page so that developers can more easily learn about it.

To associate your repository with the hadoop-cluster topic, visit your repo's landing page and select "manage topics."