This repository contains analysis work I did on the MovieLens dataset using the big data tools Pig and Hive alongside the Hadoop infrastructure
-
Updated
Jan 10, 2021 - PigLatin
This repository contains analysis work I did on the MovieLens dataset using the big data tools Pig and Hive alongside the Hadoop infrastructure
Apache Pig Latin script to count letters in multiple input text files, using the HortonWorks Hadoop Sandbox or Google Cloud Platform
Big Data Analysis of datasets for taking into account the character occurrences.
Add a description, image, and links to the hadoop-cluster topic page so that developers can more easily learn about it.
To associate your repository with the hadoop-cluster topic, visit your repo's landing page and select "manage topics."