Hadoop-Pig

• Objective: To determine which characters occur how many times in a dataset of textfiles (para1.txt to para6.txt) and performing big data analysis.

• Created a script countChar.pig which automatically maps SQL-like user commands to multiple mappers and reducers in the background which can be executed all in parallel to handle big data, thus listing character count for each alphabet in the dataset.

• Created a script popularFlavor.pig which used two text files purchases.txt (which contains all the purchases made by kids over time) and kids.txt (which contains the count of purchases made by each individual kid) to come up with the answer for the most popular flavor amongst the kids (thus analyzing big data)

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
countChars.pig		countChars.pig
countChars_Filter(VowelOnly).pig		countChars_Filter(VowelOnly).pig
kids.txt		kids.txt
para1.txt		para1.txt
para2.txt		para2.txt
para3.txt		para3.txt
para4.txt		para4.txt
para5.txt		para5.txt
para6.txt		para6.txt
popularFlavor.pig		popularFlavor.pig
purchases.txt		purchases.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

countChars.pig

countChars.pig

countChars_Filter(VowelOnly).pig

countChars_Filter(VowelOnly).pig

kids.txt

kids.txt

para1.txt

para1.txt

para2.txt

para2.txt

para3.txt

para3.txt

para4.txt

para4.txt

para5.txt

para5.txt

para6.txt

para6.txt

popularFlavor.pig

popularFlavor.pig

purchases.txt

purchases.txt

Repository files navigation

Hadoop-Pig

About

Releases

Packages

Languages

rishabhindoria/Big-Data-Hadoop-Pig-Latin

Folders and files

Latest commit

History

Repository files navigation

Hadoop-Pig

About

Topics

Resources

Stars

Watchers

Forks

Languages