Skip to content
This repository has been archived by the owner on Dec 10, 2018. It is now read-only.

mongodb-labs/big-data-exploration

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

77 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This Repository is NOT a supported MongoDB product

MongoDB Big-Data-Exploration Project

This project seeks to discover, investigate, and solve big data-set questions while utilizing MongoDB for storage and computations. This summer internship project also shows how to answer questions concerning big datasets stored in MongoDB using MongoDB's frameworks and connector. Both the MongoDB native aggregation framework and hadoop were utilized to explore the data.

The data for this project comes from two major sources:

Roadmap

This project can be divided into three sections, each with in-depth wiki pages describing our steps and observation:

  • Basic-Flights - Basic analysis on the Flights dataset using MongoDB Aggregation Framework
  • PageRank-Flights - Computing PageRank over the Flights dataset using the MongoDB MapReduce Framework
  • Twitter-Memes - Computing PageRank over the Twitter-Memes dataset using Hadoop and associated frameworks/languages (like Apache Pig, Amazon EMR)

Contributors

About

[Archive] Intern project - Big Data Exploration using MongoDB - This Repository is NOT a supported MongoDB product

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published