premmi/MapReduceAlgorithms

Python MapReduce Framework

This project implements MapReduce algorithms for a variety of common data processing tasks. (The MapReduce programming model enables processing very large datasets in parallel.) The algorithms are:

  1. Creates an inverted index (dictionary where each word is associated with a list of document identifiers in which that word appears) for a given set of documents.

  2. Implements a relational join as a MapReduce query.

  3. Implements a MapReduce algorithm that derives the number of friends each person has from a simple social network dataset consisting of key-value pairs where each key is a person and each value is a friend of that person.

  4. Implements a MapReduce algorithm that checks whether a friend relationship is symmetric (if A is a friend of B, then B is a friend of A) and generates a list of all non-symmetric friend relationships.

  5. Designs a MapReduce algorithm to compute matrix multiplication: A * B, where A, B are two matrices in sparse matrix format.
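
The five tasks above can be sketched with a small in-memory driver. This is an illustrative sketch only, not the code in this repository: `run_mapreduce`, every mapper and reducer name, the record layouts, and the `"left"`/`"right"` and `"a"`/`"b"` tags are assumptions made for the example, and everything runs in a single process rather than in parallel.

```python
from collections import defaultdict

def run_mapreduce(records, mapper, reducer):
    """Hypothetical single-process driver: map, group by key, then reduce."""
    groups = defaultdict(list)
    for record in records:
        for key, value in mapper(record):      # map phase
            groups[key].append(value)          # shuffle: group values by key
    results = []
    for key in sorted(groups):                 # reduce phase, keys in sorted order
        results.extend(reducer(key, groups[key]))
    return results

# 1. Inverted index. Assumed record layout: (doc_id, text).
def index_mapper(record):
    doc_id, text = record
    for word in text.split():
        yield word, doc_id

def index_reducer(word, doc_ids):
    yield word, sorted(set(doc_ids))           # each document listed once

# 2. Relational join. Assumed record layout: (table, join_key, payload).
def join_mapper(record):
    table, key, payload = record
    yield key, (table, payload)

def join_reducer(key, tagged):
    left = [p for t, p in tagged if t == "left"]
    right = [p for t, p in tagged if t == "right"]
    for l in left:                             # pair every left row with every right row
        for r in right:
            yield key, l, r

# 3. Friend count. Assumed record layout: (person, friend).
def count_mapper(record):
    person, _friend = record
    yield person, 1

def count_reducer(person, ones):
    yield person, sum(ones)

# 4. Non-symmetric friendships: key on the unordered pair so both directions meet.
def asym_mapper(record):
    person, friend = record
    yield tuple(sorted((person, friend))), (person, friend)

def asym_reducer(_pair, edges):
    directions = set(edges)
    if len(directions) == 1:                   # only one direction was recorded
        yield next(iter(directions))

# 5. Sparse matrix product A * B. Assumed record layout: (matrix, row, col, value).
def make_matmul_mapper(a_rows, b_cols):
    def mapper(record):
        matrix, row, col, value = record
        if matrix == "a":                      # A[row, col] feeds every cell in that row
            for k in range(b_cols):
                yield (row, k), ("a", col, value)
        else:                                  # B[row, col] feeds every cell in that column
            for i in range(a_rows):
                yield (i, col), ("b", row, value)
    return mapper

def matmul_reducer(cell, entries):
    a = {idx: v for m, idx, v in entries if m == "a"}
    b = {idx: v for m, idx, v in entries if m == "b"}
    total = sum(v * b[idx] for idx, v in a.items() if idx in b)
    if total != 0:                             # keep the output sparse
        yield cell, total
```

Each task is then a single call such as `run_mapreduce(friends, count_mapper, count_reducer)`. The matrix mapper is built by a factory because a one-pass multiplication needs the output dimensions up front. Task 4 here reports only the direction that was actually recorded; reporting both directions would be an equally reasonable choice.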
