Skip to content

Assignments for Big Data for Data Engineers specialization on Coursera by Yandex.

Notifications You must be signed in to change notification settings

LucasBoTang/Coursera_Big_Data_for_Data_Engineers

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Specialization: Big Data for Data Engineers

Big Data Essentials: HDFS, MapReduce and Spark RDD

Assignments:

  • Hadoop Streaming Assignment 0: Word Count
  • Hadoop Streaming Assignment 1: Words Rating
  • Hadoop Streaming Assignment 2: Stop Words
  • Spark Assignment 1: Pairs
  • Spark Assignment 2: Reconstructing the path
  • Real-World Applications: TF-IDF

Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames

Assignments:

  • Hive Assignment 1. DDL: Create Tables
  • Hive Assignment 2. DML: Find Most Popular Tags
  • Spark Assignment 1: Counting number of the mutual friends
  • Spark Assignment 2: Graph based Music Recommender

About

Assignments for Big Data for Data Engineers specialization on Coursera by Yandex.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages