Skip to content

Big Data, MapReduce, Spark, PySpark, Java @ Santa Clara University, SPRING 2017

Notifications You must be signed in to change notification settings

MoustafaAMahmoud/big-data-mapreduce-course

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Course Information

Exam Dates

  • Midterm Exam: May (To-Be-Determined-Later), 2017 from 5:45pm to 7:00pm PST
  • Final Exam: Thursday, June 15, 2017 from 5:45pm-7:45pm PST

Course Description

The main focus of this class is to cover the following concepts:

  • Concepts of Big Data
  • Distributed File Systems
  • Distributed Computing
  • Distributed and Parallel Algorithms
  • MapReduce Paradigm
  • Scale-out Architectures (using Hadoop, Spark, PySpark)
  • Apache Spark: http://spark.apache.org/
  • Use Spark, Py-Spark, Hadoop, and Java to teach MapReduce and distributed computing

My latest book:

Data Algorithms: Recipes for Scaling up with Hadoop and Spark

Data Algorithms Book

About

Big Data, MapReduce, Spark, PySpark, Java @ Santa Clara University, SPRING 2017

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • HTML 85.6%
  • Shell 5.9%
  • Java 5.8%
  • Batchfile 1.6%
  • Python 0.6%
  • XSLT 0.3%
  • TeX 0.2%