Skip to content

COM6012 Scalable Machine Learning - University of Sheffield

Notifications You must be signed in to change notification settings

tvamsisai/ScalableML

 
 

Repository files navigation

COM6012 Scalable Machine Learning - University of Sheffield

Spring 2019 by Haiping Lu for the first half (five sessions)

  • Session 1: Introduction to Spark and ShARC (HPC)
  • Session 2: RDD, DataFrame, ML pipeline, & parallelization
  • Session 3: Scalable matrix factorisation for collaborative filtering recommender systems
  • Session 4: Scalable K-means clustering
  • Session 5: Scalable PCA for dimensionality reduction (and data types in Spark)

The second half will be taught by Mauricio A Álvarez.

Acknowledgement

The materials are built with references to the following sources:

Many thanks to

  • Mike Croucher, Neil Lawrence, Will Furnass, and Twin Karmakharm for their inputs and inspirations.
  • Mauricio A Álvarez for jointly working on this course since we joined Sheffield together.
  • Our teaching assistants (demonstrators) and students who have contributed in various ways.

About

COM6012 Scalable Machine Learning - University of Sheffield

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 99.1%
  • Other 0.9%