Skip to content

jpgard/distributed-machine-learning-spark-MOOC

Repository files navigation

Distributed Machine Learning with Apache Spark

Course files for UC Berkeley CS120x course on edX, Jul/Aug 2016.

The goal of this course is to learn the underlying principles required to develop scalable machine learning pipelines and gain hands-on experience using Apache Spark.

The code in this repository represents a sample of the work I completed in the course, and includes course assignments as I submitted them, as well as others I worked on after the completion of the course.

Note that substantial portions of the code for this assignment were written by the course instructors, and the assignments consisted of filling in missing components of skeleton code--these notebooks were not written by me from scratch. Because this was a MOOC and I was utilizing the assignments as a learning opportunity to supplement my coursework, I also extensively utilized outside resources to complete these assignments, including the resources listed below:

About

Course files for UC Berkeley CS120x course on edX, Jul/Aug 2016

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published