Skip to content
/ CS167 Public

Labs and other material for CS167

Notifications You must be signed in to change notification settings

aseldawy/CS167

Repository files navigation

CS167: Introduction to Big-data Management

This repository contains labs and materials for the CS167 "Introduction to Big-data Management" at UC Riverside.

General Instructions

Unless otherwise mentioned:

  1. Each student should attend their assigned session.
  2. Unless otherwise mentioned, each lab is to be completed individually.
  3. For each lab, there is one hour pre-lab work to be done at home before the lab starts. It is important to do this part on your own to be ready for the lab.
  4. If you have issues in the pre-lab part, try to resolve it asap with the TA and the instructor. If not resolved, please bring your questions to the TA during your lab session.
  5. You are expected to finish the in-lab work by the end of the lab session.
  6. Each lab is due by the end of the lab session.
  7. Late penalty will be applied if the lab is submitted after Friday 5:00 PM on the week when the lab is due.

Table of Contents

  • Remote Access: Setup remote access to your CS167 machine.
  • Lab 1: Development Setup for Java.
  • Lab 2: Functional Programming in Java.
  • Lab 3: Hadoop Distributed File System (HDFS).
  • Lab 4: Hadoop MapReduce.
  • Lab 5: Spark RDD.
  • Lab 6: Spark RDD and Spark SQL using Scala
  • Lab 7: Spark SQL and Parquet
  • Lab 8: MongoDB
  • Lab 9: Machine Learning with MLlib
  • Projects

Links

About

Labs and other material for CS167

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published