This repository was archived by the owner on Jul 11, 2025. It is now read-only.

ryqdev/COMP7305-Cluster-and-cloud-computing


COMP7305-Cluster-and-cloud-computing

Course project at HKU.

Term Project Part I: Construct a Hadoop‐Spark cluster using 4 machines

Task:

  • Hadoop (MapReduce): WordCount, TeraSort
  • Spark: Logistic regression (LR), TF-IDF, TeraSort
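
As a rough illustration of the first task, the sketch below mimics the map, shuffle, and reduce stages of WordCount in plain Python. It is not the Hadoop job used in the project: the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) are hypothetical, tokenization is assumed to be whitespace splitting with lowercasing, and everything runs in a single process rather than on a cluster.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Mapper: emit a (word, 1) pair for every whitespace-separated token.

    Lowercasing is an assumption made for this sketch.
    """
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle: group emitted values by key, as a framework would do
    between the map and reduce stages."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reducer: sum the counts collected for one word."""
    return key, sum(values)


def word_count(lines):
    """Run the three stages in-process over an iterable of text lines."""
    mapped = chain.from_iterable(map_phase(line) for line in lines)
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    print(word_count(["a b a", "B c"]))
```

On a real cluster the mapper and reducer would run on different machines and the framework would handle the shuffle; the sketch only shows how the data flows between the stages.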

Submission:

  • one group report (in PowerPoint format)

  • Two demo videos: (1) optimized Hadoop TeraSort (10 GB) and (2) optimized Spark TF-IDF results

  • Meeting records and peer review (give each groupmate a mark from -5 to +5)

Deadline:

11:55pm, Oct 24 (Sunday), 2021
