This repository has been archived by the owner on Apr 12, 2018. It is now read-only.

(Deprecated) Hadoop and Spark CPU performance Benchmark

at15/hadoop-spark-perf

hadoop-spark-perf

Deprecated: this project is no longer maintained.

Hadoop and Spark CPU performance benchmark (run separately for each framework)

This benchmark focuses on collecting CPU performance data and on scaling down the input size while preserving the workload's characteristics. It does not try to monitor or compare overall performance.

Roadmap

Deployment

  • deploy Hadoop cluster #6
  • deploy Spark cluster #10

Control

  • may use Consul for configuration
  • start the cluster based on configuration; use a fixed-size cluster
  • attach perf to the process when a job starts
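
As a rough illustration of attaching perf to a running job, the sketch below builds a `perf record` command line for a given process ID. The function name, the default sampling frequency, and the output path are assumptions made for this example and are not part of this repository; how the PID of the Hadoop or Spark process is discovered is left out.

```python
from typing import List


def build_perf_record_command(pid: int, output_path: str, frequency_hz: int = 99) -> List[str]:
    """Build an argv list that attaches `perf record` to a running process.

    Hypothetical helper: the name and defaults are illustrative only.
    """
    if pid <= 0:
        raise ValueError("pid must be a positive integer")
    return [
        "perf", "record",
        "-F", str(frequency_hz),  # sampling frequency in Hz
        "-g",                     # also record call graphs
        "-p", str(pid),           # attach to an already running process
        "-o", output_path,        # file that receives the recorded samples
    ]
```

The resulting list could be passed to `subprocess.Popen` once the job's process exists; stopping perf when the job ends would need separate handling.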

Perf

  • use perf without agent
  • use perf with agent
  • convert perf data to a more common format
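
One possible reading of the conversion step is turning perf counter output into JSON. The sketch below assumes the input is the CSV text produced by `perf stat -x,`, where the first three fields are the counter value, its unit, and the event name. This is an assumption for illustration; the roadmap does not say which perf output or target format was intended, and the function name is hypothetical.

```python
import csv
import io
import json


def perf_stat_csv_to_json(text: str) -> str:
    """Convert `perf stat -x,` style CSV text into a JSON array of records."""
    records = []
    for row in csv.reader(io.StringIO(text)):
        # Skip blank lines and comment lines that start with "#".
        if len(row) < 3 or row[0].startswith("#"):
            continue
        raw_value, unit, event = row[0], row[1], row[2]
        try:
            value = float(raw_value)
        except ValueError:
            # perf prints markers such as "<not counted>" when no value exists.
            value = None
        records.append({"event": event, "value": value, "unit": unit or None})
    return json.dumps(records, indent=2)
```

Keeping only the first three fields ignores the run-time and metric columns, which a fuller converter would probably want to preserve.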

Monitor

  • Web UI
  • VM for collecting monitoring metrics #9
  • collect perf data and generate a report for each experiment
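
A per-experiment report could be as simple as a text table of counters. The sketch below assumes each experiment has already been reduced to `cycles` and `instructions` totals and derives instructions per cycle (IPC) from them. The function name, the experiment names, and the input layout are hypothetical and not taken from this repository.

```python
from typing import Mapping


def format_report(experiments: Mapping[str, Mapping[str, float]]) -> str:
    """Render one line per experiment with cycles, instructions, and IPC."""
    lines = [f"{'experiment':<20}{'cycles':>15}{'instructions':>15}{'IPC':>8}"]
    for name, counters in sorted(experiments.items()):
        cycles = counters.get("cycles", 0.0)
        instructions = counters.get("instructions", 0.0)
        # Instructions per cycle; reported as nan when no cycles were counted.
        ipc = instructions / cycles if cycles else float("nan")
        lines.append(f"{name:<20}{cycles:>15.0f}{instructions:>15.0f}{ipc:>8.2f}")
    return "\n".join(lines)


if __name__ == "__main__":
    # Made-up numbers, purely to show the layout.
    print(format_report({"wordcount-small": {"cycles": 2000, "instructions": 3000}}))
```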