Skip to content

gsastry/hadoop-mapreduce

 
 

Repository files navigation

FlowScheduler: A New Task Scheduling Algorithm for Hadoop/MapReduce
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

by: Girish Sastry

FlowScheduler is a new task scheduler for hadoop, optimal within an
additive constant. It consists primarily of two phases:

1) max-cover: a flow augmenting algorithm based of Fold-Fulkerson
2) bal-assign: a greedy task assignment algorithm that exploits data locality
    and network latency

About

Mirror of Apache Hadoop MapReduce

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Java 93.1%
  • JavaScript 3.8%
  • C++ 1.6%
  • Shell 0.7%
  • C 0.6%
  • Python 0.1%
  • Perl 0.1%