hailinkim/CS273_FinalProject

Parallel Bellman-Ford Algorithm

This document compares and contrasts three parallel versions of the Bellman-Ford algorithm, highlighting key implementation decisions and performance observations.

What is Bellman-Ford?

The Bellman-Ford algorithm is a graph search algorithm that calculates the shortest paths from a single source vertex to all other vertices in a weighted graph. It is capable of handling graphs with negative weight edges, distinguishing it from algorithms like Dijkstra's, which cannot properly handle negative weights. By iteratively relaxing the edges of the graph, the Bellman-Ford algorithm efficiently updates the shortest path estimates until it achieves the final shortest path values or detects a negative weight cycle.
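For reference, the sequential algorithm described above can be sketched as follows (an illustrative minimal version, not this project's code; edges are stored as {from, to, weight} triples):

```java
import java.util.Arrays;

public class BellmanFordSeq {
    // A large "infinity" that survives additions without overflowing.
    static final int INF = Integer.MAX_VALUE / 2;

    // Returns the array of shortest-path costs from source, or null if a
    // negative-weight cycle is reachable.
    public static int[] shortestPaths(int n, int[][] edges, int source) {
        int[] dist = new int[n];
        Arrays.fill(dist, INF);
        dist[source] = 0;

        // Relax every edge n-1 times; after pass i, every shortest path
        // that uses at most i edges is final.
        for (int i = 0; i < n - 1; i++) {
            for (int[] e : edges) {
                if (dist[e[0]] + e[2] < dist[e[1]]) {
                    dist[e[1]] = dist[e[0]] + e[2];
                }
            }
        }
        // One extra pass: any further improvement implies a negative cycle.
        for (int[] e : edges) {
            if (dist[e[0]] + e[2] < dist[e[1]]) return null;
        }
        return dist;
    }
}
```

The inner loop over edges is the part that the approaches below parallelize; the outer loop of n-1 passes stays sequential.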

Framework for Parallelization

  • Parallelizable Tasks: The tasks involving the examination of outgoing neighbors for each vertex emerged as prime candidates for parallel execution. By dividing the vertices among multiple threads, each thread could independently assess the neighbors of its allocated subset of vertices.

  • What can't be parallelized: The outer loop is inherently sequential: each relaxation round depends on the distance values produced by the previous round, so the rounds themselves cannot run concurrently. Preserving this ordering ensures the integrity of the shortest-path calculations.
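The division of vertices among threads described above might be computed with a small helper like this (a hypothetical utility, not taken from the repo):

```java
public class Partition {
    // Splits vertex indices [0, n) into `workers` contiguous ranges.
    // Each row is {start inclusive, end exclusive}; the last range may be shorter.
    public static int[][] ranges(int n, int workers) {
        int chunk = (n + workers - 1) / workers; // ceiling division
        int[][] out = new int[workers][2];
        for (int w = 0; w < workers; w++) {
            out[w][0] = Math.min(n, w * chunk);
            out[w][1] = Math.min(n, (w + 1) * chunk);
        }
        return out;
    }
}
```

Each worker then examines the outgoing neighbors of only the vertices in its own range, so no two workers scan the same vertex.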

Approach 1: Futures

  • Design: Utilized futures to synchronize tasks, with the outer loop executed sequentially.
  • Implementation: Tasks were submitted for each iteration of the outer loop, examining neighboring vertices of specific subsets.
  • Observation: We realized that futures were largely unnecessary, since the outer loop's structure already enforces sequential execution between iterations. This led to assigning subsets of vertices to tasks concurrently within the same iteration, eliminating the wait for individual task completion.
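The design above might look roughly like this sketch (the structure is assumed, not the repo's exact code; for brevity it partitions the edge list rather than per-vertex neighbor sets, but the shape is the same: a sequential outer loop that submits one task per chunk and waits on every Future before the next iteration):

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.*;

public class BellmanFordFutures {
    static final int INF = Integer.MAX_VALUE / 2; // "infinity" that survives additions

    public static int[] shortestPaths(int n, int[][] edges, int source, int nTasks) {
        int[] dist = new int[n];
        Arrays.fill(dist, INF);
        dist[source] = 0;
        ExecutorService pool = Executors.newFixedThreadPool(nTasks);
        int chunk = (edges.length + nTasks - 1) / nTasks;
        try {
            for (int i = 0; i < n - 1; i++) { // outer loop stays sequential
                List<Future<?>> pending = new ArrayList<>();
                for (int t = 0; t < nTasks; t++) {
                    final int lo = t * chunk, hi = Math.min(edges.length, lo + chunk);
                    pending.add(pool.submit(() -> {
                        for (int j = lo; j < hi; j++) {
                            int[] e = edges[j]; // {from, to, weight}
                            // Unsynchronized writes: a race within one iteration
                            // may briefly use a stale value, but the barrier below
                            // publishes all writes to the next iteration. A more
                            // careful version would double-buffer dist.
                            if (dist[e[0]] + e[2] < dist[e[1]]) dist[e[1]] = dist[e[0]] + e[2];
                        }
                    }));
                }
                for (Future<?> f : pending) { // wait for this iteration's tasks
                    try { f.get(); } catch (InterruptedException | ExecutionException ex) {
                        throw new RuntimeException(ex);
                    }
                }
            }
        } finally {
            pool.shutdown();
        }
        return dist;
    }
}
```

The `f.get()` loop is exactly the per-iteration barrier the observation refers to: the futures add little beyond what the sequential outer loop already guarantees.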

Approach 2: One-Thread-Per-Task

  • Design: Attempted parallel execution by creating a thread for each task.
  • Implementation: Threads were assigned tasks in the inner loop and joined after it, synchronizing each iteration by waiting for every thread to complete its task.
  • Challenges: Significant overhead from high thread count and synchronization, leading to the slowest performance among the attempted methods.
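A sketch of this approach under the same assumptions as before (edge-list chunks instead of per-vertex neighbor sets; not the repo's exact code). Note that fresh threads are created on every one of the n-1 iterations, which is the source of the overhead described above:

```java
import java.util.Arrays;

public class BellmanFordThreads {
    static final int INF = Integer.MAX_VALUE / 2;

    public static int[] shortestPaths(int n, int[][] edges, int source, int nThreads) {
        int[] dist = new int[n];
        Arrays.fill(dist, INF);
        dist[source] = 0;
        int chunk = (edges.length + nThreads - 1) / nThreads;
        for (int i = 0; i < n - 1; i++) {
            Thread[] workers = new Thread[nThreads];
            for (int t = 0; t < nThreads; t++) {
                final int lo = t * chunk, hi = Math.min(edges.length, lo + chunk);
                workers[t] = new Thread(() -> {
                    for (int j = lo; j < hi; j++) {
                        int[] e = edges[j]; // {from, to, weight}
                        if (dist[e[0]] + e[2] < dist[e[1]]) dist[e[1]] = dist[e[0]] + e[2];
                    }
                });
                workers[t].start(); // per-iteration thread creation: the main overhead
            }
            for (Thread w : workers) { // join() is the barrier before the next iteration
                try { w.join(); } catch (InterruptedException ex) { throw new RuntimeException(ex); }
            }
        }
        return dist;
    }
}
```

Compared with a pool, every iteration pays the full cost of thread construction, scheduling, and teardown, which dominates the actual relaxation work on all but very large chunks.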

Approach 3: Thread Pools

  • Design: Employed thread pools to match the number of available processors, optimizing task execution.
  • Implementation: Used ExecutorService for task management, assigning tasks to the thread pool for immediate execution.
  • Performance:
    • Smaller Graphs: Setting tasks equal to the number of available processors showed improved performance, avoiding unnecessary task fragmentation.
    • Larger Graphs: Allocating more than 116 tasks improved performance, suggesting that dividing the workload into many smaller tasks let threads that finished early pick up the remaining work.
  • Key Insight: Subdividing workloads and utilizing a flexible thread pool model adapted to graph size and processor availability significantly improved algorithm efficiency.
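The thread-pool design might be sketched as follows (assumed structure, same edge-chunk simplification as the earlier sketches): the pool is sized to the number of available processors, while the task count `nTasks` is chosen independently, which is the knob the performance observations above refer to.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.*;

public class BellmanFordPool {
    static final int INF = Integer.MAX_VALUE / 2;

    public static int[] shortestPaths(int n, int[][] edges, int source, int nTasks) {
        int[] dist = new int[n];
        Arrays.fill(dist, INF);
        dist[source] = 0;
        // Pool size follows the hardware; task count follows the workload.
        int procs = Runtime.getRuntime().availableProcessors();
        ExecutorService pool = Executors.newFixedThreadPool(procs);
        int chunk = (edges.length + nTasks - 1) / nTasks;
        try {
            for (int i = 0; i < n - 1; i++) {
                List<Callable<Void>> tasks = new ArrayList<>();
                for (int t = 0; t < nTasks; t++) {
                    final int lo = t * chunk, hi = Math.min(edges.length, lo + chunk);
                    tasks.add(() -> {
                        for (int j = lo; j < hi; j++) {
                            int[] e = edges[j]; // {from, to, weight}
                            if (dist[e[0]] + e[2] < dist[e[1]]) dist[e[1]] = dist[e[0]] + e[2];
                        }
                        return null;
                    });
                }
                try {
                    pool.invokeAll(tasks); // blocks until all tasks finish: the barrier
                } catch (InterruptedException ex) {
                    throw new RuntimeException(ex);
                }
            }
        } finally {
            pool.shutdown();
        }
        return dist;
    }
}
```

With `nTasks` greater than the pool size, a worker that finishes a small chunk immediately pulls the next one from the queue, which matches the observed benefit of fine-grained tasks on larger graphs.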

Experiment Results (Tested on the Amherst College High-Performance Computing Cluster Using 116 Cores)

[Figure: experiment results]

Runtime Comparison for Thread Pool Approach across Graph Sizes:

[Figure: runtime comparison for the thread-pool approach across graph sizes]
