Distributed File System (DFS) — Simulation

A pure Python simulation of a distributed file system built around a leader-follower architecture with 3 nodes. No external libraries required — just Python 3.10+.

Concepts Demonstrated

Concept	Description
Write-Ahead Log (WAL)	Every write is logged before being applied, enabling crash recovery and follower catch-up
Leader-Follower Replication	Only the leader accepts writes; followers replicate via WAL replay
Network Partitions	Simulates split-brain scenarios and healing (CAP theorem in practice)
Conflict Resolution	Last-write-wins using logical version clocks, not wall clocks
Delta Sync	Block-level diffing (rsync-style) to minimize bandwidth on large file updates
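
The WAL and replication rows above can be illustrated with a short sketch. This is not the code in wal.py or node.py. The names `LogEntry`, `SketchNode`, `apply_pending`, and `replicate_from` are hypothetical, and the log lives in memory, so the "crash" is only simulated. It shows the ordering that matters: append to the log first, apply to storage second, and recover or catch up by replaying unapplied entries.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class LogEntry:
    index: int      # position in the log, starting at 1
    path: str       # file being written
    data: bytes     # new file contents


@dataclass
class SketchNode:
    """Hypothetical node: an append-only log plus an in-memory file store."""
    log: list[LogEntry] = field(default_factory=list)
    files: dict[str, bytes] = field(default_factory=dict)
    applied: int = 0  # index of the last log entry applied to `files`

    def write(self, path: str, data: bytes) -> LogEntry:
        # Log first, apply second: the entry exists even if we crash in between.
        entry = LogEntry(len(self.log) + 1, path, data)
        self.log.append(entry)
        self.apply_pending()
        return entry

    def apply_pending(self) -> None:
        # Apply every logged entry that has not reached the store yet.
        for entry in self.log[self.applied:]:
            self.files[entry.path] = entry.data
            self.applied = entry.index

    def entries_after(self, index: int) -> list[LogEntry]:
        # Entries with a log index greater than `index`.
        return self.log[index:]

    def replicate_from(self, leader: "SketchNode") -> None:
        # Follower catch-up: copy the entries we are missing, then replay them.
        self.log.extend(leader.entries_after(len(self.log)))
        self.apply_pending()


leader, follower = SketchNode(), SketchNode()
leader.write("/a.txt", b"v1")
leader.write("/a.txt", b"v2")
follower.replicate_from(leader)

# Simulate a crash between logging and applying, then recover by replay.
crashed = SketchNode()
crashed.log.append(LogEntry(1, "/b.txt", b"pending"))
crashed.apply_pending()
```

Because replay only touches entries past `applied`, calling `apply_pending` or `replicate_from` repeatedly is harmless in this sketch.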

Project Structure

dfs/
├── main.py        # Entry point — runs all five scenarios
├── simulator.py   # Five failure/recovery scenario definitions
├── node.py        # Node class handling both leader and follower roles
├── storage.py     # In-memory file storage layer
├── wal.py         # Write-Ahead Log implementation
├── network.py     # Network simulation (latency, partitions, healing)
└── sync.py        # Replication engine and consistency reporting
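
As a rough illustration of what a partition-aware network layer might track, here is a minimal sketch. It is not the contents of network.py. The class `SketchNetwork` and its methods are hypothetical, and latency is left out. It models a partition as a set of blocked node pairs that `heal` clears.

```python
class SketchNetwork:
    """Hypothetical network: tracks which pairs of nodes cannot reach each other."""

    def __init__(self) -> None:
        self.blocked: set[frozenset[str]] = set()

    def partition(self, group_a: set[str], group_b: set[str]) -> None:
        # Cut every link that crosses between the two groups.
        for a in group_a:
            for b in group_b:
                self.blocked.add(frozenset((a, b)))

    def heal(self) -> None:
        # Restore all links.
        self.blocked.clear()

    def can_reach(self, src: str, dst: str) -> bool:
        return frozenset((src, dst)) not in self.blocked


net = SketchNetwork()
net.partition({"node-1"}, {"node-2", "node-3"})
isolated = net.can_reach("node-1", "node-2")   # link crosses the partition
same_side = net.can_reach("node-2", "node-3")  # both nodes on one side
net.heal()
healed = net.can_reach("node-1", "node-2")
```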

Running

python main.py

This runs all five scenarios end to end and prints a summary of results.

Scenarios

Basic replication — leader writes propagate to all followers
Crash recovery — node crashes mid-write, replays WAL on restart
Network partition — nodes diverge during a split, reconcile after healing
Conflict resolution — concurrent writes resolved via last-write-wins
Delta sync — large file updated with only changed blocks transmitted
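
The conflict resolution scenario relies on last-write-wins with logical version clocks. A minimal sketch of that rule follows, under stated assumptions: `Versioned` and `resolve` are hypothetical names, and the tie-break on `node_id` is an assumption added here so the result is deterministic when two nodes reach the same version. The project may break ties differently.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Versioned:
    version: int   # logical counter, bumped on every write
    node_id: str   # assumed tie-breaker when versions are equal
    data: bytes


def resolve(a: Versioned, b: Versioned) -> Versioned:
    """Return the winner: highest version, then highest node_id."""
    return max(a, b, key=lambda v: (v.version, v.node_id))


# Two sides of a partition both update the same file.
side_a = Versioned(version=4, node_id="node-1", data=b"from A")
side_b = Versioned(version=5, node_id="node-2", data=b"from B")
winner = resolve(side_a, side_b)

# Equal versions fall back to the node_id comparison.
tie = resolve(
    Versioned(version=7, node_id="node-1", data=b"x"),
    Versioned(version=7, node_id="node-3", data=b"y"),
)
```

No wall-clock timestamp appears anywhere in the comparison, so clock skew between nodes cannot change the outcome.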

Key Takeaways

WAL-first writes make crash recovery possible: any write that reached the log can be replayed on restart
Network partitions are inevitable — design for them explicitly
Logical clocks beat wall clocks for ordering concurrent events
Delta sync reduces bandwidth for large file updates by transmitting only the changed blocks
Crash ≠ partition: crashed nodes must replay WAL on restart, not just reconnect
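
To make the delta sync takeaway concrete, here is a simplified sketch of block-level diffing. It compares blocks at fixed offsets only, which is an assumption made for brevity. Real rsync uses a rolling checksum to find matching blocks at any offset, and sync.py may do something else again. The names `block_hashes`, `make_delta`, `apply_delta`, and `BLOCK_SIZE` are hypothetical.

```python
import hashlib

BLOCK_SIZE = 4  # tiny on purpose so the example is easy to follow


def block_hashes(data: bytes, size: int = BLOCK_SIZE) -> list[str]:
    """SHA-256 digest of each fixed-size block."""
    return [hashlib.sha256(data[i:i + size]).hexdigest()
            for i in range(0, len(data), size)]


def make_delta(old: bytes, new: bytes, size: int = BLOCK_SIZE) -> dict[int, bytes]:
    """Blocks of `new` whose hash differs from the same-position block of `old`."""
    old_hashes = block_hashes(old, size)
    delta: dict[int, bytes] = {}
    for n, i in enumerate(range(0, len(new), size)):
        block = new[i:i + size]
        if n >= len(old_hashes) or hashlib.sha256(block).hexdigest() != old_hashes[n]:
            delta[n] = block
    return delta


def apply_delta(old: bytes, delta: dict[int, bytes], new_len: int,
                size: int = BLOCK_SIZE) -> bytes:
    """Rebuild the new file from the old copy plus the changed blocks."""
    count = (new_len + size - 1) // size
    blocks = [delta.get(n, old[n * size:(n + 1) * size]) for n in range(count)]
    return b"".join(blocks)[:new_len]


old = b"aaaabbbbccccdddd"
new = b"aaaaBBBBccccdddd"
delta = make_delta(old, new)          # only block 1 changed
rebuilt = apply_delta(old, delta, len(new))
```

Here one 4-byte block is sent instead of the full 16 bytes. With fixed offsets, inserting a single byte near the start of a file shifts every later block and defeats the saving, which is the case a rolling checksum is designed to handle.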
