- little-book-of-pipelines – Scala-based guide to building robust data pipelines
- video-game-training-sql – SQL training series with realistic example data and queries
- microbatch-hourly-deduped-tutorial – Python tutorial on microbatch deduplication patterns
- personal-health-analytics – Analyze health vs social media usage trends
- DataExpert-io/data-engineer-handbook – The popular, community-curated Data Engineering Handbook (34.8K⭐, 6.7K forks)
- Data Engineer Handbook – A complete roadmap with interviews, projects, books, and community recommendations
- Bootcamp (Live) – Past cohorts include May 2024; new cohort planned for January 2025
- Online / Free Bootcamp – Community-supported cohort with shared projects and peer reviews
- 📈 Career trajectory: Analyst → Data Engineer at Teradata → FAANG → Netflix → Airbnb (earning $500–600K by 2023)
- ✍️ Popular philosophy posts:
- "You’re a data engineer, you don’t ‘use Spark and Kafka’ … focus on business outcomes"
- "Data engineering isn’t just Spark … stop boiling roles down to one technology"
- 🗺️ Learning roadmap: SQL → Data modeling → Python → Data quality → Distributed compute → Orchestration → Big data tools
Zach Wilson (“eczachly”) is a high-impact data engineer, educator, and open-source contributor. His work includes practical example repositories, the most-starred Data Engineering Handbook on GitHub, live bootcamp cohorts, and widely-shared industry insights. His guidance—from strategic mindset to detailed technical roadmaps—makes him one of the most trusted voices in the data engineering community.
Last updated: July 2025