Skip to content
A list of useful resources to learn Data Engineering from scratch
Branch: master
Clone or download
adilkhash Merge pull request #3 from arpit1997/de-cookbook
Add data engineering cookbook by Andreas Kretz
Latest commit 1fd028d Jun 21, 2019
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
README.md Changed to GitHub link Jun 21, 2019

README.md

How To Become a Data Engineer

Useful articles

Algorithms & Data Structures

SQL

Programming

Databases

Distributed Systems

Books

Courses

Blogs

  • Martin Kleppmann author of Designing Data-Intensive Application
  • BaseDS by Vaidehi Joshi about Distributed Systems

Tools

  • Apache Airflow is a platform to programmatically author, schedule and monitor workflows in Python
  • Apache Spark is a unified analytics engine for large-scale data processing
  • Apache Kafka is a distributed streaming platform
  • Luigi is a Python package that helps you build complex pipelines of batch jobs.

Cloud Platforms

Other

Newsletters & Digests

  • Data Eng Weekly - Your weekly Data Engineering news
  • SF Data Weekly - A weekly email of useful links for people interested in building data platforms
  • Data Elixir - Data Elixir is an email newsletter that keeps you on top of the tools and trends in Data Science.
You can’t perform that action at this time.