Mahsamehr/data-engineering-zoomcamp

My Data Engineering Journey

Welcome to my personal data engineering portfolio! This repo contains my notes, homework, and projects as I work through the DataTalksClub Data Engineering Zoomcamp.

📚 Week-by-Week Breakdown:

🧱 Week 1: Containerization & Infrastructure Setup

  • Introduction to Google Cloud Platform (GCP)
  • Working with Docker and Docker Compose
  • Running PostgreSQL in Docker containers
  • Managing infrastructure using Terraform

πŸ” Week 2: Orchestrating Workflows

  • Understanding Data Lakes and orchestration concepts
  • Building pipelines with Kestra
  • Exploring task scheduling and dependency management
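To make the dependency-management idea concrete, here is a minimal sketch in plain Python. It is not Kestra's API: the task names are hypothetical, and the ordering comes from the standard library's `graphlib` module. An orchestrator does something similar when it decides which task may run next.

```python
from graphlib import TopologicalSorter

# Hypothetical pipeline: each task maps to the set of tasks it depends on.
dependencies = {
    "extract": set(),
    "load_to_lake": {"extract"},
    "load_to_warehouse": {"load_to_lake"},
    "build_report": {"load_to_warehouse"},
}


def execution_order(deps):
    """Return tasks in an order where every task comes after its dependencies."""
    return list(TopologicalSorter(deps).static_order())


order = execution_order(dependencies)
print(order)
```

If the dependency graph contains a cycle, `static_order()` raises `graphlib.CycleError`, which is roughly the situation an orchestrator rejects when a flow depends on itself.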

βš™οΈ Workshop 1: Data Ingestion Techniques

  • Reading and ingesting data from APIs
  • Building scalable pipelines
  • Implementing data normalization and incremental loading
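As a rough illustration of the last point, the sketch below flattens nested records (one simple form of normalization) and loads only rows newer than a stored cursor. It is plain Python rather than any particular ingestion library, and the field names (`updated_at`, `user`, `geo`) and the sample data are assumptions made up for the example.

```python
def flatten(record, parent_key="", sep="__"):
    """Flatten nested dicts into a single-level dict with joined key names."""
    flat = {}
    for key, value in record.items():
        full_key = f"{parent_key}{sep}{key}" if parent_key else key
        if isinstance(value, dict):
            flat.update(flatten(value, full_key, sep))
        else:
            flat[full_key] = value
    return flat


def load_incrementally(records, last_seen, cursor_field="updated_at"):
    """Keep only records newer than the cursor; return them and the new cursor."""
    new_rows = [flatten(r) for r in records if r[cursor_field] > last_seen]
    new_cursor = max((r[cursor_field] for r in new_rows), default=last_seen)
    return new_rows, new_cursor


# Hypothetical API page; ISO-formatted dates compare correctly as strings.
api_page = [
    {"id": 1, "updated_at": "2024-01-01", "user": {"name": "a", "geo": {"city": "x"}}},
    {"id": 2, "updated_at": "2024-01-03", "user": {"name": "b", "geo": {"city": "y"}}},
]

rows, cursor = load_incrementally(api_page, last_seen="2024-01-01")
print(rows, cursor)
```

Persisting `cursor` between runs is what makes the load incremental: the next run passes it back as `last_seen` and skips everything already ingested.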

🏢 Week 3: Data Warehousing Essentials

  • Overview of Google BigQuery
  • Implementing table partitioning and clustering
  • Learning optimization best practices
  • Intro to ML features in BigQuery

πŸ› οΈ Week 4: Analytics Engineering

  • Building models using dbt (data build tool)
  • Testing, documenting, and deploying transformations
  • Creating dashboards with Metabase

⏱️ Week 5: Batch Processing Fundamentals

  • Introduction to Apache Spark
  • Working with DataFrames and Spark SQL
  • Exploring how groupBy and join operations work under the hood
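The groupBy point can be pictured with a simplified model in plain Python. This is not Spark code and leaves out much of what Spark really does; it only mimics the general shape of a two-stage aggregation: each partition is pre-aggregated locally, the partial results are shuffled by key to reducers, and each reducer merges what it receives. All function names here are hypothetical.

```python
from collections import defaultdict


def partial_aggregate(pairs):
    """Sum values per key within one partition (or one reducer bucket)."""
    sums = defaultdict(int)
    for key, value in pairs:
        sums[key] += value
    return dict(sums)


def shuffle(partials, num_reducers):
    """Route every (key, partial_sum) pair to a reducer chosen by key hash."""
    buckets = [[] for _ in range(num_reducers)]
    for partial in partials:
        for key, value in partial.items():
            buckets[hash(key) % num_reducers].append((key, value))
    return buckets


def group_by_sum(partitions, num_reducers=2):
    """Stage 1: local pre-aggregation. Stage 2: shuffle and final merge."""
    partials = [partial_aggregate(p) for p in partitions]
    result = {}
    for bucket in shuffle(partials, num_reducers):
        # A key lands in exactly one bucket, so updates never collide.
        result.update(partial_aggregate(bucket))
    return result


# Two hypothetical partitions of (key, value) rows.
partitions = [[("a", 1), ("b", 2)], [("a", 3), ("c", 4)]]
totals = group_by_sum(partitions)
print(totals)
```

Pre-aggregating before the shuffle is the design choice worth noticing: only one partial sum per key per partition crosses the network instead of every row.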

📡 Week 6: Real-Time Data Streaming

  • Getting started with Kafka
  • Using Kafka Streams and KSQL
  • Managing data schemas with Avro

🎓 Final Project

A final end-to-end project applying concepts learned throughout the course. Coming soon!
