pinei/data-engineer-roadmap

data-engineer-roadmap

Fundamentals

Programming

  • Programming Languages
  • Libraries
  • Notebooks

Relational Database

  • Database concepts
  • Data manipulation language (DML)
  • Data definition language (DDL)
  • Database objects
  • Data storage
  • Partitioning
  • Database security

Non-relational databases

  • Document
  • Key-value
  • Graph
  • Column family

Big Data

Data Formats

Data Modeling

  • 3NF (Third Normal Form)
  • Dimensional Model (Star Schema)
  • Data Vault

Data Warehouse

  • Architecture
  • ETL
  • Data Mart

Storage

  • Concepts
  • Object vs File vs Block
  • Data Lake
  • Cloud storage

Data Lakehouse

  • Features
  • Cloud Solutions

Analytics

  • Self-service BI

Data Processing

  • Batch Processing
  • Real-time Processing
  • Online Processing
  • Multiprocessing
  • Time-sharing

Messaging

  • Streaming
  • Queue

Monitoring

  • IoT
  • Pipelines
  • Golden Signals

CI/CD

  • Git
  • DevOps

Data Governance

  • Catalog
  • Data Quality
