- Confetti data engineering curriculum
- data-engineer-interview-questions-python
- BigQuery for data warehouse practitioners
- Cost-Efficient Open Source Big Data Platform at Uber
- Building a streaming video analytics pipeline
- Data Warehouse Architecture
- Data Mart
- Data mart oracle
- Database Management System (DBMS)
- Data Warehouse Design: A Comprehensive Guide
- WHAT IS AN ENTERPRISE DATA WAREHOUSE?
- Data Warehouse Implementation
- Running a Data Warehouse on PostgreSQL
- Smart analytics reference patterns
- Building production-ready data pipelines using Dataflow: Overview
- What is BigQuery ML?
- Migrating data warehouses to BigQuery: Introduction and overview
- A guide to data warehousing clickstream data
- Batch is a special case of streaming
- How to Build a Data Warehouse Using PostgreSQL in Python?
- Using Python and MySQL in the ETL Process: Using Python and SQLAlchemy
- Data Engineering and Its Main Concepts: Explaining the Data Pipeline, Data Warehouse, and Data Engineer Role
- Using Chunksize in Pandas
- Database design basics
- First Steps With PySpark and Big Data Processing
- 17 Strategies for Dealing with Data, Big Data, and Even Bigger Data
- A Beginner’s Guide to Data Engineering — Part I
- Writing data to production is a contract that isn't free!
- How to data model correctly: Kimball vs One Big Table
- Master Apache Airflow: How to Install and Setup the Environment in 10 Minutes
- Master Apache Airflow: Write Your First DAG With Python in Minutes
- Passing Data Between Airflow Tasks
- Astronomer - Airflow Guides
- Creating a Spark Standalone Cluster with Docker and docker-compose(2021 update)
- Jupyter docker apache spark
- PySpark Read and Write Parquet File
- Big Data Analyses with Machine Learning and PySpark
- Hadoop Ecosystem
- Spark Fundamentals
- Hadoop Fundamentals
- Big Data Fundamentals
- Scala Programming for Data Science