bhuiyanmobasshir94/Data-Engineering

Data-Engineering

  1. Confetti data engineering curriculum
  2. data-engineer-interview-questions-python
  3. BigQuery for data warehouse practitioners
  4. Cost-Efficient Open Source Big Data Platform at Uber
  5. Building a streaming video analytics pipeline
  6. Data Warehouse Architecture
  7. Data Mart
  8. Data mart oracle
  9. Database Management System (DBMS)
  10. Data Warehouse Design: A Comprehensive Guide
  11. WHAT IS AN ENTERPRISE DATA WAREHOUSE?
  12. Data Warehouse Implementation
  13. Running a Data Warehouse on PostgreSQL
  14. Smart analytics reference patterns
  15. Building production-ready data pipelines using Dataflow: Overview
  16. What is BigQuery ML?
  17. Migrating data warehouses to BigQuery: Introduction and overview
  18. A guide to data warehousing clickstream data
  19. Batch is a special case of streaming
  20. How to Build a Data Warehouse Using PostgreSQL in Python?
  21. Using Python and MySQL in the ETL Process: Using Python and SQLAlchemy
  22. Data Engineering and Its Main Concepts: Explaining the Data Pipeline, Data Warehouse, and Data Engineer Role
  23. Using Chunksize in Pandas
  24. Database design basics
  25. First Steps With PySpark and Big Data Processing
  26. 17 Strategies for Dealing with Data, Big Data, and Even Bigger Data
  27. A Beginner’s Guide to Data Engineering — Part I
  28. Writing data to production is a contract that isn't free!
  29. How to data model correctly: Kimball vs One Big Table

Postgres

  1. Fastest Way to Load Data Into PostgreSQL Using Python

Apache Airflow

  1. Master Apache Airflow: How to Install and Setup the Environment in 10 Minutes
  2. Master Apache Airflow: Write Your First DAG With Python in Minutes
  3. Passing Data Between Airflow Tasks
  4. Astronomer - Airflow Guides

Apache Spark

  1. Creating a Spark Standalone Cluster with Docker and docker-compose (2021 update)
  2. Jupyter docker apache spark
  3. PySpark Read and Write Parquet File
  4. Big Data Analyses with Machine Learning and PySpark

Hadoop Ecosystem

  1. Hadoop Ecosystem
  2. Spark Fundamentals
  3. Hadoop Fundamentals
  4. Big Data Fundamentals
  5. Scala Programming for Data Science

Curated Article

Courses

  1. Data Engineering

Guideline

  1. 10 Steps to get a Data Engineering job

Dashboard

  1. Grafana

Database

  1. Database Structure and Design Tutorial
  2. 4 Query Optimizer Concepts

Data Catalog

  1. What is a data catalog
  2. What is Data Catalog?

Data Repository

  1. Database of Databases

Blogs

  1. Adnan's blog

Apache Arrow

  1. How Query Engines Work
