One framework to develop, deploy and operate data workflows with Python and SQL.
-
Updated
Jun 20, 2024 - Python
One framework to develop, deploy and operate data workflows with Python and SQL.
Crawls sites, to find new content and scrap it
😎 A curated list of awesome DataOps tools
A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.
datacamp Data Engineer with Python course. 73 hours/ 19 Courses /2 Skill Assessments
Code, Examples, Templates and Scripts for DataWorksSummit 2017 Sydney Talk
A data engineering platform for maintaining a data ecosystem to support self-driving cars research.
Wraps the DB by opening a REST API for storing and retrieving documents info & recommendations
Udacity Data Engineer Nanodegree: Project Data Lake
Example project implementing best practices and testing for PySpark data pipelines.
Students will build an ETL pipeline that extracts data from S3, stages them in Redshift, and transforms data into a set of dimensional tables for their analytics team.
Udacity - Data Engineering Nanodegree (Project 1)
Full code for UDACITY's Data Engineer Nano Degree project. Build a Data Warehouse in AWS with Amazon Redshift.
ETL to move data from MySQL into BigQuery using Airflow
IGTI MBA Engenharida de dados - Bootcamp Engenheiro de Dados Cloud - Desafio final
End-to-end data engineering processes for the NIGERIA Health Facility Registry (HFR). The project leveraged Selenium, Pandas, PySpark, PostgreSQL and Airflow
Data Engineering Project with Hadoop HDFS and Kafka
Automating pipelines using airflow operators
Add a description, image, and links to the data-engineer topic page so that developers can more easily learn about it.
To associate your repository with the data-engineer topic, visit your repo's landing page and select "manage topics."