End-to-end data engineering project to get insights from PyPI using Python and DuckDB
Updated Jun 8, 2024 - Python
This course is designed to provide learners with the fundamental skills needed for data engineering using Python. The objective is to introduce anyone interested in the topic to Python's data engineering-related features.
OpenMetadata is a unified platform for discovery, observability, and governance powered by a central metadata repository, in-depth lineage, and seamless team collaboration.
Efficient data transformation and modeling framework that is backwards compatible with dbt.
Web application created with Evidence and DuckDB to share stats about the running races in Cuenca.
The developer framework for your data & analytics stack
A free-to-use dbt package for creating and loading Data Vault 2.0 compliant data warehouses (powered by dbt, an open source data engineering tool and a registered trademark of dbt Labs)
Mini project developed for the Non-Relational Databases course in the Data Science and Machine Learning graduate program at PUC Campinas.
This project works with a comprehensive football dataset covering the top 5 leagues in Europe from 2014 to 2020. I worked on data extraction, data cleaning and manipulation, data modelling, and data loading.
Apache Beam demo projects
Azure cloud projects
Azure Data Engineering and Machine Learning: Helper Functions and Code
This space showcases my work with data projects and visualisations.
This repository contains code and configuration files for an Extract, Transform, Load (ETL) project using Google Cloud Data Fusion for data extraction, Apache Airflow/Composer for orchestration, and Google BigQuery for data loading.
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
A dbt data pipeline capstone project.
An open source development framework to help you build data workflows and modern data architecture on AWS.