An Apache Airflow data pipeline is designed to perform ELT operations, utilizing Amazon S3 and Amazon Redshift Serverless.
-
Updated
Jun 27, 2024 - Python
An Apache Airflow data pipeline is designed to perform ELT operations, utilizing Amazon S3 and Amazon Redshift Serverless.
Data Engineering framework written in Python based in Polars.
This project performs analysis on a loan portfolio dataset to forecast cash flows and evaluate portfolio present value. It includes code written in Python using the pandas and NumPy libraries.
A data pipeline that conducts ETL processes to AWS Redshift, utilizing Spark and coordinated by Apache Airflow.
This repository contains application code for the Wizeline Data Engineering Bootcamp (DEB) 2023. It is one of two repositories for the DEB. The other houses the infrastructure code.
Dataset Boston Housing Price prediction
Comparison of various Machine Learning algorithms for Heart Diseases (Heart Attack) prediction.
This projec about reseveation and manage Hotel
This project utilizes an ETL process of moving data files from Amazon S3 storage to a staging area in Redshift, transforming the data, and loading the data into a designed relational data model meant for easy, ad-hoc analysis of data.
Implemented SVC on the Olivetti dataset to predict if a person is wearing glasses or not by using cross-validation techniques in depth.
A microservice that automates the scanning of databases and subsequent creation of inbound pipes with namespaced identifiers.
A Sesam wrangler for automating integrations
Data Warehouse project on corona data. Analytics is done on country level, region level, and sub_region_level and visualized using python matplotlib.
Data modelling with DynamoDB (Target stores)
Built a data model, data warehouse and pipeline for extracting transforming and loading data into a star schema-based data model in a redshift database
Examples of using Aerospike's complex data types (Map and List) to implement common patterns
Companion code for Aerospike Modeling: User Profile Store
Add a description, image, and links to the datamodeling topic page so that developers can more easily learn about it.
To associate your repository with the datamodeling topic, visit your repo's landing page and select "manage topics."