Data Modeling using CassandraQL for a Music Industry Project (Sparkify). Course Project by Udactiy Data Engineering Nano Degree.
-
Updated
May 23, 2020 - Python
Data Modeling using CassandraQL for a Music Industry Project (Sparkify). Course Project by Udactiy Data Engineering Nano Degree.
A data pipeline that conducts ETL processes to AWS Redshift, utilizing Spark and coordinated by Apache Airflow.
A program to create RDF and RDF* knowledge graphs from the clinical database MIMIC-III, where a hospital layout has been created and patients move within the hospital.
This repo gives you insights on the data engineering project I built using public datasets from San Francisco open data portal.
Explore Linear Regression with Gradient Descent, Stochastic Gradient Descent, and Ridge Regression. Uncover algorithmic insights in data modeling. 📊🎶🚀
A program to create RDF and RDF* knowledge graphs from the ouput of a simulation model of patients movements inside a hospital. Since the simulation model has an incomplete hospital layout, HospitalGeneratorRDF also creates a complete hospital layout (rooms, corridors, areas, floors, etc.).
AtmoFlow is a robust data engineering pipeline built on Google Cloud Platform (GCP) that processes and analyzes weather and air quality data in both batch and streaming modes
Abstract Data Structures over neo4j
Project for IBM Data Engineering & Python course on ETL & Big Data -- Read in live toll booth data, wrangles and transformed, and wrote into a SQL database
A MySQL database schema and data generation tool for a social platform, created as an educational project at SEGI University. Features Malaysian localization and automated test data generation.
Titanic Dataset Survival Prediction by using Machine Learning Classifiers
All about Data science
These include some personal and school projects I created myself.
Discrete Multivariate Modeling Simulator
EDA, Visualisation And Modeling App Using Streamlit
This project aims to predict product sales based on advertising expenditures, focusing on 'TV advertising'. Machine learning techniques are employed to analyze and interpret data, enabling businesses to optimize advertising strategies and maximize sales potential.
Add a description, image, and links to the data-modeling topic page so that developers can more easily learn about it.
To associate your repository with the data-modeling topic, visit your repo's landing page and select "manage topics."