YouTube Data Harvesting and Warehousing using SQL, MongoDB and Streamlit
-
Updated
Nov 8, 2023 - Python
YouTube Data Harvesting and Warehousing using SQL, MongoDB and Streamlit
Date dimension based on Iranian calendar
ETLing Data to a Cloud Data Warehouse
Implementation ETL with Python for data integration workflows.
⚡ Automatically produce a data model on your database using its information schema using GenAI.
Check daily covid information
Space Exploration Data Fusion : Unleashing the International Space Station Insights with MongoDB and SQL Integration
This project goal is to design a Data Platform for retail Data Analytics.
Implements a support vector machine with an accompanying algorithm to manipulate text data in order to fool a target classifier.
DataWarehousing on Chicago food database.
How to manage SCD2 with Apache Hive 1.1 and HBase 1.2 w/o HiveQL UPDATE operation
Batch ETL pipeline project on GCP to load and transform daily flight data using Spark to update tables in BigQuery. The pipeline is automated using Airflow.
Generate DDL and Python (PygramETL) code from shared specification
This project enables users to fetch data from YouTube by utilizing the YouTube Data API key. The retrieved data is then stored in a MySQL database. Subsequently, the stored data is analyzed and presented in a Streamlit web application using Pandas DataFrame.
Banking Data Warehouse Pipeline
"PostgresBlend Data Pipeline" is a comprehensive data integration solution designed to seamlessly merge diverse data sources into a unified PostgreSQL Data Warehouse. This project streamlines the process of integrating data from CSVs, JSON, Parquet, and MySQL databases, utilizing Apache Spark for efficient transformation and organization.
Creating a database management system that takes in SQL statements and generates custom databases, tables, views, e.t.c
Add a description, image, and links to the datawarehousing topic page so that developers can more easily learn about it.
To associate your repository with the datawarehousing topic, visit your repo's landing page and select "manage topics."