A Jupyter notebook documentation of an ETL (extract -> transform -> load) data pipeline
Updated Mar 16, 2024 - HTML
Jupyter Notebook demonstrating an ETL (Extract, Transform, Load) pipeline for bank market capitalization data.
Data Modeling With Postgres for Udacity's Data Engineering Program. Using Python in Jupyter Notebook.
Repository containing the notebooks used on classes and projects done from the Udacity Data Engineer Nanodegree.
An ETL project in a Jupyter notebook that filters and analyzes app reviews from the Play Store using NLP.
Extract, Transform, and Load (ETL) to create a pipeline on movie datasets using PostgreSQL, Python, Pandas, and Jupyter Notebook.
Created a data pipeline from movie datasets using Python, Pandas, Jupyter Notebook, and PostgreSQL, implementing the Extract, Transform, Load (ETL) process.
Performed the Extract, Transform and Load (ETL) process to create a data pipeline on movie datasets using Python, Pandas, Jupyter Notebook and PostgreSQL.
Perform the Extract, Transform and Load (ETL) process to create a data pipeline on movie datasets using Python, Pandas, Jupyter Notebook and PostgreSQL.
Used Pandas to extract movie data from Kaggle and web scraping, clean the data in a Jupyter notebook, and load it into PostgreSQL via pgAdmin.
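The movie-dataset projects above all follow the same three steps: extract raw records, clean and normalize them with Pandas, and load them into PostgreSQL. A minimal sketch of that flow, with an inline DataFrame standing in for the Kaggle/scraped extract and SQLite standing in for PostgreSQL so the example is self-contained (a real pipeline would pass a SQLAlchemy `create_engine("postgresql://...")` connection to `to_sql` instead):

```python
import sqlite3

import pandas as pd

# Extract: the projects pull Kaggle CSVs and scraped Wikipedia data;
# a small inline frame stands in for that source here.
raw = pd.DataFrame({
    "title": ["The Matrix", "Heat", None],
    "release_year": ["1999", "1995", "2001"],
    "budget": ["$63,000,000", "60000000", None],
})

# Transform: drop rows without a title, normalize types and currency strings.
movies = raw.dropna(subset=["title"]).copy()
movies["release_year"] = movies["release_year"].astype(int)
movies["budget"] = (
    movies["budget"].str.replace(r"[$,]", "", regex=True).astype(float)
)

# Load: SQLite keeps this sketch runnable without a database server;
# swap the connection for a PostgreSQL SQLAlchemy engine in practice.
with sqlite3.connect(":memory:") as conn:
    movies.to_sql("movies", conn, index=False, if_exists="replace")
    loaded_rows = pd.read_sql("SELECT COUNT(*) AS n FROM movies", conn)["n"][0]
```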
In this project, ETL and analysis are performed on Amazon sales data in a notebook and Tableau. The raw data consisted of five files, which were transformed into one Excel file.
Databricks ETL Pipeline for retrieving and processing NI TestStand test results, featuring a well-documented notebook for ETL operations, Data Lake for storage, Spark SQL+Python for transformations, and Power BI as the final visualization of factory metrics.
This project focuses on cleaning traffic volume data using Python, Jupyter Notebook, Pandas, and NumPy. The goal is to preprocess the raw data and convert it into a clean CSV/JSON format for further analysis and visualization.
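The cleaning step that project describes — preprocess raw records with Pandas, then write clean CSV/JSON for downstream analysis — can be sketched as follows. The column names and sample values are hypothetical, not taken from the project's actual dataset:

```python
import pandas as pd

# Hypothetical raw traffic-volume records showing the usual problems:
# missing readings and string-typed numeric columns.
raw = pd.DataFrame({
    "timestamp": ["2024-01-01 00:00", "2024-01-01 01:00", "2024-01-01 01:00"],
    "volume": ["120", None, "98"],
})

# Clean: drop missing readings, deduplicate timestamps, fix dtypes.
clean = (
    raw.dropna(subset=["volume"])
       .drop_duplicates(subset=["timestamp"], keep="first")
       .assign(
           timestamp=lambda d: pd.to_datetime(d["timestamp"]),
           volume=lambda d: d["volume"].astype(int),
       )
)

# Export to both target formats for further analysis and visualization.
csv_text = clean.to_csv(index=False)
json_text = clean.to_json(orient="records", date_format="iso")
```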
Sentiment Analysis project that focuses on classifying the interactions of customers with support agents from different brands on X (formerly Twitter). The project is developed starting from an ETL process through advanced NLP techniques and ML models for classification, written in Python leveraging Jupyter Notebooks.
Crowd-Quest: ETL Journey for Crowdfunding Data is a repository showcasing the ETL (Extract, Transform, Load) process. It involves extracting data from Excel files, transforming it into CSV format, designing an ERD and database schema, and loading the data into PostgreSQL. Tools used: Jupyter Notebook, VSCode, PostgreSQL, Quick DBD, Excel.
Data Modeling With Apache Cassandra for Udacity's Data Engineering Program. Using Python in Jupyter Notebook.
Google Colaboratory notebooks that design an ETL pipeline for Amazon music reviews, connect to an AWS-hosted PostgreSQL database, and analyze how the ratio of five-star reviews relates to participation in the Vine program.
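The analysis step in that project boils down to a grouped ratio: the share of five-star reviews for Vine versus non-Vine reviewers. A sketch with a hypothetical review sample in place of the AWS-hosted data:

```python
import pandas as pd

# Hypothetical review sample; the real project queries Amazon music
# reviews from a PostgreSQL database on AWS.
reviews = pd.DataFrame({
    "star_rating": [5, 4, 5, 1, 5, 3],
    "vine": ["Y", "Y", "N", "N", "N", "Y"],
})

# Share of five-star reviews, split by Vine participation.
five_star_ratio = (
    reviews.assign(is_five=reviews["star_rating"].eq(5))
           .groupby("vine")["is_five"]
           .mean()
)
```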