Skip to content

shindayou/tiny-etl-pipeline-python

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Tiny ETL Pipeline (Beginner Python Project)

🔧 Overview

This beginner-level ETL project demonstrates how to extract data from a CSV file, transform it using Python and Pandas, and load the cleaned output into a new file.

🛠️ Tools Used

  • Python
  • Pandas
  • Jupyter Notebooks
  • GitHub for version control

📁 Folder Structure

/data         → raw input CSV files
/output       → cleaned/transformed CSV files
/scripts      → Python ETL scripts
/notebooks    → Jupyter notebooks for exploration

📌 Key Steps

  1. Extract: Load raw CSV data
  2. Transform: Clean, filter, and reshape the data
  3. Load: Save the cleaned dataset to /output
  4. Document the process in a notebook

✅ Results

  • A simple, reproducible ETL pipeline
  • Cleaned dataset ready for analysis

🚀 Next Steps

  • Add logging
  • Add error handling
  • Turn the ETL script into a scheduled automation

About

Beginner ETL pipeline built in Python that extacts a CSV, transforms it, and load the cleaned output.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors