
gryAI/End-to-End-Data-Project


End-to-End Data Project

Instructions

  • Clone this repository
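  • Create a Python virtual environment: python -m venv env
  • Activate the environment: .\env\Scripts\activate on Windows (PowerShell or cmd), or source env/bin/activate if the environment was created in WSL or Linux
  • Install packages: pip install -r requirements.txt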

Overview

Step 1: Set up an FTP Server using WSL
Step 2: Download .csv files from the web using Python
Step 3: Upload downloaded .csv files to the FTP Server either manually or based on a schedule
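
Steps 2 and 3 can be sketched with the Python standard library alone. This is a minimal illustration, not the code in web_to_ftp_app.py: the function names download_csv, upload_csv and run_once, and the host, user and password parameters, are assumptions made for the example.

```python
import shutil
import urllib.request
from ftplib import FTP
from pathlib import Path


def download_csv(url: str, dest_dir: Path) -> Path:
    """Download one file and return the local path it was saved to."""
    dest_dir.mkdir(parents=True, exist_ok=True)
    # Name the local file after the last segment of the URL.
    dest = dest_dir / url.rsplit("/", 1)[-1]
    with urllib.request.urlopen(url) as response, open(dest, "wb") as out:
        shutil.copyfileobj(response, out)
    return dest


def upload_csv(ftp: FTP, local_path: Path) -> None:
    """Upload one local file into the FTP server's current directory."""
    with open(local_path, "rb") as fh:
        ftp.storbinary(f"STOR {local_path.name}", fh)


def run_once(urls, dest_dir: Path, host: str, user: str, password: str) -> None:
    """Download every URL, then upload each file over a single FTP session."""
    # host, user and password are hypothetical; use the values of the WSL FTP server.
    with FTP(host) as ftp:
        ftp.login(user=user, passwd=password)
        for url in urls:
            upload_csv(ftp, download_csv(url, dest_dir))
```

Keeping the download and upload steps in separate functions means each one can be tried on its own before the FTP server is available.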

Running the app:

  • Run wsl
  • Manually: python web_to_ftp_app.py manual
  • Scheduled: python web_to_ftp_app.py scheduled
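
One way the two modes above could be wired up is sketched below. It assumes a plain loop with a fixed interval; the real app may schedule differently (for example with a scheduling library), and parse_mode, run, interval_seconds and max_runs are hypothetical names used only for illustration.

```python
import argparse
import time


def parse_mode(argv):
    """Read the single positional argument: "manual" or "scheduled"."""
    parser = argparse.ArgumentParser(prog="web_to_ftp_app.py")
    parser.add_argument("mode", choices=["manual", "scheduled"])
    return parser.parse_args(argv).mode


def run(mode, job, interval_seconds=86400, max_runs=None, sleep=time.sleep):
    """Run job once in manual mode, or repeatedly in scheduled mode.

    max_runs and sleep exist so the loop can be stopped and tested;
    both are assumptions of this sketch. Returns the number of runs.
    """
    runs = 0
    while True:
        job()
        runs += 1
        if mode == "manual" or (max_runs is not None and runs >= max_runs):
            return runs
        # Wait before the next scheduled run.
        sleep(interval_seconds)
```

In a script, the entry point would then be something like run(parse_mode(sys.argv[1:]), job), where job performs the download and upload.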



Step 1: Create a pipeline in SSIS to:

  • Download files from FTP Server,
  • Load them into PostgreSQL, and
  • Merge the tables in PostgreSQL
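
The merge step is built in SSIS in this project. For readers who want to see the idea in code, the sketch below expresses a comparable merge as a PostgreSQL upsert (INSERT ... ON CONFLICT) generated from Python. It is a swapped-in technique, not the SSIS data flow itself, and the table and column names in the usage note are hypothetical.

```python
def build_upsert_sql(target, staging, key, columns):
    """Build a statement that merges rows from a staging table into a target.

    Rows whose key already exists in the target are updated; new rows are
    inserted. The key column needs a unique constraint in the target table.
    Identifiers are interpolated directly, so pass only trusted names.
    """
    col_list = ", ".join([key, *columns])
    updates = ", ".join(f"{c} = EXCLUDED.{c}" for c in columns)
    return (
        f"INSERT INTO {target} ({col_list}) "
        f"SELECT {col_list} FROM {staging} "
        f"ON CONFLICT ({key}) DO UPDATE SET {updates}"
    )
```

For example, build_upsert_sql("sdn", "sdn_staging", "ent_num", ["name", "program"]) returns a single statement that could be executed with any PostgreSQL client after the staging table has been loaded.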

[Image: Entire Pipeline]

[Image: Merge OFAC Table Data Flow]


Step 2: Deploy the Pipeline in SQL Server Agent and create an automated job
